Michael Shieh

According to our database, Michael Shieh authored at least 36 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents.

…, Michael Qizhe Shieh

CoRR, April, 2026

Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows.

…, Michael Qizhe Shieh, Alvaro A. Cárdenas, …

CoRR, April, 2026

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw.

…, Michael Qizhe Shieh, …

CoRR, April, 2026

Gym-V: A Unified Vision Environment System for Agentic Vision Research.

…, Michael Qizhe Shieh

CoRR, March, 2026

In-Context Reinforcement Learning for Tool Use in Large Language Models.

…, Kenji Kawaguchi, …, Michael Qizhe Shieh

CoRR, March, 2026

ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning.

…, Michael Qizhe Shieh, …

CoRR, March, 2026

LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards.

…, Michael Qizhe Shieh, …

CoRR, March, 2026

Gradually Compacting Large Language Models for Reasoning Like a Boiling Frog.

…, Kenji Kawaguchi, …, Michael Qizhe Shieh

CoRR, February, 2026

MM-Eureka: Toward Stable Multimodal Reasoning via Rule-based Reinforcement Learning with Policy Drift Control.

…, Michael Qizhe Shieh, Qiaosheng Zhang, …

Trans. Mach. Learn. Res., 2026

2025

Diffusion Language Models are Super Data Learners.

…, Michael Qizhe Shieh

CoRR, November, 2025

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning.

…, Huichen Will Wang, …, Michael Qizhe Shieh, …

CoRR, October, 2025

Training Optimal Large Diffusion Language Models.

…, Michael Qizhe Shieh

CoRR, October, 2025

GEM: A Gym for Agentic LLMs.

CoRR, October, 2025

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use.

…, Michael Qizhe Shieh

CoRR, September, 2025

The Emergence of Abstract Thought in Large Language Models Beyond Any Language.

…, Kenji Kawaguchi, …, Michael Qizhe Shieh, …

CoRR, June, 2025

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis.

…, Michael Qizhe Shieh

CoRR, June, 2025

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation.

…, Michael Qizhe Shieh

CoRR, April, 2025

Efficient Process Reward Model Training via Active Learning.

…, Michael Qizhe Shieh, …

CoRR, April, 2025

Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction.

…, Michael Qizhe Shieh

CoRR, March, 2025

Long-Context Inference with Retrieval-Augmented Speculative Decoding.

…, Michael Qizhe Shieh

CoRR, February, 2025

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases.

…, Michael Qizhe Shieh, …

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Unnatural Languages Are Not Bugs but Features for LLMs.

…, Kenji Kawaguchi, …, Michael Qizhe Shieh

Proceedings of the Forty-second International Conference on Machine Learning, 2025

RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding.

…, Michael Qizhe Shieh

Proceedings of the Forty-second International Conference on Machine Learning, 2025

MixEval-X: Any-to-any Evaluations from Real-world Data Mixture.

…, Deepanway Ghosal, …, David Junhao Zhang, …

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron.

…, Kenji Kawaguchi, …

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Single Character Perturbations Break LLM Alignment.

…, Kenji Kawaguchi, …

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures.

…, Deepanway Ghosal, …, David Junhao Zhang, …

CoRR, 2024

Self-Evaluation as a Defense Against Adversarial Attacks on LLMs.

…, Kenji Kawaguchi, …

CoRR, 2024

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning.

…, Timothy P. Lillicrap, Kenji Kawaguchi, …

CoRR, 2024

Accelerating Greedy Coordinate Gradient via Probe Sampling.

…, Kenji Kawaguchi, …

CoRR, 2024

Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling.

…, Kenji Kawaguchi, …, Michael Qizhe Shieh

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Reasoning Robustness of LLMs to Adversarial Typographical Errors.

…, Kenji Kawaguchi, …

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Prompt Optimization via Adversarial In-Context Learning.

…, Kenji Kawaguchi, …

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

InstructCoder: Instruction Tuning Large Language Models for Code Editing.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2024
