Michael Shieh

According to our database1, Michael Shieh authored at least 36 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents.
CoRR, April, 2026

Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows.
CoRR, April, 2026

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw.
CoRR, April, 2026

Gym-V: A Unified Vision Environment System for Agentic Vision Research.
CoRR, March, 2026

In-Context Reinforcement Learning for Tool Use in Large Language Models.
CoRR, March, 2026

ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning.
CoRR, March, 2026

LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards.
CoRR, March, 2026

Gradually Compacting Large Language Models for Reasoning Like a Boiling Frog.
CoRR, February, 2026

MM-Eureka: Toward Stable Multimodal Reasoning via Rule-based Reinforcement Learning with Policy Drift Control.
Trans. Mach. Learn. Res., 2026

2025
Diffusion Language Models are Super Data Learners.
CoRR, November, 2025

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning.
CoRR, October, 2025

Training Optimal Large Diffusion Language Models.
CoRR, October, 2025

GEM: A Gym for Agentic LLMs.
CoRR, October, 2025

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use.
CoRR, September, 2025

The Emergence of Abstract Thought in Large Language Models Beyond Any Language.
CoRR, June, 2025

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis.
CoRR, June, 2025

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation.
CoRR, April, 2025

Efficient Process Reward Model Training via Active Learning.
CoRR, April, 2025

Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction.
CoRR, March, 2025

Long-Context Inference with Retrieval-Augmented Speculative Decoding.
CoRR, February, 2025

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Unnatural Languages Are Not Bugs but Features for LLMs.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MixEval-X: Any-to-any Evaluations from Real-world Data Mixture.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Single Character Perturbations Break LLM Alignment.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures.
CoRR, 2024

Self-Evaluation as a Defense Against Adversarial Attacks on LLMs.
CoRR, 2024

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning.
CoRR, 2024

Accelerating Greedy Coordinate Gradient via Probe Sampling.
CoRR, 2024

Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Reasoning Robustness of LLMs to Adversarial Typographical Errors.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Prompt Optimization via Adversarial In-Context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

InstructCoder: Instruction Tuning Large Language Models for Code Editing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2024


  Loading...