Cheng Qian

Orcid: 0000-0001-9913-820X

Affiliations:

University of Illinois Urbana-Champaign, Champaign, IL, USA
Tsinghua University, Department of Computer Science and Technology, Beijing, China (former)

According to our database¹, Cheng Qian authored at least 69 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind.

[BibT_eX]

[DOI]

Cheng Qian

Jiayu Liu

Heng Ji

CoRR, May, 2026

Advancing Creative Physical Intelligence in Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

Code as Agent Harness.

[BibT_eX]

[DOI]

CoRR, May, 2026

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing.

[BibT_eX]

[DOI]

CoRR, May, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe.

[BibT_eX]

[DOI]

CoRR, April, 2026

How Far Can Unsupervised RLVR Scale LLM Training?

[BibT_eX]

[DOI]

CoRR, March, 2026

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data.

[BibT_eX]

[DOI]

CoRR, February, 2026

Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs.

[BibT_eX]

[DOI]

CoRR, February, 2026

[BibT_eX]

[DOI]

CoRR, February, 2026

Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction.

[BibT_eX]

[DOI]

CoRR, January, 2026

Agentic Reasoning for Large Language Models.

[BibT_eX]

[DOI]

CoRR, January, 2026

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

WiNELL: Wikipedia Never-Ending Updating with LLM Agents.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

Current Agents Fail to Leverage World Model as Tool for Foresight.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

From Word to World: Can Large Language Models be Implicit Text-based World Models?

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

ShortageSim: Simulating Drug Shortages Under Information Asymmetry.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

From Word to World: Can Large Language Models be Implicit Text-based World Models?

[BibT_eX]

[DOI]

CoRR, December, 2025

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe.

[BibT_eX]

[DOI]

CoRR, December, 2025

Geometric-Disentangelment Unlearning.

[BibT_eX]

[DOI]

CoRR, November, 2025

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering.

[BibT_eX]

[DOI]

CoRR, November, 2025

xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Self-Improving LLM Agents at Test-Time.

[BibT_eX]

[DOI]

CoRR, October, 2025

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, October, 2025

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering.

[BibT_eX]

[DOI]

CoRR, September, 2025

Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts.

[BibT_eX]

[DOI]

CoRR, September, 2025

UserBench: An Interactive Gym Environment for User-Centric Agents.

[BibT_eX]

[DOI]

CoRR, July, 2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence.

[BibT_eX]

[DOI]

CoRR, July, 2025

Atomic Reasoning for Scientific Table Claim Verification.

[BibT_eX]

[DOI]

CoRR, June, 2025

Toward a Theory of Agents as Tool-Use Decision-Makers.

[BibT_eX]

[DOI]

CoRR, June, 2025

RM-R1: Reward Modeling as Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Tool Learning with Foundation Models.

[BibT_eX]

[DOI]

ACM Comput. Surv., April, 2025

A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions.

[BibT_eX]

[DOI]

CoRR, April, 2025

OTC: Optimal Tool Calls via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization.

[BibT_eX]

[DOI]

CoRR, April, 2025

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset.

[BibT_eX]

[DOI]

CoRR, April, 2025

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents.

[BibT_eX]

[DOI]

CoRR, March, 2025

Internal Activation as the Polar Star for Steering Unsafe LLM Behavior.

[BibT_eX]

[DOI]

CoRR, February, 2025

ToolRL: Reward is All Tool Learning Needs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents.

[BibT_eX]

[DOI]

Teja Venkat Koripella

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ISACL: Internal State Analyzer for Copyrighted Training Data Leakage.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Rescorla-Wagner Steering of LLMs for Undesired Behaviors over Disproportionate Inappropriate Context.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

DecisionFlow: Advancing Large Language Model as Principled Decision Maker.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Aligning LLMs with Individual Preferences via Interaction.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SMART: Self-Aware Agent for Tool Overuse Mitigation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

The Law of Knowledge Overshadowing: Towards Understanding, Predicting and Preventing LLM Hallucination.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Open-Domain Task-Solving Capability of LLMs via Autonomous Tool Integration from GitHub.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

EscapeBench: Pushing Language Models to Think Outside the Box.

[BibT_eX]

[DOI]

CoRR, 2024

Aligning LLMs with Individual Preferences via Interaction.

[BibT_eX]

[DOI]

CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.

[BibT_eX]

[DOI]

CoRR, 2024

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution.

[BibT_eX]

[DOI]

CoRR, 2024

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents.

[BibT_eX]

[DOI]