Cheng Qian

Orcid: 0000-0001-9913-820X

Affiliations:
  • University of Illinois Urbana-Champaign, Champaign, IL, USA
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China (former)


According to our database1, Cheng Qian authored at least 62 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe.
CoRR, April, 2026

How Far Can Unsupervised RLVR Scale LLM Training?
CoRR, March, 2026

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data.
CoRR, February, 2026

Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs.
CoRR, February, 2026

Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks.
CoRR, February, 2026

Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction.
CoRR, January, 2026

Agentic Reasoning for Large Language Models.
CoRR, January, 2026

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning.
CoRR, January, 2026

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence.
Trans. Mach. Learn. Res., 2026

WiNELL: Wikipedia Never-Ending Updating with LLM Agents.
Proceedings of the ACM Web Conference 2026, 2026

ShortageSim: Simulating Drug Shortages Under Information Asymmetry.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
From Word to World: Can Large Language Models be Implicit Text-based World Models?
CoRR, December, 2025

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe.
CoRR, December, 2025

Geometric-Disentangelment Unlearning.
CoRR, November, 2025

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering.
CoRR, November, 2025

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents.
CoRR, November, 2025

xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning.
CoRR, October, 2025

Self-Improving LLM Agents at Test-Time.
CoRR, October, 2025

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning.
CoRR, October, 2025

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning.
CoRR, September, 2025

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering.
CoRR, September, 2025

Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts.
CoRR, September, 2025

UserBench: An Interactive Gym Environment for User-Centric Agents.
CoRR, July, 2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence.
CoRR, July, 2025

Atomic Reasoning for Scientific Table Claim Verification.
CoRR, June, 2025

Toward a Theory of Agents as Tool-Use Decision-Makers.
CoRR, June, 2025

RM-R1: Reward Modeling as Reasoning.
CoRR, May, 2025

Tool Learning with Foundation Models.
ACM Comput. Surv., April, 2025

A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions.
CoRR, April, 2025

OTC: Optimal Tool Calls via Reinforcement Learning.
CoRR, April, 2025

ToolRL: Reward is All Tool Learning Needs.
CoRR, April, 2025

Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization.
CoRR, April, 2025

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset.
CoRR, April, 2025

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents.
CoRR, March, 2025

Internal Activation as the Polar Star for Steering Unsafe LLM Behavior.
CoRR, February, 2025

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ISACL: Internal State Analyzer for Copyrighted Training Data Leakage.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Rescorla-Wagner Steering of LLMs for Undesired Behaviors over Disproportionate Inappropriate Context.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

DecisionFlow: Advancing Large Language Model as Principled Decision Maker.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Aligning LLMs with Individual Preferences via Interaction.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SMART: Self-Aware Agent for Tool Overuse Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

The Law of Knowledge Overshadowing: Towards Understanding, Predicting and Preventing LLM Hallucination.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Open-Domain Task-Solving Capability of LLMs via Autonomous Tool Integration from GitHub.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
EscapeBench: Pushing Language Models to Think Outside the Box.
CoRR, 2024

Aligning LLMs with Individual Preferences via Interaction.
CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.
CoRR, 2024

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution.
CoRR, 2024

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs.
CoRR, 2023

CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation.
CoRR, 2023

Tool Learning with Foundation Models.
CoRR, 2023

CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Recyclable Tuning for Continual Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Exploring Mode Connectivity for Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022


  Loading...