Zhongyuan Peng

According to our database1, Zhongyuan Peng authored at least 13 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
KAT-V1: Kwai-AutoThink Technical Report.
CoRR, July, 2025

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization.
CoRR, July, 2025

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models.
CoRR, May, 2025

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs.
CoRR, April, 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
CoRR, February, 2025

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models.
CoRR, February, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.
CoRR, February, 2025

Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model.
CoRR, 2024

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models.
CoRR, 2024

RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.
CoRR, 2023


  Loading...