Aohan Zeng

ORCID: 0000-0002-8766-0153

According to our database, Aohan Zeng authored at least 30 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation.
CoRR, November, 2025

Data-Efficient RLVR via Off-Policy Influence Guidance.
CoRR, October, 2025

An Efficient, Reliable and Observable Collective Communication Library in Large-scale GPU Training Clusters.
CoRR, October, 2025

WebGLM: Towards an Efficient and Reliable Web-Enhanced Question-Answering System.
ACM Trans. Inf. Syst., September, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models.
CoRR, August, 2025

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
CoRR, July, 2025

Scaling Speech-Text Pre-training with Synthetic Interleaved Data.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Does RLHF Scale? Exploring the Impacts From Data, Model, and Method.
CoRR, 2024

GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot.
CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.
CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.
CoRR, 2024

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback.
CoRR, 2024

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding.
CoRR, 2024

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein.
CoRR, 2024

Understanding Emergent Abilities of Language Models from the Loss Perspective.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AgentBench: Evaluating LLMs as Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.
CoRR, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GLM-130B: An Open Bilingual Pre-trained Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
GLM-130B: An Open Bilingual Pre-trained Model.
CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

2021
FastMoE: A Fast Mixture-of-Expert Training System.
CoRR, 2021

CogDL: An Extensive Toolkit for Deep Learning on Graphs.
CoRR, 2021

