Aohan Zeng

ORCID: 0000-0002-8766-0153

According to our database, Aohan Zeng authored at least 26 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
WebGLM: Towards an Efficient and Reliable Web-Enhanced Question-Answering System.
ACM Trans. Inf. Syst., September, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models.
CoRR, August, 2025

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
CoRR, July, 2025

Scaling Speech-Text Pre-training with Synthetic Interleaved Data.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Does RLHF Scale? Exploring the Impacts From Data, Model, and Method.
CoRR, 2024

GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot.
CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.
CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.
CoRR, 2024

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback.
CoRR, 2024

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding.
CoRR, 2024

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein.
CoRR, 2024

Understanding Emergent Abilities of Language Models from the Loss Perspective.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AgentBench: Evaluating LLMs as Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.
CoRR, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GLM-130B: An Open Bilingual Pre-trained Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
GLM-130B: An Open Bilingual Pre-trained Model.
CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

2021
FastMoE: A Fast Mixture-of-Expert Training System.
CoRR, 2021

CogDL: An Extensive Toolkit for Deep Learning on Graphs.
CoRR, 2021

