Aohan Zeng

Orcid: 0000-0002-8766-0153

According to our database1, Aohan Zeng authored at least 15 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Understanding Emergent Abilities of Language Models from the Loss Perspective.
CoRR, 2024

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding.
CoRR, 2024

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein.
CoRR, 2024

2023
CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.
CoRR, 2023

AgentTuning: Enabling Generalized Agent Abilities for LLMs.
CoRR, 2023

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.
CoRR, 2023

AgentBench: Evaluating LLMs as Agents.
CoRR, 2023

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration.
CoRR, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GLM-130B: An Open Bilingual Pre-trained Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
GLM-130B: An Open Bilingual Pre-trained Model.
CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

2021
FastMoE: A Fast Mixture-of-Expert Training System.
CoRR, 2021

CogDL: An Extensive Toolkit for Deep Learning on Graphs.
CoRR, 2021


  Loading...