Yonghao Zhuang

Orcid: 0009-0001-8969-7478

Affiliations:
  • Carnegie Mellon University, Computer Science Department, Pittsburgh, PA, USA
  • Shanghai Jiao Tong University, Shanghai, China (former)


According to our database1, Yonghao Zhuang authored at least 14 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective.
CoRR, June, 2025

LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch.
CoRR, January, 2025

Scaling Long Context Training Data by Long-Distance Referrals.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs.
CoRR, 2024

Toward Inference-optimal Mixture-of-Expert Large Language Models.
CoRR, 2024

RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
LLM360: Towards Fully Transparent Open-Source LLMs.
CoRR, 2023

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs.
CoRR, 2023

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On Optimizing the Communication of Model Parallelism.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

2022
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.
CoRR, 2022

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022


  Loading...