We stand with Ukraine

We stand with Ukraine

Yonghao Zhuang

Orcid: 0009-0001-8969-7478

Affiliations:

Carnegie Mellon University, Computer Science Department, Pittsburgh, PA, USA
Shanghai Jiao Tong University, Shanghai, China (former)

According to our database¹, Yonghao Zhuang authored at least 16 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2025

Efficient Long-context Language Model Training by Core Attention Disaggregation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, October, 2025

K2-Think: A Parameter-Efficient Reasoning System.

[BibT_eX]

[DOI]

,

,

,

Taylor W. Killian

,

,

,

,

Alexander Moreno

,

,

,

,

,

,

,

,

Varad Pimpalkhute

,

,

Aaryamonvikram Singh

,

,

,

,

,

,

,

Mikhail Yurochkin

,

,

,

,

,

,

CoRR, September, 2025

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Abulhair Saparov

,

,

Taylor W. Killian

,

Mikhail Yurochkin

,

,

,

CoRR, June, 2025

LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch.

[BibT_eX]

[DOI]

,

,

,

Willie Neiswanger

,

,

,

,

,

,

Omkar Pangarkar

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2025

Scaling Long Context Training Data by Long-Distance Referrals.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Toward Inference-optimal Mixture-of-Expert Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Joseph E. Gonzalez

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

LLM360: Towards Fully Transparent Open-Source LLMs.

[BibT_eX]

[DOI]

,

,

Willie Neiswanger

,

,

,

,

,

,

,

Omkar Pangarkar

,

,

,

,

,

,

,

,

,

,

,

,

Roberto Iriondo

,

,

,

,

,

,

CoRR, 2023

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Joseph E. Gonzalez

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On Optimizing the Communication of Model Parallelism.

[BibT_eX]

[DOI]

,

,

,

,

,

Joseph Gonzalez

,

,

,

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

2022

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Joseph E. Gonzalez

,

CoRR, 2022

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Joseph E. Gonzalez

,

Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

Loading...