Zuquan Song

According to our database1, Zuquan Song authored at least 6 papers between 2024 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
BootSeer: Analyzing and Mitigating Initialization Bottlenecks in Large-Scale LLM Training.
CoRR, July, 2025

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model.
CoRR, April, 2025

Understanding Stragglers in Large Model Training Using What-if Analysis.
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025

ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development.
Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

Minder: Faulty Machine Detection for Large-scale Distributed Model Training.
Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

2024
FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion.
CoRR, 2024


  Loading...