Shenhan Zhu

Orcid: 0009-0004-0267-775X

According to our database1, Shenhan Zhu authored at least 9 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Galvatron: An Automatic Distributed System for Efficient Foundation Model Training.
CoRR, April, 2025

NetMoE: Accelerating MoE Training through Dynamic Sample Placement.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Spindle: Efficient Distributed Training of Multi-Task Large Models via Wavefront Scheduling.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

FlexSP: Accelerating Large Language Model Training via Flexible Sequence Parallelism.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
Improving Automatic Parallel Training via Balanced Memory Workload Optimization.
IEEE Trans. Knowl. Data Eng., 2024

Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training.
CoRR, 2024

Efficient Multi-Task Large Model Training via Data Heterogeneity-aware Model Management.
CoRR, 2024

LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

X-former Elucidator: Reviving Efficient Attention for Long Context Language Modeling.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024


  Loading...