Shenhan Zhu
Orcid: 0009-0004-0267-775X
According to our database1,
Shenhan Zhu
authored at least 9 papers
between 2024 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
CoRR, April, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Spindle: Efficient Distributed Training of Multi-Task Large Models via Wavefront Scheduling.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
FlexSP: Accelerating Large Language Model Training via Flexible Sequence Parallelism.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
IEEE Trans. Knowl. Data Eng., 2024
Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training.
CoRR, 2024
Efficient Multi-Task Large Model Training via Data Heterogeneity-aware Model Management.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
X-former Elucidator: Reviving Efficient Attention for Long Context Language Modeling.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024