Yinmin Zhong
ORCID: 0000-0002-2504-7652
According to our database, Yinmin Zhong authored at least 10 papers between 2023 and 2024.
Bibliography
2024
IEEE/ACM Trans. Netw., June, 2024
RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion.
CoRR, 2024
DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models.
CoRR, 2024
CoRR, 2024
LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
2023
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023