Shenggan Cheng
Orcid: 0000-0002-7966-2941
According to our database, Shenggan Cheng authored at least 27 papers between 2020 and 2026.
Bibliography
2026
Transforming the Use of Earth Observation Data: Exascale Training of a Generative Compression Model with Historical Priors for up to 10,000x Data Reduction.
CoRR, May, 2026
DiT-HC: Enabling Efficient Training of Visual Generation Model DiT on HPC-oriented CPU Cluster.
CoRR, January, 2026
HelixPipe: Efficient Distributed Training of Long Sequence Transformers with Attention Parallel Pipeline Parallelism.
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026
2025
Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving.
CoRR, September, 2025
SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation.
CoRR, May, 2025
StarTrail: Concentric Ring Sequence Parallelism for Efficient Near-Infinite-Context Transformer Model Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Concerto: Automatic Communication Optimization and Scheduling for Large-Scale Deep Learning.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
WallFacer: Guiding Transformer Model Training Out of the Long-Context Dark Forest with N-body Problem.
CoRR, 2024
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices.
CoRR, 2024
CoRR, 2024
Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
HeteGen: Efficient Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency.
Proceedings of the International Conference for High Performance Computing, 2023
2022
2021
Proceedings of the IEEE International Conference on Cluster Computing, 2021
2020
HMS-Net: Hierarchical Multi-Scale Sparsity-Invariant Network for Sparse Depth Completion.
IEEE Trans. Image Process., 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020