Yong Li

ORCID: 0000-0001-9072-3170

Affiliations:
  • Alibaba Group, Beijing, China


According to our database, Yong Li authored at least 32 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Skrull: Towards Efficient Long Context Fine-tuning through Dynamic Data Scheduling.
CoRR, May 2025

Wan: Open and Advanced Large-Scale Video Generative Models.
CoRR, March 2025

Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inference.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

Efficient Long Context Fine-tuning with Chunk Flow.
Proceedings of the 42nd International Conference on Machine Learning, 2025

CaliEX: A Disk-Based Large-Scale GNN Training System with Joint Design of Caching and Execution.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Spindle: Efficient Distributed Training of Multi-Task Large Models via Wavefront Scheduling.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG.
IEEE Trans. Parallel Distributed Syst., 2024

BladeDISC++: Memory Optimizations Based On Symbolic Shape.
CoRR, 2024

Efficient Multi-Task Large Model Training via Data Heterogeneity-aware Model Management.
CoRR, 2024

Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling.
CoRR, 2024

Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache.
CoRR, 2024

Llumnix: Dynamic Scheduling for Large Language Model Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

2023
Flash-LLM: Enabling Low-Cost and Highly-Efficient Large Generative Model Inference With Unstructured Sparsity.
Proc. VLDB Endow., 2023

GoldMiner: Elastic Scaling of Training Data Pre-Processing Pipelines for Deep Learning.
Proc. ACM Manag. Data, 2023

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout.
CoRR, 2023

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity.
CoRR, 2023

TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation.
CoRR, 2023

EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2023

uGrapher: High-Performance Graph Operator Computation via Unified Abstraction for Graph Neural Networks.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
EasyScale: Accuracy-consistent Elastic Training for Deep Learning.
CoRR, 2022

Structure Enhanced Graph Neural Networks for Link Prediction.
CoRR, 2022

Whale: Efficient Giant Model Training over Heterogeneous GPUs.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022

PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

2021
M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining.
CoRR, 2021

Exploring Sparse Expert Models and Beyond.
CoRR, 2021

M6: A Chinese Multimodal Pretrainer.
CoRR, 2021

2020
Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training.
CoRR, 2020

EasyTransfer - A Simple and Scalable Deep Transfer Learning Platform for NLP Applications.
CoRR, 2020

Whale: A Unified Distributed Training Framework.
CoRR, 2020

AntMan: Dynamic Scaling on GPU Clusters for Deep Learning.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

2019
AliGraph: A Comprehensive Graph Neural Network Platform.
Proc. VLDB Endow., 2019

