Shengwei Li

ORCID: 0000-0002-7419-1511

According to our database, Shengwei Li authored at least 18 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
A Memory-Efficient Hybrid Parallel Framework for Deep Neural Network Training.
IEEE Trans. Parallel Distributed Syst., April, 2024

2023
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models.
IEEE Trans. Parallel Distributed Syst., May, 2023

Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent.
CoRR, 2023

Automated Tensor Model Parallelism with Overlapped Communication for Efficient Foundation Model Training.
CoRR, 2023

Improved Accuracy of XCO2 Retrieval Based on OCO-2 RtRetrieval Framework Model.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023

Observing Anthropogenic CO2 Emissions with TanSat in Northeast China.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023

Monitoring of Xch4 Changes and Anomaly in Hebei Province, China Based on Tropomi.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023

Communication Analysis for Multidimensional Parallel Training of Large-scale DNN Models.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023

Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

Stability-Based Generalization Analysis of the Asynchronous Decentralized SGD.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Keyword Spotting Based on CTC and Similarity Matching for Chinese Speech.
Proceedings of the 23rd IEEE/ACIS International Conference on Computer and Information Science, 2023

2022
EmbRace: Accelerating Sparse Communication for Distributed Training of Deep Neural Networks.
Proceedings of the 51st International Conference on Parallel Processing, 2022

AutoPipe: A Fast Pipeline Parallelism Approach with Balanced Partitioning and Micro-batch Slicing.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

HPH: Hybrid Parallelism on Heterogeneous Clusters for Accelerating Large-scale DNNs Training.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
EmbRace: Accelerating Sparse Communication for Distributed Training of NLP Neural Networks.
CoRR, 2021

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training.
Proceedings of the 50th International Conference on Parallel Processing, 2021

2PGraph: Accelerating GNN Training over Large Graphs on GPU Clusters.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2009
Mining closed frequent itemset based on FP-Tree.
Proceedings of the 2009 IEEE International Conference on Granular Computing, 2009

