Shengwei Li
This page is a disambiguation page; it contains multiple papers by persons with the same or a similar name.
Bibliography
2025
Frontiers Comput. Sci., October, 2025
Oases: Efficient Large-Scale Model Training on Commodity Servers via Overlapped and Automated Tensor Model Parallelism.
IEEE Trans. Parallel Distributed Syst., September, 2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
H2: Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips.
CoRR, May, 2025
AutoPipe-H: A Heterogeneity-Aware Data-Paralleled Pipeline Approach on Commodity GPU Servers.
IEEE Trans. Computers, April, 2025
An Efficient Sequential Decentralized Federated Progressive Channel Pruning Strategy for Smart Grid Electricity Theft Detection.
IEEE Trans. Ind. Informatics, March, 2025
Real-Time Modeling Method for Large-Scale Photovoltaic Power Stations Using Nested Fast and Simultaneous Solution.
IEEE Trans. Ind. Electron., March, 2025
Internal Short-Circuit Fault Diagnosis for Batteries of Energy Storage Stations Based on Multivariate Multiscale Sample Entropy.
IEEE Trans. Ind. Electron., February, 2025
2024
IEEE Trans. Parallel Distributed Syst., August, 2024
IEEE Trans. Parallel Distributed Syst., April, 2024
Pro-Prophet: A Systematic Load Balancing Method for Efficient Parallel Training of Large-scale MoE Models.
CoRR, 2024
Stability and Generalization of Asynchronous SGD: Sharper Bounds Beyond Lipschitz and Smoothness.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
HSDP: Accelerating Large-scale Model Training via Efficient Sharded Data Parallelism.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2024
End-To-End Control of a Quadrotor Using Gaussian Ensemble Model-Based Reinforcement Learning.
Proceedings of the Intelligence Science V - 6th IFIP TC 12 International Conference, 2024
Coordinated Multi-regional Logistics Path Planning: A Broad Reinforcement Learning Framework.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2024
2023
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models.
IEEE Trans. Parallel Distributed Syst., May, 2023
CoRR, 2023
Automated Tensor Model Parallelism with Overlapped Communication for Efficient Foundation Model Training.
CoRR, 2023
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023
Communication Analysis for Multidimensional Parallel Training of Large-scale DNN Models.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023
Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models.
Proceedings of the IEEE International Conference on Cluster Computing, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the 23rd IEEE/ACIS International Conference on Computer and Information Science, 2023
2022
EmbRace: Accelerating Sparse Communication for Distributed Training of Deep Neural Networks.
Proceedings of the 51st International Conference on Parallel Processing, 2022
AutoPipe: A Fast Pipeline Parallelism Approach with Balanced Partitioning and Micro-batch Slicing.
Proceedings of the IEEE International Conference on Cluster Computing, 2022
HPH: Hybrid Parallelism on Heterogeneous Clusters for Accelerating Large-scale DNNs Training.
Proceedings of the IEEE International Conference on Cluster Computing, 2022
2021
EmbRace: Accelerating Sparse Communication for Distributed Training of NLP Neural Networks.
CoRR, 2021
Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
Proceedings of the IEEE International Conference on Cluster Computing, 2021
2009
Proceedings of the 2009 IEEE International Conference on Granular Computing, 2009