Mingzhen Li
Orcid: 0000-0002-4115-9072Affiliations:
- Chinese Academy of Sciences, Insititute of Computing Technology, State Key Lab of Processors (SKLP), Beijing, China
- Beihang University, Beijing, China (PhD 2023)
According to our database1,
Mingzhen Li
authored at least 31 papers
between 2018 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
29-Billion Atoms Molecular Dynamics Simulation With Ab Initio Accuracy on 35 Million Cores of New Sunway Supercomputer.
IEEE Trans. Computers, May, 2025
CCF Trans. High Perform. Comput., April, 2025
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025
Proceedings of the 2025 IEEE International Parallel and Distributed Processing Symposium, 2025
Proceedings of the 22nd ACM International Conference on Computing Frontiers, 2025
2024
Towards optimized tensor code generation for deep learning on sunway many-core processor.
Frontiers Comput. Sci., April, 2024
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG.
IEEE Trans. Parallel Distributed Syst., 2024
Building a domain-specific compiler for emerging processors with a reusable approach.
Sci. China Inf. Sci., 2024
Proceedings of the 31st IEEE International Conference on High Performance Computing, 2024
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
2023
CCF Trans. High Perform. Comput., September, 2023
EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.
Proceedings of the International Conference for High Performance Computing, 2023
Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the 52nd International Conference on Parallel Processing, 2023
2022
QoS-aware dynamic resource allocation with improved utilization and energy efficiency on GPU.
Parallel Comput., 2022
CoRR, 2022
FamilySeer: Towards Optimized Tensor Codes by Exploiting Computation Subgraph Similarity.
CoRR, 2022
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022
2021
IEEE Trans. Parallel Distributed Syst., 2021
IEEE Trans. Emerg. Top. Comput., 2021
Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference.
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021
2020
IEEE Trans. Parallel Distributed Syst., 2020
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020
swRodinia: A Benchmark Suite for Exploiting Architecture Properties of Sunway Processor.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2020
2018
Block-Checksum-Based Fault Tolerance for Matrix Multiplication on Large-Scale Parallel Systems.
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018