Xiang Li

Orcid: 0000-0002-4930-0878

According to our database1, Xiang Li authored at least 17 papers between 2017 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
ResCheckpointer: Building Program Error Resilience-Aware Checkpointing Mechanism for HPC Systems.
J. Comput. Sci. Technol., May, 2025

Convergence-aware optimal checkpointing for exploratory deep learning training jobs.
Future Gener. Comput. Syst., 2025

GraphFT: A Lightweight Fault-tolerant Framework for Iterative Graph Processing.
Proceedings of the Wireless Artificial Intelligent Computing Systems and Applications, 2025

ArrayPipe: Introducing Job-Array Pipeline Parallelism for High Throughput Model Exploration.
Proceedings of the IEEE INFOCOM 2025, 2025

2024
Interference-aware opportunistic job placement for shared distributed deep learning clusters.
J. Parallel Distributed Comput., January, 2024

HPC-Crash: Characterizing Crash-Proneness of HPC Programs from Various Perspectives.
Proceedings of the 10th IEEE International Conference on High Performance and Smart Computing, 2024

2023
ExplSched: Maximizing Deep Learning Cluster Efficiency for Exploratory Jobs.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
Cooperative task assignment in spatial crowdsourcing via multi-agent deep reinforcement learning.
J. Syst. Archit., 2022

MSSA-FL: High-Performance Multi-stage Semi-asynchronous Federated Learning with Non-IID Data.
Proceedings of the Knowledge Science, Engineering and Management, 2022

Hybrid Parameter Update: Alleviating Imbalance Impacts for Distributed Deep Learning.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

2021
AreaTransfer: A Cross-City Crowd Flow Prediction Framework Based on Transfer Learning.
Proceedings of the Smart Computing and Communication - 6th International Conference, 2021

2020
Job Placement Strategy with Opportunistic Resource Sharing for Distributed Deep Learning Clusters.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

2019
Pec: Proactive Elastic Collaborative Resource Scheduling in Data Stream Processing.
IEEE Trans. Parallel Distributed Syst., 2019

2018
A Task Allocation Method for Stream Processing with Recovery Latency Constraint.
J. Comput. Sci. Technol., 2018

2017
Minimum Backups for Stream Processing With Recovery Latency Guarantees.
IEEE Trans. Reliab., 2017

Integrated recovery and task allocation for stream processing.
Proceedings of the 36th IEEE International Performance Computing and Communications Conference, 2017

Task Allocation for Stream Processing with Recovery Latency Guarantee.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017


  Loading...