Di Zhang

Orcid: 0009-0005-3115-0276

Affiliations:
  • University of North Carolina at Charlotte, NC, USA


According to our database1, Di Zhang authored at least 12 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
An Empirical Study of Machine Learning-Based Synthetic Job Trace Generation Methods.
Proceedings of the Job Scheduling Strategies for Parallel Processing, 2024

Cross-System Analysis of Job Characterization and Scheduling in Large-Scale Computing Clusters.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
ClusterLog: Clustering Logs for Effective Log-based Anomaly Detection.
CoRR, 2023

A Reinforcement Learning Based Backfilling Strategy for HPC Batch Jobs.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Drill: Log-based Anomaly Detection for Large-scale Storage Systems Using Source Code Analysis.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Early Exploration of Using ChatGPT for Log-based Anomaly Detection on Parallel File Systems Logs.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

2022
A Study of Failure Recovery and Logging of High-Performance Parallel File Systems.
ACM Trans. Storage, 2022

SchedInspector: A Batch Job Scheduling Inspector Using Reinforcement Learning.
Proceedings of the HPDC '22: The 31st International Symposium on High-Performance Parallel and Distributed Computing, Minneapolis, MN, USA, 27 June 2022, 2022

ClusterLog: Clustering Logs for Effeftxsctive Log-based Anomaly Detection.
Proceedings of the 12th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2022

2021
SentiLog: Anomaly Detecting on Parallel File Systems via Log-based Sentiment Analysis.
Proceedings of the HotStorage '21: 13th ACM Workshop on Hot Topics in Storage and File Systems, 2021

2020
RLScheduler: an automated HPC batch job scheduler using reinforcement learning.
Proceedings of the International Conference for High Performance Computing, 2020

2019
RLScheduler: Learn to Schedule HPC Batch Jobs Using Deep Reinforcement Learning.
CoRR, 2019


  Loading...