Ze Li

ORCID: 0000-0003-4789-0170

Affiliations:
  • Microsoft Azure, Redmond, WA, USA


According to our database, Ze Li authored at least 11 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Orchestrating Cross-Layer Anomaly Detection and Mitigation to Address Gray Failures in Large-Scale Cloud Infrastructure.
Proceedings of the IEEE/ACM International Workshop on Cloud Intelligence & AIOps, 2025

2024
Why does Prediction Accuracy Decrease over Time? Uncertain Positive Learning for Cloud Failure Prediction.
CoRR, 2024

Large Language Models Can Provide Accurate and Interpretable Incident Triage.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

Early Bird: Ensuring Reliability of Cloud Systems Through Early Failure Prediction.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

Can We Trust Auto-Mitigation? Improving Cloud Failure Prediction with Uncertain Positive Learning.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

Deoxys: A Causal Inference Engine for Unhealthy Node Mitigation in Large-scale Cloud Infrastructure.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2021
Onion: identifying incident-indicating logs for cloud systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

HALO: Hierarchy-aware Fault Localization for Cloud Systems.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2020
Efficient incident identification from multi-dimensional issue reports via meta-heuristic search.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Gandalf: An Intelligent, End-To-End Analytics Service for Safe Deployment in Large-Scale Cloud Infrastructure.
Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation, 2020

2019
Robust log-based anomaly detection on unstable log data.
Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019

