Zihan Jiang

Orcid: 0000-0003-0632-7402

Affiliations:
  • Huawei Technologies, China


According to our database1, Zihan Jiang authored at least 19 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
RTT- or Bandwidth-Bound? Demystifying the KV Cache Transfer in Large Language Model Serving.
Proceedings of the 2nd Workshop on Networks for AI Computing, 2025

2023
CMLCompiler: A Unified Compiler for Classical Machine Learning.
Proceedings of the 37th International Conference on Supercomputing, 2023

2022
A systematic study on benchmarking AI inference accelerators.
CCF Trans. High Perform. Comput., 2022

2021
OpenClinicalAI: enabling AI to diagnose diseases in real-world clinical settings.
CoRR, 2021

HPC AI500: Representative, Repeatable and Simple HPC AI Benchmarking.
CoRR, 2021

AIBench Training: Balanced Industry-Standard AI Training Benchmarking.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

Pinpointing the Memory Behaviors of DNN Training.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

HPC AI500 V2.0: The Methodology, Tools, and Metrics for Benchmarking HPC AI Systems.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

AIBench Scenario: Scenario-Distilling AI Benchmarking.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020
HPC AI500: The Methodology, Tools, Roofline Performance Models, and Metrics for Benchmarking HPC AI Systems.
CoRR, 2020

AIBench: Scenario-distilling AI Benchmarking.
CoRR, 2020

AIBench: An Industry Standard AI Benchmark Suite from Internet Services.
CoRR, 2020

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite.
CoRR, 2020

Characterizing the I/O Pipeline in the Deployment of CNNs on Commercial Accelerators.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

2019
HPC AI500: A Benchmark Suite for HPC AI Systems.
CoRR, 2019

A Semantic-based Medical Image Fusion Approach.
CoRR, 2019

BOPS, A New Computation-Centric Metric for Datacenter Computing.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2019

2018
HPC AI500: A Benchmark Suite for HPC AI Systems.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2018

AIBench: Towards Scalable and Comprehensive Datacenter AI Benchmarking.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2018


  Loading...