Chengzhi Lu

According to our database, Chengzhi Lu authored at least 17 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
FASER: Fine-Grained Phase Management for Speculative Decoding in Dynamic LLM Serving.
CoRR, April, 2026

FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters.
Proceedings of the 21st European Conference on Computer Systems, 2026

High Throughput and Low Latency LLM Serving via Adaptive KV Caching.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
TokenScale: Timely and Accurate Autoscaling for Disaggregated LLM Serving with Token Velocity.
CoRR, December, 2025

Serving LLM in Distributed GPU Cluster With Fine-Grain Pipeline Constraints.
IEEE Trans. Serv. Comput., 2025

Multiplexing Dynamic Deep Learning Workloads with SLO-awareness in GPU Clusters.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

2024
SMIless: Serving DAG-based Inference with Dynamic Invocations under Serverless Computing.
Proceedings of the International Conference for High Performance Computing, 2024

Planck: Optimizing LLM Inference Performance in Pipeline Parallelism with Fine-Grained SLO Constraint.
Proceedings of the IEEE International Conference on Web Services, 2024

2023
Understanding and Optimizing Workloads for Unified Resource Management in Large Cloud Platforms.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
An In-Depth Study of Microservice Call Graph and Runtime Performance.
IEEE Trans. Parallel Distributed Syst., 2022

2021
Neuroprotective Effect of Monosialotetrahexosylganglioside (GM1) on Patients with Parkinson's Disease Anesthetized by Ketamine under Denoising Algorithm-Based Ultrasound Image Diagnosis.
Sci. Program., 2021

RPTCN: Resource Prediction for High-dynamic Workloads in Clouds based on Deep Learning.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

Characterizing Microservice Dependency and Performance: Alibaba Trace Analysis.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021

2020
Interference Analysis of Co-Located Container Workloads: A Perspective from Hardware Performance Counters.
J. Comput. Sci. Technol., 2020

2019
ADGS: Anomaly Detection and Localization Based on Graph Similarity in Container-Based Clouds.
Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019

2018
Modeling Application Performance in Docker Containers Using Machine Learning Techniques.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

2017
Imbalance in the cloud: An analysis on Alibaba cluster trace.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

