Qizheng Zhang

Orcid: 0009-0009-3208-4601

According to our database1, Qizheng Zhang authored at least 16 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
FlowRL: Matching Reward Distributions for LLM Reasoning.
CoRR, September, 2025

Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching.
CoRR, June, 2025

LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits.
CoRR, February, 2025

CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

2024
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024

Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

GRACE: Loss-Resilient Real-Time Video through Neural Codecs.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

The Dataflow Abstract Machine Simulator Framework.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

2023
Grace++: Loss-Resilient Real-Time Video Communication under High Network Latency.
CoRR, 2023

Optimizing Real-Time Video Experience with Data Scalable Codec.
Proceedings of the 2023 Workshop on Emerging Multimedia Systems, 2023

OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

2022
GRACE: Loss-Resilient Real-Time Video Communication Using Data-Scalable Autoencoder.
CoRR, 2022

AccMPEG: Optimizing Video Encoding for Video Analytics.
CoRR, 2022

Understanding the potential of server-driven edge video analytics.
Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022

AccMPEG: Optimizing Video Encoding for Accurate Video Analytics.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

2020
Server-Driven Video Streaming for Deep Learning Inference.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020


  Loading...