Zhuohan Gu

Orcid: 0009-0005-1076-6549

According to our database1, Zhuohan Gu authored at least 4 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving.
CoRR, September, 2025

METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation.
Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, 2025

2024
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation.
CoRR, 2024

LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts.
CoRR, 2024


  Loading...