Zhuohan Gu

Orcid: 0009-0005-1076-6549

According to our database, Zhuohan Gu authored at least 7 papers between 2024 and 2026.

Bibliography

2026
PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents.
CoRR, May, 2026

DroidSpeak: KV Cache Sharing Across Fine-tuned Model Variants.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

2025
EVICPRESS: Joint KV-Cache Compression and Eviction for Efficient LLM Serving.
CoRR, December, 2025

AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving.
CoRR, September, 2025

METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation.
Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, 2025

2024
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation.
CoRR, 2024

LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts.
CoRR, 2024

