Beiquan Cao

According to our database1, Beiquan Cao authored at least 2 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Dispenser: Hierarchical KV Cache Management for Efficient LLM Generative Inference.
Proceedings of the 31th IEEE International Conference on Parallel and Distributed Systems, 2025

2024
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024


  Loading...