Zeyu Zhang

ORCID: 0009-0005-7853-6854

Affiliations:
  • University of Virginia, USA


According to our database, Zeyu Zhang authored at least 5 papers between 2023 and 2025.

Bibliography

2025
HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference.
CoRR, February, 2025

Towards Efficient Large Multimodal Model Serving.
CoRR, February, 2025

2024
CSPS: A Communication-Efficient Sequence-Parallelism based Serving System for Transformer based Models with Long Prompts.
CoRR, 2024

Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference.
CoRR, 2024

2023
Embracing Uncertainty for Equity in Resource Allocation in ML Training.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

