Siddhant Ray

Orcid: 0000-0003-0265-2144

According to our database1, Siddhant Ray authored at least 7 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

2024
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation.
CoRR, 2024

SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing.
CoRR, 2024

Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network.
CoRR, 2024

CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024

Eloquent: A More Robust Transmission Scheme for LLM Token Streaming.
Proceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing, 2024

2022
A new hope for network model generalization.
Proceedings of the 21st ACM Workshop on Hot Topics in Networks, 2022


  Loading...