Ramya Prabhu

ORCID: 0009-0002-7266-5522

According to our database, Ramya Prabhu authored at least 4 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
The Cost of Expertise: Understanding MoE Decode Performance.
Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, 2026

2025
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2022
Knowledge Graph-based Thematic Similarity for Indian Legal Judgement Documents using Rhetorical Roles.
Proceedings of the 19th International Conference on Natural Language Processing, 2022

