Rui Pan

Orcid: 0000-0002-6973-3259

Affiliations:
  • Princeton University, Systems for AI Lab (SAIL), Princeton, NJ, USA


According to our database1, Rui Pan authored at least 10 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning.
CoRR, April, 2025

Mowgli: Passively Learned Rate Control for Real-Time Video.
Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

2024
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation.
CoRR, 2024

Marconi: Prefix Caching for the Era of Hybrid LLMs.
CoRR, 2024

Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling.
CoRR, 2024

Tarzan: Passively-Learned Real-Time Rate Control for Video Conferencing.
CoRR, 2024

Improving DNN Inference Throughput Using Practical, Per-Input Compute Adaptation.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

2023
Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

2022
Efficient flow scheduling in distributed deep learning training with echelon formation.
Proceedings of the 21st ACM Workshop on Hot Topics in Networks, 2022


  Loading...