Kamran Razavi

Orcid: 0000-0002-3232-5657

According to our database1, Kamran Razavi authored at least 10 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency.
Proceedings of the Companion of the 16th ACM/SPEC International Conference on Performance Engineering, 2025

2024
Resource Efficient Inference Serving With SLO Guarantee.
PhD thesis, 2024

A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems.
CoRR, 2024

NetNN: Neural Intrusion Detection System in Programmable Networks.
Proceedings of the IEEE Symposium on Computers and Communications, 2024

Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling.
Proceedings of the 4th Workshop on Machine Learning and Systems, 2024

Towards Democratic Computing.
Proceedings of the From Multimedia Communications to the Future Internet, 2024

2023
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems.
Proceedings of the 3rd Workshop on Machine Learning and Systems, 2023

2022
FA2: Fast, Accurate Autoscaling for Serving Deep Learning Inference with SLA Guarantees.
Proceedings of the 28th IEEE Real-Time and Embedded Technology and Applications Symposium, 2022

Distributed DNN serving in the network data plane.
Proceedings of the 5th International Workshop on P4 in Europe, 2022

2020
Operator as a Service: Stateful Serverless Complex Event Processing.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020


  Loading...