Dmitrii Ustiugov

Orcid: 0000-0003-3156-010X

According to our database1, Dmitrii Ustiugov authored at least 27 papers between 2016 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
FASER: Fine-Grained Phase Management for Speculative Decoding in Dynamic LLM Serving.
CoRR, April, 2026

Nexus: Transparent I/O Offloading for High-Density Serverless Computing.
CoRR, April, 2026

CodecSight: Leveraging Video Codec Signals for Efficient Streaming VLM Inference.
CoRR, April, 2026

PromptTuner: SLO-Aware Elastic System for LLM Prompt Tuning.
CoRR, March, 2026

MemTrust: A Zero-Trust Architecture for Unified AI Memory System.
CoRR, January, 2026

2025
TokenScale: Timely and Accurate Autoscaling for Disaggregated LLM Serving with Token Velocity.
CoRR, December, 2025

The High Cost of Keeping Warm: Characterizing Overhead in Serverless Autoscaling Policies.
CoRR, September, 2025

Melding the Serverless Control Plane with the Conventional Cluster Manager for Speed and Compatibility.
CoRR, May, 2025

Shattering the Ephemeral Storage Cost Barrier for Data-Intensive Serverless Workflows.
Proceedings of the 3rd Workshop on SErverless Systems, Applications and MEthodologies, 2025

Manage the Workloads not the Cluster: Designing a Control Plane for Large-Scale AI Clusters.
Proceedings of the 5th Workshop on Machine Learning and Systems, 2025

Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models.
CoRR, 2024

ServerlessLLM: Low-Latency Serverless Inference for Large Language Models.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

2023
Expedited Data Transfers for Serverless Clouds.
CoRR, 2023

Enabling In-Vitro Serverless Systems Research.
Proceedings of the 4th Workshop on Resource Disaggregation and Serverless, 2023

2022
Lukewarm serverless functions: characterization and optimization.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

2021
Analyzing Tail Latency in Serverless Clouds with STeLLAR.
Proceedings of the IEEE International Symposium on Workload Characterization, 2021

Benchmarking, analysis, and optimization of serverless function snapshots.
Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

PTEMagnet: fine-grained physical memory reservation for faster page walks in public clouds.
Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020
Bankrupt Covert Channel: Turning Network Predictability into Vulnerability.
Proceedings of the 14th USENIX Workshop on Offensive Technologies, 2020

2019
Prefetched Address Translation.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

2018
Mitigating Load Imbalance in Distributed Data Serving with Rack-Scale Memory Pooling.
ACM Trans. Comput. Syst., 2018

Algorithm/Architecture Co-Design for Near-Memory Processing.
ACM SIGOPS Oper. Syst. Rev., 2018

Storage-Class Memory Hierarchies for Scale-Out Servers.
CoRR, 2018

Design guidelines for high-performance SCM hierarchies.
Proceedings of the International Symposium on Memory Systems, 2018

2017
The Mondrian Data Engine.
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

2016
SABRes: Atomic object reads for in-memory rack-scale computing.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016


  Loading...