Jiuchen Shi

Orcid: 0000-0002-5470-210X

According to our database, Jiuchen Shi authored at least 17 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
QoS Awareness and Improved Throughput of Point Cloud Services With Dynamic Workloads.
IEEE Trans. Computers, March, 2026

ELORA: Efficient LoRA and KV Cache Management for Multi-LoRA LLM Serving.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

2025
Kairos: Low-latency Multi-Agent Serving with Shared LLMs and Excessive Loads in the Public Cloud.
CoRR, August, 2025

Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management.
CoRR, May, 2025

ORION: Optimizing OLAP Query Execution with Proactive Caching and Separate Operators.
Proceedings of the 39th ACM International Conference on Supercomputing, 2025

Generating Microservice Graphs with Production Characteristics for Efficient Resource Scaling.
Proceedings of the 39th ACM International Conference on Supercomputing, 2025

Comber: QoS-Aware and Efficient Deployment for Co-located Microservices and Best-Effort Tasks in Disaggregated Datacenters.
Proceedings of the Advanced Parallel Processing Technologies, 2025

Veyth: Adaptive Container Placement for Optimizing Cross-Server Network Traffic of Microservice Applications.
Proceedings of the Advanced Parallel Processing Technologies, 2025

2024
Adaptive QoS-Aware Microservice Deployment With Excessive Loads via Intra- and Inter-Datacenter Scheduling.
IEEE Trans. Parallel Distributed Syst., September, 2024

A Microservice Graph Generator with Production Characteristics.
CoRR, 2024

2023
Nodens: Enabling Resource Efficient and Fast QoS Recovery of Dynamic Microservice Applications in Datacenters.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

BLAD: Adaptive Load Balanced Scheduling and Operator Overlap Pipeline For Accelerating The Dynamic GNN Training.
Proceedings of the International Conference for High Performance Computing, 2023

2022
Reliability and Incentive of Performance Assessment for Decentralized Clouds.
J. Comput. Sci. Technol., 2022

QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2020
OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

