Yiyuan He

Orcid: 0009-0003-2128-2852

According to our database, Yiyuan He authored at least 7 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
BanaServe: Unified KV Cache and Dynamic Module Migration for Balancing Disaggregated LLM Serving in AI Infrastructure.
CoRR, October, 2025

Cloud Native System for LLM Inference Serving.
CoRR, July, 2025

Unlock the Potential of Fine-grained LLM Serving via Dynamic Module Scaling.
CoRR, July, 2025

CloudNativeSim: A Toolkit for Modeling and Simulation of Cloud-Native Applications.
Softw. Pract. Exp., 2025

2024
UELLM: A Unified and Efficient Approach for LLM Inference Serving.
CoRR, 2024

UELLM: A Unified and Efficient Approach for Large Language Model Inference Serving.
Proceedings of the Service-Oriented Computing - 22nd International Conference, 2024

Resource Management for GPT-Based Model Deployed on Clouds: Challenges, Solutions, and Future Directions.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2024

