Tianyu Guo

Orcid: 0009-0005-2979-4486

Affiliations:
  • Sun Yat-sen University, Guangzhou, China


According to our database1, Tianyu Guo authored at least 4 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling.
CoRR, April, 2025

EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse.
Proceedings of the Euro-Par 2025: Parallel Processing, 2025

Mpache: Interaction Aware Multi-level Cache Bypassing on GPUs.
Proceedings of the 30th Asia and South Pacific Design Automation Conference, 2025

2024
SMILE: LLC-based Shared Memory Expansion to Improve GPU Thread Level Parallelism.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024


  Loading...