Xingqi Cui

According to our database, Xingqi Cui authored at least 6 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Taming Latency-Memory Trade-Off in MoE-Based LLM Serving via Fine-Grained Expert Offloading.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
From Models to Operators: Rethinking Autoscaling Granularity for Large Generative Models.
CoRR, November, 2025

RouterArena: An Open Platform for Comprehensive Comparison of LLM Routers.
CoRR, October, 2025

Towards Efficient and Practical GPU Multitasking in the Era of LLM.
CoRR, August, 2025

fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving.
CoRR, February, 2025

2022
HiTDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge.
IEEE Trans. Parallel Distributed Syst., 2022

