Weiyu Xie
Orcid: 0000-0003-0173-1027
According to our database1,
Weiyu Xie authored at least 4 papers
between 2024 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
From Prefix Cache to Fusion RAG Cache: Accelerating LLM Inference in Retrieval-Augmented Generation.
CoRR, January, 2026
2025
KTransformers: Unleashing the Full Potential of CPU/GPU Hybrid Inference for MoE Models.
Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, 2025
Scaling Asynchronous Graph Query Processing via Partitioned Stateful Traversal Machines.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025
2024
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024