Xiangrui Yu
Orcid: 0009-0005-2478-1512
According to our database1,
Xiangrui Yu authored at least 4 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
ROME: Maximizing GPU Efficiency for All-Pairs Shortest Path via Taming Fine-Grained Irregularities.
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026
ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026
2025
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs.
Proceedings of the Twentieth European Conference on Computer Systems, 2025
2023
Balancing Computation and Communication in Distributed Sparse Matrix-Vector Multiplication.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023