Yaqi Xia
Orcid: 0009-0006-8101-785X
According to our database, Yaqi Xia authored at least 12 papers between 2021 and 2026.
Bibliography
2026
JanusQuant: Accurate and Efficient 2-bit KV Cache Quantization for Long-Context Inference.
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026
2025
Voltrix: Sparse Matrix-Matrix Multiplication on Tensor Cores with Asynchronous and Balanced Kernel Optimization.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025
Proceedings of the International Conference for High Performance Computing, 2025
Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025
2024
Redundancy-Free and Load-Balanced TGNN Training With Hierarchical Pipeline Parallelism.
IEEE Trans. Parallel Distributed Syst., November, 2024
Raptor-T: A Fused and Memory-Efficient Sparse Transformer for Long and Variable-Length Sequences.
IEEE Trans. Computers, July, 2024
MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline Parallelism.
IEEE Trans. Parallel Distributed Syst., June, 2024
Scaling New Heights: Transformative Cross-GPU Sampling for Training Billion-Edge Graphs.
Proceedings of the International Conference for High Performance Computing, 2024
Accelerating Distributed DLRM Training with Optimized TT Decomposition and Micro-Batching.
Proceedings of the International Conference for High Performance Computing, 2024
2023
MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Redundancy-Free High-Performance Dynamic GNN Training with Hierarchical Pipeline Parallelism.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023
2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021