Yaqi Xia

Orcid: 0009-0006-8101-785X

According to our database1, Yaqi Xia authored at least 12 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
JanusQuant: Accurate and Efficient 2-bit KV Cache Quantization for Long-Context Inference.
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026

2025
Voltrix: Sparse Matrix-Matrix Multiplication on Tensor Cores with Asynchronous and Balanced Kernel Optimization.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

MXBLAS: Accelerating 8-bit Deep Learning with a Unified Micro-Scaled GEMM Library.
Proceedings of the International Conference for High Performance Computing, 2025

Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

2024
Redundancy-Free and Load-Balanced TGNN Training With Hierarchical Pipeline Parallelism.
IEEE Trans. Parallel Distributed Syst., November, 2024

Raptor-T: A Fused and Memory-Efficient Sparse Transformer for Long and Variable-Length Sequences.
IEEE Trans. Computers, July, 2024

MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline Parallelism.
IEEE Trans. Parallel Distributed Syst., June, 2024

Scaling New Heights: Transformative Cross-GPU Sampling for Training Billion-Edge Graphs.
Proceedings of the International Conference for High Performance Computing, 2024

Accelerating Distributed DLRM Training with Optimized TT Decomposition and Micro-Batching.
Proceedings of the International Conference for High Performance Computing, 2024

2023
MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Redundancy-Free High-Performance Dynamic GNN Training with Hierarchical Pipeline Parallelism.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

2021
ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021


  Loading...