Yaqi Xia

Orcid: 0009-0006-8101-785X

According to our database¹, Yaqi Xia authored at least 12 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

JanusQuant: Accurate and Efficient 2-bit KV Cache Quantization for Long-Context Inference.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026

2025

Voltrix: Sparse Matrix-Matrix Multiplication on Tensor Cores with Asynchronous and Balanced Kernel Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2025 USENIX Annual Technical Conference, 2025

MXBLAS: Accelerating 8-bit Deep Learning with a Unified Micro-Scaled GEMM Library.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2025

Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

2024

Redundancy-Free and Load-Balanced TGNN Training With Hierarchical Pipeline Parallelism.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., November, 2024

Raptor-T: A Fused and Memory-Efficient Sparse Transformer for Long and Variable-Length Sequences.

[BibT_eX]

[DOI]

IEEE Trans. Computers, July, 2024

MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline Parallelism.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., June, 2024

Scaling New Heights: Transformative Cross-GPU Sampling for Training Billion-Edge Graphs.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2024

Accelerating Distributed DLRM Training with Optimized TT Decomposition and Micro-Batching.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2024

2023

MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Redundancy-Free High-Performance Dynamic GNN Training with Hierarchical Pipeline Parallelism.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

2021

ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Yaqi Xia

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...