Jinghan Yao
Orcid: 0009-0002-7129-9508
According to our database1,
Jinghan Yao authored at least 17 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters.
CoRR, April, 2026
MAC-Attention: a Match-Amend-Complete Scheme for Fast and Accurate Attention Computation.
CoRR, April, 2026
2025
Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer.
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025
A High-Density Transcranial Electrical Stimulation System on Chip with Real-Time Bio-Impedance Sensing.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025
Unified Designs of Multi-Rail-Aware MPI Allreduce and Alltoall Operations Across Diverse GPU and Interconnect Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2025
Design and Optimization of GPU-Aware MPI Allreduce Using Direct Sendrecv Communication.
Proceedings of the 54th International Conference on Parallel Processing, 2025
2024
Microelectron. J., 2024
Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer.
CoRR, 2024
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
HyperSack: Distributed Hyperparameter Optimization for Deep Learning using Resource-Aware Scheduling on Heterogeneous GPU Systems.
Proceedings of the 31st IEEE International Conference on High Performance Computing, 2024
Proceedings of the IEEE Biomedical Circuits and Systems Conference, 2024
2023
MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
A Novel Framework for Efficient Offloading of Communication Operations to Bluefield SmartNICs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference.
Proceedings of the 30th IEEE International Conference on High Performance Computing, 2023
2021
IEEE Trans. Cybern., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
2019