Ziheng Jiang
Orcid: 0009-0008-6318-1407
According to our database1,
Ziheng Jiang authored at least 46 papers
between 2010 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
DisagMoE: Computation-Communication overlapped MoE Training via Disaggregated AF-Pipe Parallelism.
CoRR, May, 2026
SRT: Accelerating Reinforcement Learning via Speculative Rollout with Tree-Structured Cache.
CoRR, January, 2026
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.
Proceedings of the 21st European Conference on Computer Systems, 2026
SwiftSpec: Disaggregated Speculative Decoding and Fused Kernels for Low-Latency LLM Inference.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026
2025
Mesh-Attention: A New Communication-Efficient Distributed Attention with Improved Data Locality.
CoRR, December, 2025
SwiftSpec: Ultra-Low Latency LLM Decoding by Scaling Asynchronous Speculative Decoding.
CoRR, June, 2025
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.
CoRR, May, 2025
Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler.
CoRR, April, 2025
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism.
CoRR, April, 2025
Batch Informed Vines (BIV*): Heuristically Guided Exploration of Narrow Passages by Batch Vine Expansion.
IEEE Robotics Autom. Lett., February, 2025
MegaScale-Infer: Efficient Mixture-of-Experts Model Serving with Disaggregated Expert Parallelism.
Proceedings of the ACM SIGCOMM 2025 Conference, 2025
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025
TileLink: Generating Efficient Compute-Communication Overlapping Kernels using Tile-Centric Primitives.
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
CoRR, 2024
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
Towards Automated Chinese Ancient Character Restoration: A Diffusion-Based Method with a New Dataset.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
High Accuracy Near-Field Electromagnetic Emission Identification System Using Characteristics Recognition and Image Localization.
IEEE Trans. Instrum. Meas., 2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
2022
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
2021
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2021
CoRR, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the ACM CHIL '21: ACM Conference on Health, 2021
2020
MetaPhys: Unsupervised Few-Shot Adaptation for Non-Contact Physiological Measurement.
CoRR, 2020
2019
IEEE Micro, 2019
2018
A Path-constrained Framework for Discriminating Substitutable and Complementary Products in E-commerce.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
2015
Cogn. Comput., 2015
2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013
2012
Neural Comput. Appl., 2012
Proceedings of the 12th IEEE International Conference on Data Mining Workshops, 2012
2011
Proceedings of the IEEE International Conference on Systems, 2011
2010
Proceedings of the 10th International Conference on Intelligent Systems Design and Applications, 2010