Zihao Ye
Orcid: 0000-0002-6450-8108Affiliations:
- NVIDIA, USA
- University of Washington, School of Computer Science and Engineering, Seattle, WA, USA (PhD 2025)
- Amazon Web Services, Shanghai AI Lab, Shanghai, China (2019 - 2021)
According to our database1,
Zihao Ye authored at least 25 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
CoRR, April, 2026
CoRR, March, 2026
SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits.
CoRR, March, 2026
CoRR, January, 2026
CoRR, January, 2026
2025
Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs.
CoRR, December, 2025
Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs.
CoRR, December, 2025
Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding.
CoRR, December, 2025
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval.
CoRR, February, 2025
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
2020
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020
Proceedings of the International Conference for High Performance Computing, 2020
2019
CoRR, 2019