Xinhao Cheng
Orcid: 0009-0006-4391-041X
According to our database, Xinhao Cheng authored at least 13 papers between 2014 and 2026.
Bibliography
2026
CoRR, April, 2026
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems.
ACM Comput. Surv., January, 2026
Multi-State Reliability Modeling and Analysis for More Electric Aircraft Electrical Power System Considering State Transition Uncertainty.
IEEE Trans. Reliab., 2026
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026
AdaServe: Accelerating Multi-SLO LLM Serving with SLO-Customized Speculative Decoding.
Proceedings of the 21st European Conference on Computer Systems, 2026
2025
Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs.
CoRR, December, 2025
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025
2024
SpecInfer: Accelerating Generative Large Language Model Serving with Speculative Inference and Token Tree Verification.
PhD thesis, 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning.
CoRR, 2024
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification.
CoRR, 2023
2014
Proceedings of the IEEE 79th Vehicular Technology Conference, 2014