Xinhao Cheng

Orcid: 0009-0006-4391-041X

According to our database, Xinhao Cheng authored at least 13 papers between 2014 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel.
CoRR, April, 2026

Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems.
ACM Comput. Surv., January, 2026

Multi-State Reliability Modeling and Analysis for More Electric Aircraft Electrical Power System Considering State Transition Uncertainty.
IEEE Trans. Reliab., 2026

FlexLLM: Token-Level Co-Serving of LLM Inference and Finetuning with SLO Guarantees.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

AdaServe: Accelerating Multi-SLO LLM Serving with SLO-Customized Speculative Decoding.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs.
CoRR, December, 2025

Mirage: A Multi-Level Superoptimizer for Tensor Programs.
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025

2024
SpecInfer: Accelerating Generative Large Language Model Serving with Speculative Inference and Token Tree Verification.
PhD thesis, 2024

A Multi-Level Superoptimizer for Tensor Programs.
CoRR, 2024

FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning.
CoRR, 2024

SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification.
CoRR, 2023

2014
Green Traffic Compression in Wireless Sensor Networks.
Proceedings of the IEEE 79th Vehicular Technology Conference, 2014

