Xiaozhe Yao

Orcid: 0000-0002-4661-533X

According to our database1, Xiaozhe Yao authored at least 18 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

2025
Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter.
CoRR, November, 2025

Mixtera: A Data Plane for Foundation Model Training.
CoRR, February, 2025

ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments.
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025

Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

DeltaZip: Efficient Serving of Multiple Full-Model-Tuned LLMs.
Proceedings of the Twentieth European Conference on Computer Systems, 2025


2024
DMLR: Data-centric Machine Learning Research - Past, Present and Future.
J. Data-centric Mach. Learn. Res., 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.
CoRR, 2024

RedPajama: an Open Dataset for Training Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HexGen: Generative Inference of Large Language Model over Heterogeneous Environment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
DeltaZip: Multi-Tenant Language Model Serving via Delta Compression.
CoRR, 2023

HexGen: Generative Inference of Foundation Model over Heterogeneous Decentralized Environment.
CoRR, 2023


2022
SHiFT: An Efficient, Flexible Search Engine for Transfer Learning.
Proc. VLDB Endow., 2022

DataPerf: Benchmarks for Data-Centric AI Development.
CoRR, 2022

2018
CVTron Web: A Versatile Framework for Online Computer Vision Services.
Proceedings of the Services - SERVICES 2018, 2018

2017
Face Based Advertisement Recommendation with Deep Learning: A Case Study.
Proceedings of the Smart Computing and Communication, 2017


  Loading...