Xiangxi Mo

ORCID: 0009-0007-4427-0036

According to our database¹, Xiangxi Mo authored at least 18 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend: Book, In proceedings, Article, PhD thesis, Dataset, Other

Bibliography

2025

When to Reason: Semantic Router for vLLM.

CoRR, October, 2025

SkyStore: Cost-Optimized Object Storage Across Regions and Clouds.

Proc. VLDB Endow., March, 2025

LLMs Can Easily Learn to Reason from Demonstrations: Structure, not content, is what matters!

CoRR, February, 2025

Jenga: Effective Memory Management for Serving LLM with Heterogeneity.

Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, 2025

Optimizing LLM Queries in Relational Data Analytics Workloads.

Luis Gaspar Schroeder

Proceedings of the Eighth Conference on Machine Learning and Systems, 2025

Language Models Can Easily Learn to Reason from Demonstrations.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024

Pie: Pooling CPU Memory for LLM Inference.

CoRR, 2024

Optimizing Speculative Decoding for Serving Large Language Models Using Goodput.

CoRR, 2024

Optimizing LLM Queries in Relational Workloads.

CoRR, 2024

Cloudcast: High-Throughput, Cost-Aware Overlay Multicast in the Cloud.

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

2023

RALF: Accuracy-Aware Scheduling for Feature Store Maintenance.

Joseph M. Hellerstein

Natacha Crooks

Joseph E. Gonzalez

Proc. VLDB Endow., November, 2023

2022

Context-Aware Streaming Perception in Dynamic Environments.

Gur-Eyal Sela

Ionel Gog

Justin Wong

Kumar Krishna Agrawal

Proceedings of the Computer Vision - ECCV 2022, 2022

2020

Towards Scalable Dataframe Systems.

Joseph M. Hellerstein

Anthony D. Joseph

Aditya G. Parameswaran

Proc. VLDB Endow., 2020

InferLine: latency-aware provisioning and scaling for prediction serving pipelines.

Proceedings of the SoCC '20: ACM Symposium on Cloud Computing, 2020

2019

Pay Attention to Convolution Filters: Towards Fast and Accurate Fine-Grained Transfer Learning.

Xiangxi Mo

Ruizhe Cheng

Tianyi Fang

CoRR, 2019

The OoO VLIW JIT Compiler for GPU Inference.

CoRR, 2019

Dynamic Space-Time Scheduling for GPU Inference.

CoRR, 2019

2018

InferLine: ML Inference Pipeline Composition Framework.

CoRR, 2018
