We stand with Ukraine

We stand with Ukraine

Chao Jin

Orcid: 0009-0006-1355-4995

Affiliations:

Peking University, Beijing, China

According to our database¹, Chao Jin authored at least 17 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2026

BigMac: Breaking the Pareto Frontier of Compute and Memory in Multimodal LLM Training.

[DOI]

,

,

Shenglong Zhang

,

,

,

,

,

,

,

,

CoRR, May, 2026

ReLibra: Routing-Replay-Guided Load Balancing for MoE Training in Reinforcement Learning.

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2026

Heddle: A Distributed Orchestration System for Agentic RL Rollout.

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2026

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation.

[DOI]

,

,

,

,

,

,

ACM Trans. Comput. Syst., February, 2026

HydraServe: Minimizing Cold Start Latency for Serverless LLM Serving in Public Clouds.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 21st European Conference on Computer Systems, 2026

2025

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism.

[DOI]

,

,

,

,

Cesar A. Stuardo

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

Towards Swift Serverless LLM Cold Starts with ParaServe.

[DOI]

,

,

,

,

,

,

CoRR, February, 2025

FaaSPR: Latency-Oriented Placement and Routing Optimization for Serverless Workflow Processing.

[DOI]

,

,

,

,

IEEE Trans. Netw., 2025

MegaScale-Infer: Efficient Mixture-of-Experts Model Serving with Disaggregated Expert Parallelism.

[DOI]

,

,

,

,

Cesar A. Stuardo

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the ACM SIGCOMM 2025 Conference, 2025

2024

Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters.

[DOI]

,

,

Mosharaf Chowdhury

,

,

,

IEEE Trans. Parallel Distributed Syst., September, 2024

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation.

[DOI]

,

,

,

,

,

,

CoRR, 2024

Jolteon: Unleashing the Promise of Serverless for Serverless Workflows.

[DOI]

,

,

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

2023

Ditto: Efficient Serverless Analytics with Elastic Parallelism.

[DOI]

,

,

,

,

,

,

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Fast, Approximate Vector Queries on Very Large Unstructured Datasets.

[DOI]

,

,

,

,

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

2022

Melon: breaking the memory wall for resource-efficient on-device machine learning.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

Loading...