Chao Jin

Orcid: 0009-0006-1355-4995

Affiliations:
  • Peking University, Beijing, China


According to our database1, Chao Jin authored at least 17 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
BigMac: Breaking the Pareto Frontier of Compute and Memory in Multimodal LLM Training.
CoRR, May, 2026

ReLibra: Routing-Replay-Guided Load Balancing for MoE Training in Reinforcement Learning.
CoRR, May, 2026

Heddle: A Distributed Orchestration System for Agentic RL Rollout.
CoRR, March, 2026

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation.
ACM Trans. Comput. Syst., February, 2026

HydraServe: Minimizing Cold Start Latency for Serverless LLM Serving in Public Clouds.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.
CoRR, May, 2025

MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism.
CoRR, April, 2025

Towards Swift Serverless LLM Cold Starts with ParaServe.
CoRR, February, 2025

FaaSPR: Latency-Oriented Placement and Routing Optimization for Serverless Workflow Processing.
IEEE Trans. Netw., 2025

MegaScale-Infer: Efficient Mixture-of-Experts Model Serving with Disaggregated Expert Parallelism.
Proceedings of the ACM SIGCOMM 2025 Conference, 2025

2024
Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters.
IEEE Trans. Parallel Distributed Syst., September, 2024

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation.
CoRR, 2024

Jolteon: Unleashing the Promise of Serverless for Serverless Workflows.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

2023
Ditto: Efficient Serverless Analytics with Elastic Parallelism.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Fast, Approximate Vector Queries on Very Large Unstructured Datasets.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

2022
Melon: breaking the memory wall for resource-efficient on-device machine learning.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022


  Loading...