Yao Fu

Orcid: 0009-0009-0174-8823

Affiliations:
  • University of Edinburgh, Edinburgh, UK


According to our database1, Yao Fu authored at least 7 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing.
CoRR, May, 2025

2024
MoE-CAP: Cost-Accuracy-Performance Benchmarking for Mixture-of-Experts Systems.
CoRR, 2024

MoE-Infinity: Activation-Aware Expert Offloading for Efficient MoE Serving.
CoRR, 2024

ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models.
CoRR, 2024

ServerlessLLM: Low-Latency Serverless Inference for Large Language Models.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

2023
TorchOpt: An Efficient Library for Differentiable Optimization.
J. Mach. Learn. Res., 2023

2022
Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022


  Loading...