Luo Mai

Orcid: 0000-0002-3594-1092

According to our database1, Luo Mai authored at least 31 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MoE-Infinity: Activation-Aware Expert Offloading for Efficient MoE Serving.
CoRR, 2024

ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models.
CoRR, 2024

2023
Large sequence models for sequential decision-making: a survey.
Frontiers Comput. Sci., December, 2023

TENPLEX: Changing Resources of Deep Learning Jobs using Parallelizable Tensor Collections.
CoRR, 2023

Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness.
CoRR, 2023

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models.
Proceedings of the International Conference on Machine Learning, 2023

2022
TorchOpt: An Efficient Library for Differentiable Optimization.
CoRR, 2022

Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment.
CoRR, 2021

Move Fast and Meet Deadlines: Fine-grained Real-time Stream Processing with Cameo.
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

Fast and Flexible Human Pose Estimation with HyperPose.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Efficient Reinforcement Learning Development with RLzoo.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library.
CoRR, 2020

KungFu: Making Training in Distributed Machine Learning Adaptive.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Spotnik: Designing Distributed Machine Learning for Transient Cloud Resources.
Proceedings of the 12th USENIX Workshop on Hot Topics in Cloud Computing, 2020

2019
Taming Hyper-parameters in Deep Learning Systems.
ACM SIGOPS Oper. Syst. Rev., 2019

Crossbow: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers.
Proc. VLDB Endow., 2019

2018
Chi: A Scalable and Programmable Control Plane for Distributed Stream Processing Systems.
Proc. VLDB Endow., 2018

2017
Towards efficient big data processing in data centres.
PhD thesis, 2017

Extending programs with debug-related features, with application to hardware development.
CoRR, 2017

Emu: Rapid Prototyping of Networking Services.
Proceedings of the 2017 USENIX Annual Technical Conference, 2017

TensorLayer: A Versatile Library for Efficient Deep Learning Development.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

2016
FLICK: Developing and Running Application-Specific Network Services.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016

Towards a Network Marketplace in a Cloud.
Proceedings of the 8th USENIX Workshop on Hot Topics in Cloud Computing, 2016

2015
Optimizing Network Performance in Distributed Machine Learning.
Proceedings of the 7th USENIX Workshop on Hot Topics in Cloud Computing, 2015

2014
NetAgg: Using Middleboxes for Application-specific On-path Aggregation in Data Centres.
Proceedings of the 10th ACM International on Conference on emerging Networking Experiments and Technologies, 2014

2013
Supporting application-specific in-network processing in data centres.
Proceedings of the ACM SIGCOMM 2013 Conference, 2013

2012
Rendezvous Data Collection Using a Mobile Element in Heterogeneous Sensor Networks.
Int. J. Distributed Sens. Networks, 2012

2011
Load Balanced Rendezvous Data Collection in Wireless Sensor Networks.
Proceedings of the IEEE 8th International Conference on Mobile Adhoc and Sensor Systems, 2011


  Loading...