We stand with Ukraine

We stand with Ukraine

Weichao Mao

Orcid: 0000-0001-8301-4173

According to our database¹, Weichao Mao authored at least 32 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Model-Free Nonstationary Reinforcement Learning: Near-Optimal Regret and Applications in Multiagent Reinforcement Learning and Inventory Control.

[DOI]

,

,

,

David Simchi-Levi

,

Manag. Sci., 2025

Teaching Language Models to Critique via Reinforcement Learning.

[DOI]

,

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Decision Transformer as a Foundation Model for Partially Observable Continuous Control.

[DOI]

Xiangyuan Zhang

,

,

,

Proceedings of the 2025 American Control Conference, 2025

2024

On Designing Market Model and Pricing Mechanisms for IoT Data Exchange.

[DOI]

,

,

,

IEEE Trans. Mob. Comput., November, 2024

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction.

[DOI]

,

,

,

,

,

,

Hubertus Franke

,

Zbigniew T. Kalbarczyk

,

,

Ravishankar K. Iyer

CoRR, 2024

Õ(T<sup>-1</sup>) Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games.

[DOI]

,

,

,

Hubertus Franke

,

Zbigniew Kalbarczyk

,

CoRR, 2024

Power-aware Deep Learning Model Serving with μ-Serve.

[DOI]

,

,

,

,

,

,

Hubertus Franke

,

Zbigniew Kalbarczyk

,

,

Ravishankar K. Iyer

Proceedings of the 2024 USENIX Annual Technical Conference, 2024

FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms.

[DOI]

,

,

,

,

,

Hubertus Franke

,

Zbigniew Kalbarczyk

,

,

Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Controlgym: Large-scale control environments for benchmarking reinforcement learning algorithms.

[DOI]

Xiangyuan Zhang

,

,

,

Mouhacine Benosman

,

Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

$\widetilde{O}(T^{-1})$ {C}onvergence to (coarse) correlated equilibria in full-information general-sum markov games.

[DOI]

,

,

,

Hubertus Franke

,

Zbigniew Kalbarczyk

,

Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

When Green Computing Meets Performance and Resilience SLOs.

[DOI]

,

,

,

,

Hubertus Franke

,

Chandra Narayanaswami

,

Zbigniew Kalbarczyk

,

,

Ravishankar K. Iyer

Proceedings of the 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2024

2023

Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games.

[DOI]

,

Dyn. Games Appl., March, 2023

Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms.

[DOI]

Xiangyuan Zhang

,

,

,

Mouhacine Benosman

,

CoRR, 2023

Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks.

[DOI]

,

,

Michael Louis Iuzzolino

,

CoRR, 2023

AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems.

[DOI]

,

,

,

Hubertus Franke

,

,

Zbigniew T. Kalbarczyk

,

,

Ravishankar K. Iyer

Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity.

[DOI]

,

,

,

Hubertus Franke

,

Zbigniew Kalbarczyk

,

Ravishankar K. Iyer

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

A Mean-Field Game Approach to Cloud Resource Management with Function Approximation.

[DOI]

,

,

,

Hubertus Franke

,

Zbigniew Kalbarczyk

,

Ravishankar K. Iyer

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning.

[DOI]

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Reinforcement learning for resource management in multi-tenant serverless platforms.

[DOI]

,

,

,

,

Hubertus Franke

,

Zbigniew T. Kalbarczyk

,

,

Ravishankar K. Iyer

Proceedings of the EuroMLSys '22: Proceedings of the 2nd European Workshop on Machine Learning and Systems, Rennes, France, April 5, 2022

SIMPPO: a scalable and incremental online learning framework for serverless resource management.

[DOI]

,

,

,

,

Hubertus Franke

,

Zbigniew T. Kalbarczyk

,

,

Ravishankar K. Iyer

Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2021

Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration.

[DOI]

,

,

,

CoRR, 2021

Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs.

[DOI]

,

,

,

David Simchi-Levi

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Semiparametric Information State Embedding for Policy Search under Imperfect Information.

[DOI]

,

,

,

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020

Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs.

[DOI]

,

,

,

David Simchi-Levi

,

CoRR, 2020

POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis.

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning.

[DOI]

,

,

,

Proceedings of the 59th IEEE Conference on Decision and Control, 2020

2019

A fast and anti-matchability matching algorithm for content-based publish/subscribe systems.

[DOI]

,

,

,

,

,

,

Comput. Networks, 2019

Adjusting Matching Algorithm to Adapt to Workload Fluctuations in Content-based Publish/Subscribe Systems.

[DOI]

,

,

,

Frederic Le Mouel

,

Proceedings of the 2019 IEEE Conference on Computer Communications, 2019

Pricing for Revenue Maximization in IoT Data Markets: An Information Design Perspective.

[DOI]

,

,

Proceedings of the 2019 IEEE Conference on Computer Communications, 2019

Challenges and Opportunities in IoT Data Markets.

[DOI]

,

,

,

Proceedings of the Fourth International Workshop on Social Sensing, 2019

2018

Adjusting Matching Algorithm to Adapt to Dynamic Subscriptions in Content-Based Publish/Subscribe Systems.

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018

Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations.

[DOI]

,

,

,

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Loading...