Chen-Yu Wei

Orcid: 0000-0001-8735-4773

According to our database1, Chen-Yu Wei authored at least 46 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data.
CoRR, 2024

Tractable Local Equilibria in Non-Concave Games.
CoRR, 2024

Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games.
CoRR, 2024

2023
Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback.
CoRR, 2023

Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games.
CoRR, 2023

First- and Second-Order Bounds for Adversarial Linear Contextual Bandits.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Best of Both Worlds Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2023

Refined Regret for Adversarial MDPs with Linear Function Approximation.
Proceedings of the International Conference on Machine Learning, 2023

A Blackbox Approach to Best of Both Worlds in Bandits and Beyond.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

A Unified Algorithm for Stochastic Path Problems.
Proceedings of the International Conference on Algorithmic Learning Theory, 2023

2022
Personalization Improves Privacy-Accuracy Tradeoffs in Federated Optimization.
CoRR, 2022

Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence.
Proceedings of the International Conference on Machine Learning, 2022

Personalization Improves Privacy-Accuracy Tradeoffs in Federated Learning.
Proceedings of the International Conference on Machine Learning, 2022

A Model Selection Approach for Corruption Robust Reinforcement Learning.
Proceedings of the International Conference on Algorithmic Learning Theory, 29 March, 2022

Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure.
Proceedings of the International Conference on Algorithmic Learning Theory, 29 March, 2022

2021
Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously.
Proceedings of the 38th International Conference on Machine Learning, 2021

Linear Last-iterate Convergence in Constrained Saddle-point Optimization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Last-iterate Convergence of Decentralized Optimistic Gradient Descent/Ascent in Infinite-horizon Competitive Markov Games.
Proceedings of the Conference on Learning Theory, 2021

Non-stationary Reinforcement Learning without Prior Knowledge: an Optimal Black-box Approach.
Proceedings of the Conference on Learning Theory, 2021

Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications.
Proceedings of the Conference on Learning Theory, 2021

Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition.
Proceedings of the Conference on Learning Theory, 2021

Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds.
Proceedings of the Algorithmic Learning Theory, 2021

Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Linear Last-iterate Convergence for Matrix Games and Stochastic Games.
CoRR, 2020

Federated Residual Learning.
CoRR, 2020

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes.
Proceedings of the 37th International Conference on Machine Learning, 2020

Taking a hint: How to leverage loss predictors in contextual bandits?
Proceedings of the Conference on Learning Theory, 2020

2019
Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator.
CoRR, 2019

Beating Stochastic and Adversarial Semi-bandits Optimally and Simultaneously.
Proceedings of the 36th International Conference on Machine Learning, 2019

Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case.
Proceedings of the 36th International Conference on Machine Learning, 2019

A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal and Parameter-free.
Proceedings of the Conference on Learning Theory, 2019

Improved Path-length Regret Bounds for Bandits.
Proceedings of the Conference on Learning Theory, 2019

Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information.
Proceedings of the Conference on Learning Theory, 2019

2018
Multi-Cell Cooperative Scheduling for Network Utility Maximization With User Equipment Side Interference Cancellation.
IEEE Trans. Wirel. Commun., 2018

Efficient Online Portfolio with Logarithmic Regret.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

More Adaptive Algorithms for Adversarial Bandits.
Proceedings of the Conference On Learning Theory, 2018

Efficient Contextual Bandits in Non-stationary Worlds.
Proceedings of the Conference On Learning Theory, 2018

2017
Online Reinforcement Learning in Stochastic Games.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
Tracking the Best Expert in Non-stationary Stochastic Environments.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Adaptive measurement for energy efficient mobility management in ultra-dense small cell networks.
Proceedings of the 2016 IEEE International Conference on Communications, 2016


  Loading...