Zhengyuan Zhou

Orcid: 0000-0002-0005-9411

According to our database1, Zhengyuan Zhou authored at least 117 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RED-Net: Residual and Enhanced Discriminative Network for Image Steganalysis in the Internet of Medical Things and Telemedicine.
IEEE J. Biomed. Health Informatics, March, 2024

Tensor Recovery With Weighted Tensor Average Rank.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Stochastic contextual bandits with graph feedback: from independence number to MAS number.
CoRR, 2024

2023
Distributionally Robust Batch Contextual Bandits.
Manag. Sci., October, 2023

Structured Sparsity Optimization With Non-Convex Surrogates of $\ell _{2,0}$ℓ2,0-Norm: A Unified Algorithmic Framework.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Offline Multi-Action Policy Learning: Generalization and Optimization.
Oper. Res., January, 2023

A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg.
J. Artif. Intell. Res., 2023

Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods.
CoRR, 2023

On the Foundation of Distributionally Robust Reinforcement Learning.
CoRR, 2023

Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback.
CoRR, 2023

Sample Complexity of Variance-reduced Distributionally Robust Q-learning.
CoRR, 2023

Stochastic Nonsmooth Convex Optimization with Heavy-Tailed Noises.
CoRR, 2023

Near-Optimal High-Probability Convergence for Non-Convex Stochastic Optimization with Variance Reduction.
CoRR, 2023

Single-Trajectory Distributionally Robust Reinforcement Learning.
CoRR, 2023

Breaking the Lower Bound with (Little) Structure: Acceleration in Non-Convex Stochastic Optimization with Heavy-Tailed Noise.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

A Finite Sample Complexity Bound for Distributionally Robust Q-learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Smart Greedy Distributed Energy Allocation: A Random Games Approach.
IEEE Trans. Autom. Control., 2022

Distributed Stochastic Optimization with Large Delays.
Math. Oper. Res., 2022

Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States.
J. Mach. Learn. Res., 2022

No Weighted-Regret Learning in Adversarial Bandits with Delays.
J. Mach. Learn. Res., 2022

Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning.
J. Artif. Intell. Res., 2022

Batched Learning in Generalized Linear Contextual Bandits With General Decision Sets.
IEEE Control. Syst. Lett., 2022

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation.
CoRR, 2022

Optimal Diagonal Preconditioning: Theory and Practice.
CoRR, 2022

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies.
CoRR, 2022

Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Society of Agents: Regret Bounds of Concurrent Thompson Sampling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Distributionally Robust Q-Learning.
Proceedings of the International Conference on Machine Learning, 2022

Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Robust Low-Rank Tensor Recovery with Rectification and Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Robust Power Management via Learning and Game Design.
Oper. Res., 2021

Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback.
CoRR, 2021

Computational Benefits of Intermediate Rewards for Hierarchical Planning.
CoRR, 2021

Policy Learning with Adaptively Collected Data.
CoRR, 2021

No Discounted-Regret Learning in Adversarial Bandits with Delays.
CoRR, 2021

Doubly-Adaptive Thompson Sampling for Multi-Armed and Contextual Bandits.
CoRR, 2021

Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State.
CoRR, 2021

Online Multi-Armed Bandits with Adaptive Inference.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Provably Sample Efficient Reinforcement Learning in Competitive Linear Quadratic Systems.
Proceedings of the 3rd Annual Conference on Learning for Dynamics and Control, 2021

MEOW: A Space-Efficient Nonparametric Bid Shading Algorithm.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
On the Convergence of Mirror Descent beyond Stochastic Convex Programming.
SIAM J. Optim., 2020

Smarter Lions: Efficient Cooperative Pursuit in General Bounded Arenas.
SIAM J. Control. Optim., 2020

policytree: Policy learning via doubly robust empirical welfare maximization over trees.
J. Open Source Softw., 2020

Federated LQR: Learning through Sharing.
CoRR, 2020

Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits.
CoRR, 2020

Federated Learning's Blessing: FedAvg has Linear Speedup.
CoRR, 2020

Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions.
CoRR, 2020

Gradient-free Online Learning in Games with Delayed Rewards.
CoRR, 2020

Distributional Robust Batch Contextual Bandits.
CoRR, 2020

Distributional Soft Actor Critic for Risk Sensitive Learning.
CoRR, 2020

Sequential Batch Learning in Finite-Action Linear Contextual Bandits.
CoRR, 2020

Optimal No-regret Learning in Repeated First-price Auctions.
CoRR, 2020

Diagonal Preconditioning: Theory and Algorithms.
CoRR, 2020

Delay-Adaptive Learning in Generalized Linear Contextual Bandits.
CoRR, 2020

Optimistic Dual Extrapolation for Coherent Non-monotone Variational Inequalities.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits.
Proceedings of the 37th International Conference on Machine Learning, 2020

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games.
Proceedings of the 37th International Conference on Machine Learning, 2020

Gradient-free Online Learning in Continuous Games with Delayed Rewards.
Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness.
Proceedings of the 8th International Conference on Learning Representations, 2020

Delay-Adaptive Distributed Stochastic Optimization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Electrochemiluminescence Sensor Based on Electrospun Three-Dimensional Carbon Nanofibers for the Detection of Difenidol Hydrochloride.
Sensors, 2019

Learning in games with continuous action sets and unknown payoff functions.
Math. Program., 2019

Provably Efficient Reinforcement Learning with Aggregated States.
CoRR, 2019

Learning in Generalized Linear Contextual Bandits with Stochastic Delays.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Online EXP3 Learning in Adversarial Bandits with Delayed Feedback.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Bidding-Based Dynamic Power Pricing Scheme in Smart Grids.
Proceedings of the International Conference on Computing, Networking and Communications, 2019

Anesthesiologist Surgery Assignments using Policy Learning.
Proceedings of the 2019 IEEE International Conference on Communications, 2019

Smart Greedy Distributed Allocation in Microgrids.
Proceedings of the 2019 IEEE International Conference on Communications, 2019

Balanced Linear Contextual Bandits.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Infinite Time Horizon Maximum Causal Entropy Inverse Reinforcement Learning.
IEEE Trans. Autom. Control., 2018

Deterministic and Stochastic Wireless Network Games: Equilibrium, Dynamics, and Price of Anarchy.
Oper. Res., 2018

Efficient path planning algorithms in reach-avoid problems.
Autom., 2018

Learning in Games with Lossy Feedback.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Micro-UAV onboard vehicle detection: architecture and experiments.
Proceedings of the IEEE International Conference on Intelligence and Safety for Robotics, 2018

Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go?
Proceedings of the 35th International Conference on Machine Learning, 2018

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels.
Proceedings of the 35th International Conference on Machine Learning, 2018

Optimal Sensing for Patient Health Monitoring.
Proceedings of the 2018 IEEE International Conference on Communications, 2018

Power Control with Random Delays: Robust Feedback Averaging.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018

Sensing-Constrained Power Control in Digital Health.
Proceedings of the 2018 Annual American Control Conference, 2018

Robustness of Join-the-Shortest-Queue Scheduling to Communication Delay.
Proceedings of the 2018 Annual American Control Conference, 2018

2017
Service Rate Control of Tandem Queues With Power Constraints.
IEEE Trans. Autom. Control., 2017

Multiplayer Reach-Avoid Games via Pairwise Outcomes.
IEEE Trans. Autom. Control., 2017

Improving predictions of pediatric surgical durations with supervised learning.
Int. J. Data Sci. Anal., 2017

MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels.
CoRR, 2017

Mirror descent in non-convex stochastic programming.
CoRR, 2017

Least action routing: Identifying the optimal path in a wireless relay network.
Proceedings of the 28th IEEE Annual International Symposium on Personal, 2017

Longest-queue-first scheduling with intermittent sampling.
Proceedings of the 28th IEEE Annual International Symposium on Personal, 2017

Countering Feedback Delays in Multi-Agent Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Stochastic Mirror Descent in Variationally Coherent Optimization Problems.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Dynamic control of data center network and computation resources.
Proceedings of the 2017 International Conference on Computing, 2017

Stable Power Control in Wireless Networks via Dual Averaging.
Proceedings of the 2017 IEEE Global Communications Conference, 2017

An infinite dimensional model for a many server priority queue.
Proceedings of the 51st Annual Conference on Information Sciences and Systems, 2017

Mirror descent learning in continuous games.
Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

Asynchronous best-response dynamics for resource allocation games in cloud computing.
Proceedings of the 2017 American Control Conference, 2017

Join-the-shortest-queue scheduling with delay.
Proceedings of the 2017 American Control Conference, 2017

An infinite dimensional model for a single server priority queue.
Proceedings of the 2017 American Control Conference, 2017

2016
The Importance of Exploration in Online Marketplaces.
IEEE Internet Comput., 2016

Cooperative pursuit with Voronoi partitions.
Autom., 2016

A Stochastic Stability Characterization of the Foschini-Miljanic Algorithm in Random Wireless Networks.
Proceedings of the 2016 IEEE Global Communications Conference, 2016

Dynamics on Linear Influence Network Games Under Stochastic Environments.
Proceedings of the Decision and Game Theory for Security - 7th International Conference, 2016

Detecting Inaccurate Predictions of Pediatric Surgical Durations.
Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics, 2016

Repeated games for power control in wireless communications: Equilibrium and regret.
Proceedings of the 55th IEEE Conference on Decision and Control, 2016

A game-theoretical formulation of influence networks.
Proceedings of the 2016 American Control Conference, 2016

2015
Scalable Data Center Power Management via a Global Stress Signal.
Proceedings of the 2015 IEEE Global Communications Conference, 2015

Target-rate driven resource sharing in queueing systems.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015

Wireless communications games in fixed and random environments.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015

A general model for resource allocation in utility computing.
Proceedings of the American Control Conference, 2015

2014
Distributed Multi-Depot Routing without Communications.
CoRR, 2014

Evasion of a team of dubins vehicles from a hidden pursuer.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

A path defense approach to the multiplayer reach-avoid game.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

Convexity verification for a hybrid chance constrained method in stochastic control problems.
Proceedings of the American Control Conference, 2014

Multiplayer reach-avoid games via low dimensional solutions and maximum matching.
Proceedings of the American Control Conference, 2014

Hybrid Singular Value Thresholding for Tensor Completion.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Simultaneous Rectification and Alignment via Robust Recovery of Low-rank Tensors.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Evasion as a team against a faster pursuer.
Proceedings of the American Control Conference, 2013

2012
A general, open-loop formulation for reach-avoid games.
Proceedings of the 51th IEEE Conference on Decision and Control, 2012


  Loading...