Kaiqing Zhang

IEEE Trans. Biomed. Eng., 2021

Finite-Sample Analysis for Decentralized Batch Multiagent Reinforcement Learning With Networked Agents.

IEEE Trans. Autom. Control., 2021

Influence of behavioral state on the neuromodulatory effect of low-intensity transcranial ultrasound stimulation on hippocampal CA1 in mouse.

NeuroImage, 2021

Decentralized multi-agent reinforcement learning with networked agents: recent advances.

Frontiers Inf. Technol. Electron. Eng., 2021

Independent Learning in Stochastic Games.

Asuman E. Ozdaglar

Muhammed O. Sayin

CoRR, 2021

Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration.

CoRR, 2021

Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity.

CoRR, 2021

Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Decentralized Q-learning in Zero-sum Markov Games.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reinforcement Learning for Cost-Aware Markov Decision Processes.

Proceedings of the 38th International Conference on Machine Learning, 2021

Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs.

Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Safe Multi-agent Control with Decentralized Neural Barrier Certificates.

Proceedings of the 9th International Conference on Learning Representations, 2021

Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies.

SIAM J. Control. Optim., 2020

Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup.

CoRR, 2020

Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs.

CoRR, 2020

Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks.

CoRR, 2020

Distributed learning of average belief over networks using sequential observations.

Autom., 2020

Robust Multi-Agent Reinforcement Learning with Model Uncertainty.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems.

Bin Hu

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Policy Optimization for H₂ Linear Control with H∞ Robustness Guarantee: Implicit Regularization and Global Convergence.

Bin Hu

Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games.

Muhammad Aneeq uz Zaman

Erik Miehling

Proceedings of the 59th IEEE Conference on Decision and Control, 2020

Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning.

Proceedings of the 59th IEEE Conference on Decision and Control, 2020

Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games.

Muhammad Aneeq uz Zaman

Erik Miehling

Proceedings of the 2020 American Control Conference, 2020

2019

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms.

CoRR, 2019

Stochastic Convergence Results for Regularized Actor-Critic Methods.

CoRR, 2019

A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning.

CoRR, 2019

Non-Cooperative Inverse Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Policy Search in Infinite-Horizon Discounted Reinforcement Learning: Advances through Connections to Non-Convex Optimization: Invited Presentation.

Proceedings of the 53rd Annual Conference on Information Sciences and Systems, 2019

Convergence and Iteration Complexity of Policy Gradient Method for Infinite-horizon Reinforcement Learning.

Proceedings of the 58th IEEE Conference on Decision and Control, 2019

A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning.

Proceedings of the 58th IEEE Conference on Decision and Control, 2019

Online Planning for Decentralized Stochastic Control with Partial History Sharing.

Erik Miehling

Proceedings of the 2019 American Control Conference, 2019

2018

Dynamic Power Distribution System Management With a Locally Connected Communication Network.

IEEE J. Sel. Top. Signal Process., 2018

Communication-Efficient Distributed Reinforcement Learning.

Tianyi Chen

Georgios B. Giannakis

CoRR, 2018

Finite-Sample Analyses for Fully Decentralized Multi-Agent Reinforcement Learning.

CoRR, 2018

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents.

Proceedings of the 35th International Conference on Machine Learning, 2018

Projected Stochastic Primal-Dual Method for Constrained Online Learning with Kernels.

Proceedings of the 57th IEEE Conference on Decision and Control, 2018

Networked Multi-Agent Reinforcement Learning in Continuous Spaces.

Proceedings of the 57th IEEE Conference on Decision and Control, 2018

A Finite Sample Analysis of the Actor-Critic Algorithm.

Proceedings of the 57th IEEE Conference on Decision and Control, 2018

Distributed Equilibrium-Learning for Power Network Voltage Control With a Locally Connected Communication Network.

Proceedings of the 2018 Annual American Control Conference, 2018

Nonlinear Structured Signal Estimation in High Dimensions via Iterative Hard Thresholding.

Zhaoran Wang

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Consumption Behavior Analytics-Aided Energy Forecasting and Dispatch.

IEEE Intell. Syst., 2017

Parameter Sensitivity and Dependency Analysis for the WECC Dynamic Composite Load Model.

Siming Guo

Hao Zhu

Proceedings of the 50th Hawaii International Conference on System Sciences, 2017

A game-theoretic approach for communication-free voltage-VAR optimization.

Hao Zhu

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

2016

On the performance of map-aware cooperative localization.

Yuan Shen

Moe Z. Win

Proceedings of the 2016 IEEE International Conference on Communications, 2016

2015

Indoor Localization Algorithm For Smartphones.

CoRR, 2015

Enhanced multi-parameter cognitive architecture for future wireless communications.

Feifei Gao

IEEE Commun. Mag., 2015

Spectrum prediction and channel selection for sensing-based spectrum sharing scheme using online learning techniques.

Proceedings of the 26th IEEE Annual International Symposium on Personal, 2015

An area state-aided indoor localization algorithm and its implementation.

Proceedings of the IEEE International Conference on Communication, 2015

Sequential Detection Aided Modulation Classification in Cognitive Radio Networks.

Proceedings of the 2015 IEEE Global Communications Conference, 2015

2014

Enhanced Multi-Parameter Cognitive Architecture for Future Wireless Communications.

Feifei Gao

Qihui Wu

CoRR, 2014

Machine learning techniques for spectrum sensing when primary user has multiple transmit powers.
