Kaiqing Zhang

According to our database1, Kaiqing Zhang authored at least 62 papers between 2014 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2022
The Complexity of Markov Equilibrium in Stochastic Games.
CoRR, 2022

Globally Convergent Policy Search over Dynamic Filters for Output Estimation.
CoRR, 2022

Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence.
CoRR, 2022

Do Differentiable Simulators Give Better Policy Gradients?
CoRR, 2022

Fully asynchronous policy evaluation in distributed reinforcement learning over networks.
Autom., 2022

2021
The Effect of Low-Intensity Transcranial Ultrasound Stimulation on Neural Oscillation and Hemodynamics in the Mouse Visual Cortex Depends on Anesthesia Level and Ultrasound Intensity.
IEEE Trans. Biomed. Eng., 2021

Finite-Sample Analysis for Decentralized Batch Multiagent Reinforcement Learning With Networked Agents.
IEEE Trans. Autom. Control., 2021

Policy Optimization for ℋ<sub>2</sub> Linear Control with ℋ<sub>∞</sub> Robustness Guarantee: Implicit Regularization and Global Convergence.
SIAM J. Control. Optim., 2021

Influence of behavioral state on the neuromodulatory effect of low-intensity transcranial ultrasound stimulation on hippocampal CA1 in mouse.
NeuroImage, 2021

Decentralized multi-agent reinforcement learning with networked agents: recent advances.
Frontiers Inf. Technol. Electron. Eng., 2021

Independent Learning in Stochastic Games.
CoRR, 2021

Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration.
CoRR, 2021

Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity.
CoRR, 2021

Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Decentralized Q-learning in Zero-sum Markov Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reinforcement Learning for Cost-Aware Markov Decision Processes.
Proceedings of the 38th International Conference on Machine Learning, 2021

Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Safe Multi-agent Control with Decentralized Neural Barrier Certificates.
Proceedings of the 9th International Conference on Learning Representations, 2021

Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies.
SIAM J. Control. Optim., 2020

Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup.
CoRR, 2020

Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs.
CoRR, 2020

Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks.
CoRR, 2020

Distributed learning of average belief over networks using sequential observations.
Autom., 2020

Robust Multi-Agent Reinforcement Learning with Model Uncertainty.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games.
Proceedings of the 59th IEEE Conference on Decision and Control, 2020

Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the 59th IEEE Conference on Decision and Control, 2020

Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games.
Proceedings of the 2020 American Control Conference, 2020

2019
Projected Stochastic Primal-Dual Method for Constrained Online Learning With Kernels.
IEEE Trans. Signal Process., 2019

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms.
CoRR, 2019

Stochastic Convergence Results for Regularized Actor-Critic Methods.
CoRR, 2019

A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning.
CoRR, 2019

Non-Cooperative Inverse Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Policy Search in Infinite-Horizon Discounted Reinforcement Learning: Advances through Connections to Non-Convex Optimization : Invited Presentation.
Proceedings of the 53rd Annual Conference on Information Sciences and Systems, 2019

Convergence and Iteration Complexity of Policy Gradient Method for Infinite-horizon Reinforcement Learning.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019

A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019

Online Planning for Decentralized Stochastic Control with Partial History Sharing.
Proceedings of the 2019 American Control Conference, 2019

2018
Dynamic Power Distribution System Management With a Locally Connected Communication Network.
IEEE J. Sel. Top. Signal Process., 2018

Communication-Efficient Distributed Reinforcement Learning.
CoRR, 2018

Finite-Sample Analyses for Fully Decentralized Multi-Agent Reinforcement Learning.
CoRR, 2018

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents.
Proceedings of the 35th International Conference on Machine Learning, 2018

Networked Multi-Agent Reinforcement Learning in Continuous Spaces.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018

A Finite Sample Analysis of the Actor-Critic Algorithm.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018

Distributed Equilibrium-Learning for Power Network Voltage Control With a Locally Connected Communication Network.
Proceedings of the 2018 Annual American Control Conference, 2018

Nonlinear Structured Signal Estimation in High Dimensions via Iterative Hard Thresholding.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Consumption Behavior Analytics-Aided Energy Forecasting and Dispatch.
IEEE Intell. Syst., 2017

Parameter Sensitivity and Dependency Analysis for the WECC Dynamic Composite Load Model.
Proceedings of the 50th Hawaii International Conference on System Sciences, 2017

A game-theoretic approach for communication-free voltage-VAR optimization.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

2016
On the performance of map-aware cooperative localization.
Proceedings of the 2016 IEEE International Conference on Communications, 2016

2015
Indoor Localization Algorithm For Smartphones.
CoRR, 2015

Enhanced multi-parameter cognitive architecture for future wireless communications.
IEEE Commun. Mag., 2015

Spectrum prediction and channel selection for sensing-based spectrum sharing scheme using online learning techniques.
Proceedings of the 26th IEEE Annual International Symposium on Personal, 2015

An area state-aided indoor localization algorithm and its implementation.
Proceedings of the IEEE International Conference on Communication, 2015

Sequential Detection Aided Modulation Classification in Cognitive Radio Networks.
Proceedings of the 2015 IEEE Global Communications Conference, 2015

2014
Enhanced Multi-Parameter Cognitive Architecture for Future Wireless Communications.
CoRR, 2014

Machine learning techniques for spectrum sensing when primary user has multiple transmit powers.
Proceedings of the IEEE International Conference on Communication Systems, 2014


  Loading...