Zaiwei Chen

Orcid: 0000-0001-9915-5595

According to our database¹, Zaiwei Chen authored at least 36 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Non-Asymptotic Convergence of Stochastic Iterative Algorithms: A Lyapunov Framework.

[DOI]

,

Siva Theja Maguluri

CoRR, May, 2026

Achieving ε⁻² Sample Complexity for Single-Loop Actor-Critic under Minimal Assumptions.

[DOI]

,

CoRR, May, 2026

Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework.

[DOI]

,

CoRR, May, 2026

Bridging the Gap Between Average and Discounted TD Learning.

[DOI]

,

,

Ioannis Ch. Paschalidis

,

CoRR, May, 2026

Natural Hypergradient Descent: Algorithm Design, Convergence Analysis, and Parallel Implementation.

[DOI]

,

,

,

CoRR, February, 2026

Achieving ϵ⁻² Dependence for Average-Reward Q-Learning with a New Contraction Principle.

[DOI]

,

,

,

CoRR, January, 2026

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies.

[DOI]

,

Proceedings of the Abstracts of the 2026 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2026

2025

A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms.

[DOI]

,

,

,

,

Siva Theja Maguluri

CoRR, February, 2025

An approximate policy iteration viewpoint of actor-critic algorithms.

[DOI]

,

Siva Theja Maguluri

Autom., 2025

Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach.

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization.

[DOI]

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Approximate Global Convergence of Independent Learning in Multi-Agent Systems.

[DOI]

,

,

,

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024

A Lyapunov Theory for Finite-Sample Guarantees of Markovian Stochastic Approximation.

[DOI]

,

Siva Theja Maguluri

,

Sanjay Shakkottai

,

Karthikeyan Shanmugam

Oper. Res., 2024

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games.

[DOI]

,

,

,

Asuman E. Ozdaglar

,

CoRR, 2024

Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games.

[DOI]

,

,

,

Asuman E. Ozdaglar

,

Proceedings of the 25th ACM Conference on Economics and Computation, 2024

Last-Iterate Convergence for Generalized Frank-Wolfe in Monotone Variational Inequalities.

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Target Network and Truncation Overcome the Deadly Triad in Q-Learning.

[DOI]

,

John-Paul Clarke

,

Siva Theja Maguluri

SIAM J. Math. Data Sci., December, 2023

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise.

[DOI]

,

Siva Theja Maguluri

,

Martin Zubeldia

CoRR, 2023

Convergence rates for localized actor-critic in networked Markov potential games.

[DOI]

,

,

,

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning.

[DOI]

,

,

,

,

,

Abstract Proceedings of the 2023 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2023

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games.

[DOI]

,

,

,

Asuman E. Ozdaglar

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

A Unified Lyapunov Framework for Finite-Sample Analysis of Reinforcement Learning Algorithms.

[DOI]

SIGMETRICS Perform. Evaluation Rev., December, 2022

A Unified Lyapunov Framework for Finite-Sample Analysis of Reinforcement Learning Algorithms.

[DOI]

PhD thesis, 2022

Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation.

[DOI]

,

Sajad Khodadadian

,

Siva Theja Maguluri

IEEE Control. Syst. Lett., 2022

Target Network and Truncation Overcome The Deadly triad in Q-Learning.

[DOI]

,

John-Paul Clarke

,

Siva Theja Maguluri

CoRR, 2022

Finite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learning.

[DOI]

,

,

,

John-Paul Clarke

,

Siva Theja Maguluri

Autom., 2022

Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization.

[DOI]

,

,

Siva Theja Maguluri

Proceedings of the SIGMETRICS/PERFORMANCE '22: ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, Mumbai, India, June 6, 2022

Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation.

[DOI]

,

Siva Theja Maguluri

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Nested Vehicle Routing Problem: Optimizing Drone-Truck Surveillance Operations.

[DOI]

,

,

John-Paul Clarke

,

CoRR, 2021

A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants.

[DOI]

,

Siva Theja Maguluri

,

Sanjay Shakkottai

,

Karthikeyan Shanmugam

CoRR, 2021

Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators.

[DOI]

,

Siva Theja Maguluri

,

Sanjay Shakkottai

,

Karthikeyan Shanmugam

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm.

[DOI]

Sajad Khodadadian

,

,

Siva Theja Maguluri

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes.

[DOI]

,

Siva Theja Maguluri

,

Sanjay Shakkottai

,

Karthikeyan Shanmugam

CoRR, 2020

Finite-Sample Analysis of Contractive Stochastic Approximation Using Smooth Convex Envelopes.

[DOI]

,

Siva Theja Maguluri

,

Sanjay Shakkottai

,

Karthikeyan Shanmugam

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Finite-Time Analysis of Q-Learning with Linear Function Approximation.

[DOI]

,

,

,

Siva Theja Maguluri

,

John-Paul Clarke

CoRR, 2019
