Yaozhong Gan

According to our database1, Yaozhong Gan authored at least 15 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ORAL: Adaptive Gap Increasing for Advantage Learning via Occam's Razor Principle.
IEEE Trans. Neural Networks Learn. Syst., April, 2026

MARPO: A Reflective Policy Optimization for Multi-Agent Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World.
CoRR, December, 2025

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning.
CoRR, 2024

The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective.
CoRR, 2024

Transductive Off-policy Proximal Policy Optimization.
CoRR, 2024

Autoencoder Reconstruction Model for Long-Horizon Exploration.
Proceedings of the International Joint Conference on Neural Networks, 2024

Reflective Policy Optimization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

PAE: Reinforcement Learning from External Knowledge for Efficient Exploration.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022
Alleviating the estimation bias of deep deterministic policy gradient via co-regularization.
Pattern Recognit., 2022

Robust Action Gap Increasing with Clipped Advantage Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Smoothing Advantage Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Stabilizing Q Learning Via Soft Mellowmax Operator.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2019
Trust Region-Guided Proximal Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019


  Loading...