Yaozhong Gan

According to our database¹, Yaozhong Gan authored at least 15 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2026

ORAL: Adaptive Gap Increasing for Advantage Learning via Occam's Razor Principle.

IEEE Trans. Neural Networks Learn. Syst., April, 2026

MARPO: A Reflective Policy Optimization for Multi-Agent Reinforcement Learning.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World.

CoRR, December, 2025

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning.

CoRR, 2024

The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective.

CoRR, 2024

Transductive Off-policy Proximal Policy Optimization.

CoRR, 2024

Autoencoder Reconstruction Model for Long-Horizon Exploration.

Proceedings of the International Joint Conference on Neural Networks, 2024

Reflective Policy Optimization.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

PAE: Reinforcement Learning from External Knowledge for Efficient Exploration.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022

Alleviating the estimation bias of deep deterministic policy gradient via co-regularization.

Pattern Recognit., 2022

Robust Action Gap Increasing with Clipped Advantage Learning.

Zhe Zhang, Yaozhong Gan, Xiaoyang Tan

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Smoothing Advantage Learning.

Yaozhong Gan, Zhe Zhang, Xiaoyang Tan

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Stabilizing Q Learning Via Soft Mellowmax Operator.

Yaozhong Gan, Zhe Zhang, Xiaoyang Tan

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2019

Trust Region-Guided Proximal Policy Optimization.

Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
