Chenjun Xiao

According to our database¹, Chenjun Xiao authored at least 37 papers between 2012 and 2026.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Large Language Model-Enhanced Multi-Armed Bandits.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models (Abstract Reprint).

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies.

[BibT_eX]

[DOI]

CoRR, August, 2025

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2025

β-DQN: Improving Deep Q-Learning By Evolving the Behavior.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Diffusion Spectral Representation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Iteratively Refined Behavior Regularization for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Rethinking Decision Transformer via Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

HarmonyDream: Task Harmonization Inside World Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.

[BibT_eX]

[DOI]

Christopher K. Harris

A. Rupam Mahmood

Dale Schuurmans

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

In-Sample Policy Iteration for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning.

[BibT_eX]

[DOI]

Janarthanan Rajendran

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

The In-Sample Softmax for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Latent Variable Representation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Understanding and Leveraging Overparameterization in Recursive Value Estimation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

The Curse of Passive Data Collection in Batch Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data.

[BibT_eX]

[DOI]

CoRR, 2021

Understanding the Effect of Stochasticity in Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Escaping the Gravitational Pull of Softmax.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Global Convergence Rates of Softmax Policy Gradient Methods.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Learning to Combat Compounding-Error in Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Maximum Entropy Monte-Carlo Planning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Principled Entropy Exploration in Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018

Memory-Augmented Monte Carlo Tree Search.

[BibT_eX]

[DOI]

Chenjun Xiao

Jincheng Mei

Martin Müller

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Only-One-Victor Pattern Learning in Computer Go.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Intell. AI Games, 2017

2016

Integrating Factorization Ranked Features in MCTS: An Experimental Study.

[BibT_eX]

[DOI]

Chenjun Xiao

Martin Müller

Proceedings of the Computer Games - 5th Workshop on Computer Games, 2016

Factorization Ranking Model for Move Prediction in the Game of Go.

[BibT_eX]

[DOI]

Chenjun Xiao

Martin Müller

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2012

The 2<sup>nd</sup> National University Student Computer-Games Tournaments.

[BibT_eX]

[DOI]

ICGA J., 2012

Chenjun Xiao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...