Chenjun Xiao

According to our database1, Chenjun Xiao authored at least 24 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning.
CoRR, 2023

Rethinking Decision Transformer via Hierarchical Reinforcement Learning.
CoRR, 2023

In-Sample Policy Iteration for Offline Reinforcement Learning.
CoRR, 2023

Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The In-Sample Softmax for Offline Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Latent Variable Representation for Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Understanding and Leveraging Overparameterization in Recursive Value Estimation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

The Curse of Passive Data Collection in Batch Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data.
CoRR, 2021

Understanding the Effect of Stochasticity in Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Escaping the Gravitational Pull of Softmax.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Global Convergence Rates of Softmax Policy Gradient Methods.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Learning to Combat Compounding-Error in Model-Based Reinforcement Learning.
CoRR, 2019

Maximum Entropy Monte-Carlo Planning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Principled Entropy Exploration in Policy Optimization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Memory-Augmented Monte Carlo Tree Search.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Only-One-Victor Pattern Learning in Computer Go.
IEEE Trans. Comput. Intell. AI Games, 2017

2016
Integrating Factorization Ranked Features in MCTS: An Experimental Study.
Proceedings of the Computer Games - 5th Workshop on Computer Games, 2016

Factorization Ranking Model for Move Prediction in the Game of Go.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2012
The 2<sup>nd</sup> National University Student Computer-Games Tournaments.
J. Int. Comput. Games Assoc., 2012


  Loading...