Arthur Guez

According to our database1, Arthur Guez authored at least 36 papers between 2008 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Optimism and Adaptivity in Policy Optimization.
CoRR, 2023

2022
Retrieval-Augmented Reinforcement Learning.
CoRR, 2022

Large-Scale Retrieval for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


Policy improvement by planning with Gumbel.
Proceedings of the Tenth International Conference on Learning Representations, 2022

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Counterfactual Credit Assignment in Model-Free Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Muesli: Combining Improvements in Policy Optimization.
Proceedings of the 38th International Conference on Machine Learning, 2021

On the role of planning in model-based deep reinforcement learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Mastering Atari, Go, chess and shogi by planning with a learned model.
Nat., 2020

Counterfactual Credit Assignment in Model-Free Reinforcement Learning.
CoRR, 2020

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban.
CoRR, 2020

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning.
CoRR, 2020

Value-driven Hindsight Modelling.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Augmenting learning using symmetry in a biologically-inspired domain.
CoRR, 2019

An Investigation of Model-Free Planning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Learning to Search with MCTSnets.
Proceedings of the 35th International Conference on Machine Learning, 2018

Adaptive planning in human search.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017
Mastering the game of Go without human knowledge.
Nat., 2017

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm.
CoRR, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.
CoRR, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Predictron: End-To-End Learning and Planning.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Mastering the game of Go with deep neural networks and tree search.
Nat., 2016

Learning functions across many orders of magnitudes.
CoRR, 2016

Learning values across many orders of magnitude.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Deep Reinforcement Learning with Double Q-Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Increasing the Action Gap: New Operators for Reinforcement Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014
Better Optimism By Bayes: Adaptive Planning with Rich Models.
CoRR, 2014

Bayes-Adaptive Simulation-based Search with Value Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search.
J. Artif. Intell. Res., 2013

2012
Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2010
Multi-tasking SLAM.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

2009
Treating Epilepsy via Adaptive Neurostimulation: a Reinforcement Learning Approach.
Int. J. Neural Syst., 2009

2008
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008


  Loading...