Marc G. Bellemare

J. Artif. Intell. Res., 2018

An Introduction to Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Vincent François-Lavet

Found. Trends Mach. Learn., 2018

An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents.

[BibT_eX]

[DOI]

CoRR, 2018

Dopamine: A Research Framework for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2018

The Barbados 2018 List of Open Issues in Continual Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Approximate Exploration through State Abstraction.

[BibT_eX]

[DOI]

Adrien Ali Taïga

Aaron C. Courville

CoRR, 2018

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract).

[BibT_eX]

[DOI]

Matthew J. Hausknecht

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning.

[BibT_eX]

[DOI]

Audrunas Gruslys

Will Dabney

Mohammad Gheshlaghi Azar

Bilal Piot

Proceedings of the 6th International Conference on Learning Representations, 2018

An Analysis of Categorical Distributional Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Distributional Reinforcement Learning With Quantile Regression.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

The Reactor: A Sample-Efficient Actor-Critic Architecture.

[BibT_eX]

[DOI]

Audrunas Gruslys

Mohammad Gheshlaghi Azar

CoRR, 2017

The Cramer Distance as a Solution to Biased Wasserstein Gradients.

[BibT_eX]

[DOI]

Balaji Lakshminarayanan

Stephan Hoyer

CoRR, 2017

Count-Based Exploration with Neural Density Models.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

A Laplacian Framework for Option Discovery in Reinforcement Learning.

[BibT_eX]

[DOI]

Marlos C. Machado

Michael H. Bowling

Proceedings of the 34th International Conference on Machine Learning, 2017

Automated Curriculum Learning for Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

A Distributional Perspective on Reinforcement Learning.

[BibT_eX]

[DOI]

Will Dabney

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Safe and Efficient Off-Policy Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Unifying Count-Based Exploration and Intrinsic Motivation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Q(λ) with Off-Policy Corrections.

[BibT_eX]

[DOI]

Proceedings of the Algorithmic Learning Theory - 27th International Conference, 2016

Increasing the Action Gap: New Operators for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Human-level control through deep reinforcement learning.

[BibT_eX]

[DOI]

Nat., 2015

Online Learning of k-CNF Boolean Functions.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract).

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Count-Based Frequency Estimation with Bounded Memory.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Compress and Control.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Skip Context Tree Switching.

[BibT_eX]

[DOI]

Erik Talvitie

Proceedings of the 31th International Conference on Machine Learning, 2014

2013

The Arcade Learning Environment: An Evaluation Platform for General Agents.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2013

Bayesian Learning of Recursively Factored Environments.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

Sketch-Based Linear Value Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Investigating Contingency Awareness Using Atari 2600 Games.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2007

Context-Driven Predictions.

[BibT_eX]

[DOI]