Marc G. Bellemare

According to our database1, Marc G. Bellemare authored at least 30 papers between 2007 and 2019.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Statistics and Samples in Distributional Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

DeepMDP: Learning Continuous Latent Space Models for Representation Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

The Value Function Polytope in Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Distributional reinforcement learning with linear function approximation.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Temporally Extended Metrics for Markov Decision Processes.
Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the Thirty-Third AAAI Conference on Artificial Intelligence 2019 (AAAI-19), 2019

2018
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents.
J. Artif. Intell. Res., 2018

An Introduction to Deep Reinforcement Learning.
Foundations and Trends in Machine Learning, 2018

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract).
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

An Analysis of Categorical Distributional Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Distributional Reinforcement Learning With Quantile Regression.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Count-Based Exploration with Neural Density Models.
Proceedings of the 34th International Conference on Machine Learning, 2017

A Laplacian Framework for Option Discovery in Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Automated Curriculum Learning for Neural Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

A Distributional Perspective on Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Safe and Efficient Off-Policy Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Unifying Count-Based Exploration and Intrinsic Motivation.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Q(λ) with Off-Policy Corrections.
Proceedings of the Algorithmic Learning Theory - 27th International Conference, 2016

Increasing the Action Gap: New Operators for Reinforcement Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Human-level control through deep reinforcement learning.
Nature, 2015

Online Learning of k-CNF Boolean Functions.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract).
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Count-Based Frequency Estimation with Bounded Memory.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Compress and Control.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Skip Context Tree Switching.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
The Arcade Learning Environment: An Evaluation Platform for General Agents.
J. Artif. Intell. Res., 2013

Bayesian Learning of Recursively Factored Environments.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Sketch-Based Linear Value Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Investigating Contingency Awareness Using Atari 2600 Games.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2007
Context-Driven Predictions.
Proceedings of the IJCAI 2007, 2007


  Loading...