We stand with Ukraine

We stand with Ukraine

Volodymyr Mnih

According to our database¹, Volodymyr Mnih authored at least 40 papers between 2006 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2023

Vision-Language Models as a Source of Rewards.

[BibT_eX]

[DOI]

CoRR, 2023

In-context Reinforcement Learning with Algorithm Distillation.

[BibT_eX]

[DOI]

,

,

,

Emilio Parisotto

,

Stephen Spencer

,

Richie Steigerwald

,

,

Steven Stenberg Hansen

,

,

Ethan A. Brooks

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Palm up: Playing in the Latent Manifold for Unsupervised Pretraining.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning more skills through optimistic exploration.

[BibT_eX]

[DOI]

,

,

David Warde-Farley

,

,

Steven Stenberg Hansen

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Wasserstein Distance Maximizing Intrinsic Control.

[BibT_eX]

[DOI]

,

,

Stephen Spencer

,

CoRR, 2021

Discovering Diverse Nearly Optimal Policies withSuccessor Features.

[BibT_eX]

[DOI]

,

Brendan O'Donoghue

,

,

,

Sebastian Flennerhag

,

CoRR, 2021

Entropic Desired Dynamics for Intrinsic Control.

[BibT_eX]

[DOI]

,

Guillaume Desjardins

,

,

David Warde-Farley

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Relative Variational Intrinsic Control.

[BibT_eX]

[DOI]

,

David Warde-Farley

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Q-Learning in enormous action spaces via amortized approximate maximization.

[BibT_eX]

[DOI]

Tom Van de Wiele

,

David Warde-Farley

,

,

CoRR, 2020

Fast Task Inference with Variational Intrinsic Successor Features.

[BibT_eX]

[DOI]

,

,

,

David Warde-Farley

,

Tom Van de Wiele

,

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Unsupervised Learning of Object Keypoints for Perception and Control.

[BibT_eX]

[DOI]

Tejas D. Kulkarni

,

,

Catalin Ionescu

,

Sebastian Borgeaud

,

Malcolm Reynolds

,

Andrew Zisserman

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Control Through Non-Parametric Discriminative Rewards.

[BibT_eX]

[DOI]

David Warde-Farley

,

Tom Van de Wiele

,

Tejas D. Kulkarni

,

Catalin Ionescu

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Learning by Playing Solving Sparse Reward Tasks from Scratch.

[BibT_eX]

[DOI]

Martin A. Riedmiller

,

,

,

Michael Neunert

,

,

Tom Van de Wiele

,

,

,

Jost Tobias Springenberg

Proceedings of the 35th International Conference on Machine Learning, 2018

The Uncertainty Bellman Equation and Exploration.

[BibT_eX]

[DOI]

Brendan O'Donoghue

,

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Koray Kavukcuoglu

Proceedings of the 35th International Conference on Machine Learning, 2018

Noisy Networks For Exploration.

[BibT_eX]

[DOI]

Meire Fortunato

,

Mohammad Gheshlaghi Azar

,

,

,

,

,

,

,

,

,

Olivier Pietquin

,

Charles Blundell

,

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Noisy Networks for Exploration.

[BibT_eX]

[DOI]

Meire Fortunato

,

Mohammad Gheshlaghi Azar

,

,

,

,

,

,

,

,

Olivier Pietquin

,

Charles Blundell

,

CoRR, 2017

Combining policy gradient and Q-learning.

[BibT_eX]

[DOI]

Brendan O'Donoghue

,

,

Koray Kavukcuoglu

,

Proceedings of the 5th International Conference on Learning Representations, 2017

Reinforcement Learning with Unsupervised Auxiliary Tasks.

[BibT_eX]

[DOI]

,

,

Wojciech Marian Czarnecki

,

,

,

,

Koray Kavukcuoglu

Proceedings of the 5th International Conference on Learning Representations, 2017

Sample Efficient Actor-Critic with Experience Replay.

[BibT_eX]

[DOI]

,

,

,

,

,

Koray Kavukcuoglu

,

Nando de Freitas

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

Policy Distillation.

[BibT_eX]

[DOI]

,

Sergio Gomez Colmenarejo

,

Çaglar Gülçehre

,

Guillaume Desjardins

,

James Kirkpatrick

,

,

,

Koray Kavukcuoglu

,

Proceedings of the 4th International Conference on Learning Representations, 2016

PGQ: Combining policy gradient and Q-learning.

[BibT_eX]

[DOI]

Brendan O'Donoghue

,

,

Koray Kavukcuoglu

,

CoRR, 2016

Strategic Attentive Writer for Learning Macro-Actions.

[BibT_eX]

[DOI]

Alexander Vezhnevets

,

,

,

,

,

John P. Agapiou

,

Koray Kavukcuoglu

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning values across many orders of magnitude.

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Using Fast Weights to Attend to the Recent Past.

[BibT_eX]

[DOI]

,

Geoffrey E. Hinton

,

,

,

Catalin Ionescu

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Asynchronous Methods for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

Adrià Puigdomènech Badia

,

,

,

Timothy P. Lillicrap

,

,

,

Koray Kavukcuoglu

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Human-level control through deep reinforcement learning.

[BibT_eX]

[DOI]

,

Koray Kavukcuoglu

,

,

,

,

Marc G. Bellemare

,

,

Martin A. Riedmiller

,

Andreas Fidjeland

,

Georg Ostrovski

,

,

Charles Beattie

,

,

Ioannis Antonoglou

,

,

Dharshan Kumaran

,

,

,

Nat., 2015

Massively Parallel Methods for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

Praveen Srinivasan

,

,

,

,

Alessandro De Maria

,

Vedavyas Panneershelvam

,

Mustafa Suleyman

,

Charles Beattie

,

,

,

,

Koray Kavukcuoglu

,

CoRR, 2015

Multiple Object Recognition with Visual Attention.

[BibT_eX]

[DOI]

,

,

Koray Kavukcuoglu

Proceedings of the 3rd International Conference on Learning Representations, 2015

2014

Recurrent Models of Visual Attention.

[BibT_eX]

[DOI]

,

,

,

Koray Kavukcuoglu

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013

Machine Learning for Aerial Image Labeling.

[BibT_eX]

[DOI]

PhD thesis, 2013

Modeling Natural Images Using Gated MRFs.

[BibT_eX]

[DOI]

Marc'Aurelio Ranzato

,

,

Joshua M. Susskind

,

Geoffrey E. Hinton

IEEE Trans. Pattern Anal. Mach. Intell., 2013

Playing Atari with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

Koray Kavukcuoglu

,

,

,

Ioannis Antonoglou

,

,

Martin A. Riedmiller

CoRR, 2013

2012

Learning to Label Aerial Images from Noisy Data.

[BibT_eX]

[DOI]

,

Geoffrey E. Hinton

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Conditional Restricted Boltzmann Machines for Structured Output Prediction.

[BibT_eX]

[DOI]

,

Hugo Larochelle

,

Geoffrey E. Hinton

Proceedings of the UAI 2011, 2011

On deep generative models with applications to recognition.

[BibT_eX]

[DOI]

Marc'Aurelio Ranzato

,

Joshua M. Susskind

,

,

Geoffrey E. Hinton

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Generating more realistic images using gated MRF's.

[BibT_eX]

[DOI]

Marc'Aurelio Ranzato

,

,

Geoffrey E. Hinton

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning to Detect Roads in High-Resolution Aerial Images.

[BibT_eX]

[DOI]

,

Geoffrey E. Hinton

Proceedings of the Computer Vision - ECCV 2010, 2010

2008

Empirical Bernstein stopping.

[BibT_eX]

[DOI]

,

Csaba Szepesvári

,

Jean-Yves Audibert

Proceedings of the Machine Learning, 2008

2006

Topological map learning from outdoor image sequences.

[BibT_eX]

[DOI]

,

Richard S. Zemel

,

J. Field Robotics, 2006

Loading...