David Silver

According to our database1, David Silver authored at least 113 papers between 2000 and 2018.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2018
Introduction to the special issue on deep reinforcement learning: An editorial.
Neural Networks, 2018

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning.
CoRR, 2018

Implicit Quantile Networks for Distributional Reinforcement Learning.
CoRR, 2018

Meta-Gradient Reinforcement Learning.
CoRR, 2018

Unsupervised Predictive Memory in a Goal-Directed Agent.
CoRR, 2018

Distributed Prioritized Experience Replay.
CoRR, 2018

Unicorn: Continual Learning with a Universal, Off-policy Agent.
CoRR, 2018

Learning to Search with MCTSnets.
CoRR, 2018

Learning to Search with MCTSnets.
Proceedings of the 35th International Conference on Machine Learning, 2018

Implicit Quantile Networks for Distributional Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement.
Proceedings of the 35th International Conference on Machine Learning, 2018

Rainbow: Combining Improvements in Deep Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm.
CoRR, 2017

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning.
CoRR, 2017

Rainbow: Combining Improvements in Deep Reinforcement Learning.
CoRR, 2017

StarCraft II: A New Challenge for Reinforcement Learning.
CoRR, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.
CoRR, 2017

FeUdal Networks for Hierarchical Reinforcement Learning.
CoRR, 2017

Emergence of Locomotion Behaviours in Rich Environments.
CoRR, 2017

Technical perspective: Solving imperfect information games.
Commun. ACM, 2017

Natural Value Approximators: Learning when to Trust Past Estimates.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Successor Features for Transfer in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

FeUdal Networks for Hierarchical Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

The Predictron: End-To-End Learning and Planning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Decoupled Neural Interfaces using Synthetic Gradients.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Mastering the game of Go with deep neural networks and tree search.
Nature, 2016

The Predictron: End-To-End Learning and Planning.
CoRR, 2016

Asynchronous Methods for Deep Reinforcement Learning.
CoRR, 2016

Reinforcement Learning with Unsupervised Auxiliary Tasks.
CoRR, 2016

Deep Reinforcement Learning from Self-Play in Imperfect-Information Games.
CoRR, 2016

Learning and Transfer of Modulated Locomotor Controllers.
CoRR, 2016

Learning functions across many orders of magnitudes.
CoRR, 2016

Successor Features for Transfer in Reinforcement Learning.
CoRR, 2016

Learning values across many orders of magnitude.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Asynchronous Methods for Deep Reinforcement Learning.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep Reinforcement Learning with Double Q-Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Human-level control through deep reinforcement learning.
Nature, 2015

Prioritized Experience Replay.
CoRR, 2015

Massively Parallel Methods for Deep Reinforcement Learning.
CoRR, 2015

Continuous control with deep reinforcement learning.
CoRR, 2015

Learning Continuous Control Policies by Stochastic Value Gradients.
CoRR, 2015

Memory-based control with recurrent neural networks.
CoRR, 2015

Deep Reinforcement Learning with Double Q-learning.
CoRR, 2015

Value Iteration with Options and State Aggregation.
CoRR, 2015

Learning Continuous Control Policies by Stochastic Value Gradients.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Scene understanding for a high-mobility walking robot.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Smooth UCT Search in Computer Poker.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Universal Value Function Approximators.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Fictitious Self-Play in Extensive-Form Games.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Authenticity, Relatability and Collaborative Approaches to Sharing Knowledge about Assistive Living Technology.
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, 2015

2014
Move Evaluation in Go Using Deep Convolutional Neural Networks.
CoRR, 2014

Better Optimism By Bayes: Adaptive Planning with Rich Models.
CoRR, 2014

Learning to Win by Reading Manuals in a Monte-Carlo Framework.
CoRR, 2014

Password Managers: Attacks and Defenses.
Proceedings of the 23rd USENIX Security Symposium, San Diego, CA, USA, August 20-22, 2014., 2014

Bayes-Adaptive Simulation-based Search with Value Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Deterministic Policy Gradient Algorithms.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search.
J. Artif. Intell. Res., 2013

Unit Tests for Stochastic Optimization.
CoRR, 2013

Playing Atari with Deep Reinforcement Learning.
CoRR, 2013

Concurrent Reinforcement Learning from Customer Interactions.
Proceedings of the 30th International Conference on Machine Learning, 2013

Temporal-Difference Search in Computer Go.
Proceedings of the Twenty-Third International Conference on Automated Planning and Scheduling, 2013

2012
Temporal-difference search in computer Go.
Machine Learning, 2012

Learning to Win by Reading Manuals in a Monte-Carlo Framework.
J. Artif. Intell. Res., 2012

Digital natives on a media fast.
Inf. Services and Use, 2012

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search
CoRR, 2012

The grand challenge of computer Go: Monte Carlo tree search and extensions.
Commun. ACM, 2012

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Learning Autonomous Driving Styles and Maneuvers from Expert Demonstration.
Proceedings of the Experimental Robotics, 2012

Active learning from demonstration for robust autonomous navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Compositional Planning Using Optimal Option Models.
Proceedings of the 29th International Conference on Machine Learning, 2012

Gradient Temporal Difference Networks.
Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

Actor-Critic Reinforcement Learning with Energy-Based Policies.
Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

2011
A Monte-Carlo AIXI Approximation.
J. Artif. Intell. Res., 2011

Monte-Carlo tree search and rapid action value estimation in computer Go.
Artif. Intell., 2011

Monte Carlo Localization and registration to prior data for outdoor navigation.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Non-Linear Monte-Carlo Search in Civilization II.
Proceedings of the IJCAI 2011, 2011

Learning to Win by Reading Manuals in a Monte-Carlo Framework.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Learning for Autonomous Navigation.
IEEE Robot. Automat. Mag., 2010

Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain.
I. J. Robotics Res., 2010

Reinforcement Learning via AIXI Approximation
CoRR, 2010

Monte-Carlo Planning in Large POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Reinforcement Learning via AIXI Approximation.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
A Monte Carlo AIXI Approximation
CoRR, 2009

Learning to search: Functional gradient techniques for imitation learning.
Auton. Robots, 2009

Bootstrapping from Game Tree Search.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Perceptual Interpretation for Autonomous Navigation through Dynamic Imitation Learning.
Proceedings of the Robotics Research - The 14th International Symposium, 2009

Fast gradient-descent methods for temporal-difference learning with linear function approximation.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Monte-Carlo simulation balancing.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Applied Imitation Learning for Autonomous Navigation in Complex Natural Terrain.
Proceedings of the Field and Service Robotics, Results of the 7th International Conference, 2009

2008
History, Hype, and Hope: An Afterward.
First Monday, 2008

High Performance Outdoor Navigation from Overhead Data using Imitation Learning.
Proceedings of the Robotics: Science and Systems IV, 2008

Sample-based learning and search with permanent and transient memories.
Proceedings of the Machine Learning, 2008

Achieving Master Level Play in 9 x 9 Computer Go.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Reinforcement Learning of Local Shape in the Game of Go.
Proceedings of the IJCAI 2007, 2007

On the role of tracking in stationary environments.
Proceedings of the Machine Learning, 2007

Combining online and offline knowledge in UCT.
Proceedings of the Machine Learning, 2007

2006
Topological exploration of subterranean environments.
J. Field Robotics, 2006

Recent developments in subterranean robotics.
J. Field Robotics, 2006

Experimental Analysis of Overhead Data Processing To Support Long Range Navigation.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

2005
The hierarchical atlas.
IEEE Trans. Robotics, 2005

Towards Topological Exploration of Abandoned Mines.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Topological Global Localization for Subterranean Voids.
Proceedings of the Field and Service Robotics, Results of the 5th International Conference, 2005

Cooperative Pathfinding.
Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference, 2005

2004
Internet/Cyberculture/ Digital Culture/New Media/ Fill-in-the-Blank Studies.
New Media & Society, 2004

Scan matching for flooded subterranean voids.
Proceedings of the 2004 IEEE Conference on Robotics, Automation and Mechatronics, 2004

A regional point descriptor for global topological localization in flooded subterranean environments.
Proceedings of the 2004 IEEE Conference on Robotics, Automation and Mechatronics, 2004

Feature extraction for topological mine maps.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Arc Carving: Obtaining Accurate, Low Latency Maps from Ultrasonic Range Sensors.
Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

2003
Hierarchical simultaneous localization and mapping.
Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

2000
Book Review: Life Online: Researching Real Experience in Virtual Space.
New Media & Society, 2000


  Loading...