Ian Osband

According to our database1, Ian Osband authored at least 28 papers between 2013 and 2018.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2018
A Tutorial on Thompson Sampling.
Foundations and Trends in Machine Learning, 2018

Randomized Prior Functions for Deep Reinforcement Learning.
CoRR, 2018

Scalable Coordinated Exploration in Concurrent Reinforcement Learning.
CoRR, 2018

The Uncertainty Bellman Equation and Exploration.
Proceedings of the 35th International Conference on Machine Learning, 2018

Deep Q-learning From Demonstrations.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
The Uncertainty Bellman Equation and Exploration.
CoRR, 2017

On Optimistic versus Randomized Exploration in Reinforcement Learning.
CoRR, 2017

Gaussian-Dirichlet Posterior Dominance in Sequential Learning.
CoRR, 2017

Deep Exploration via Randomized Value Functions.
CoRR, 2017

Learning from Demonstrations for Real World Reinforcement Learning.
CoRR, 2017

Noisy Networks for Exploration.
CoRR, 2017

Minimax Regret Bounds for Reinforcement Learning.
CoRR, 2017

A Tutorial on Thompson Sampling.
CoRR, 2017

Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Proceedings of the 34th International Conference on Machine Learning, 2017

Minimax Regret Bounds for Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
On Lower Bounds for Regret in Reinforcement Learning.
CoRR, 2016

Posterior Sampling for Reinforcement Learning Without Episodes.
CoRR, 2016

Why is Posterior Sampling Better than Optimism for Reinforcement Learning.
CoRR, 2016

Deep Exploration via Bootstrapped DQN.
CoRR, 2016

Deep Exploration via Bootstrapped DQN.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Generalization and Exploration via Randomized Value Functions.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Bootstrapped Thompson Sampling and Deep Exploration.
CoRR, 2015

2014
Model-based Reinforcement Learning and the Eluder Dimension.
CoRR, 2014

Near-optimal Regret Bounds for Reinforcement Learning in Factored MDPs.
CoRR, 2014

Model-based Reinforcement Learning and the Eluder Dimension.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Near-optimal Reinforcement Learning in Factored MDPs.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
(More) Efficient Reinforcement Learning via Posterior Sampling.
CoRR, 2013

(More) Efficient Reinforcement Learning via Posterior Sampling.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013


  Loading...