Ian Osband

According to our database1, Ian Osband authored at least 12 papers between 2013 and 2018.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2018
A Tutorial on Thompson Sampling.
Foundations and Trends in Machine Learning, 2018

Randomized Prior Functions for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scalable Coordinated Exploration in Concurrent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

The Uncertainty Bellman Equation and Exploration.
Proceedings of the 35th International Conference on Machine Learning, 2018

Deep Q-learning From Demonstrations.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Proceedings of the 34th International Conference on Machine Learning, 2017

Minimax Regret Bounds for Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Deep Exploration via Bootstrapped DQN.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Generalization and Exploration via Randomized Value Functions.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2014
Model-based Reinforcement Learning and the Eluder Dimension.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Near-optimal Reinforcement Learning in Factored MDPs.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
(More) Efficient Reinforcement Learning via Posterior Sampling.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013


  Loading...