# Mohammad Gheshlaghi Azar

According to our database

Collaborative distances:

^{1}, Mohammad Gheshlaghi Azar authored at least 14 papers between 2011 and 2018.Collaborative distances:

## Timeline

#### Legend:

Book In proceedings Article PhD thesis Other## Links

#### On csauthors.net:

## Bibliography

2018

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning.

Proceedings of the 6th International Conference on Learning Representations, 2018

Noisy Networks For Exploration.

Proceedings of the 6th International Conference on Learning Representations, 2018

Rainbow: Combining Improvements in Deep Reinforcement Learning.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Minimax Regret Bounds for Reinforcement Learning.

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Convex Relaxation Regression: Black-Box Optimization of Smooth Functions by Learning Their Convex Envelopes.

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Correcting Multivariate Auto-Regressive Models for the Influence of Unobserved Common Input.

Proceedings of the Artificial Intelligence Research and Development, 2016

2014

Online Stochastic Optimization under Correlated Bandit Feedback.

Proceedings of the 31th International Conference on Machine Learning, 2014

2013

Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model.

Machine Learning, 2013

Regret Bounds for Reinforcement Learning with Policy Advice.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Sequential Transfer in Multi-armed Bandit with Finite Set of Models.

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012

Dynamic policy programming.

J. Mach. Learn. Res., 2012

On the Sample Complexity of Reinforcement Learning with a Generative Model .

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Dynamic Policy Programming with Function Approximation.

Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Speedy Q-Learning.

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011