Mohammad Gheshlaghi Azar
According to our database1,
Mohammad Gheshlaghi Azar
authored at least 45 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
CoRR, 2024
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
CoRR, 2023
Proceedings of the International Conference on Machine Learning, 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
Proceedings of the International Conference on Machine Learning, 2023
2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
2018
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the 34th International Conference on Machine Learning, 2017
2016
Convex Relaxation Regression: Black-Box Optimization of Smooth Functions by Learning Their Convex Envelopes.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016
Correcting Multivariate Auto-Regressive Models for the Influence of Unobserved Common Input.
Proceedings of the Artificial Intelligence Research and Development, 2016
2014
Stochastic Optimization of a Locally Smooth Function under Correlated Bandit Feedback.
CoRR, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
2013
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model.
Mach. Learn., 2013
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
2012
Proceedings of the 29th International Conference on Machine Learning, 2012
2011
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
2010