Alessandro Lazaric

J. Mach. Learn. Res., 2012

Learning with stochastic inputs and adversarial outputs.

[BibT_eX]

[DOI]

J. Comput. Syst. Sci., 2012

A truthful learning mechanism for contextual multi-slot sponsored search auctions with externalities.

[BibT_eX]

[DOI]

Francesco Trovò

Proceedings of the 13th ACM Conference on Electronic Commerce, 2012

Risk-Aversion in Multi-armed Bandits.

[BibT_eX]

[DOI]

Amir Sani

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence.

[BibT_eX]

[DOI]

Victor Gabillon

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Dantzig Selector Approach to Temporal Difference Learning.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Machine Learning, 2012

Semi-Supervised Apprenticeship Learning.

[BibT_eX]

[DOI]

Michal Valko

Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

A truthful learning mechanism for multi-slot sponsored search auctions with externalities.

[BibT_eX]

[DOI]

Francesco Trovò

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Conservative and Greedy Approaches to Classification-Based Policy Iteration.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Transfer in Reinforcement Learning: A Framework and a Survey.

[BibT_eX]

[DOI]

Proceedings of the Reinforcement Learning, 2012

Least-Squares Methods for Policy Iteration.

[BibT_eX]

[DOI]

Proceedings of the Reinforcement Learning, 2012

2011

Transfer from Multiple MDPs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Multi-Bandit Best Arm Identification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Finite-Sample Analysis of Lasso-TD.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Machine Learning, 2011

Classification-based Policy Iteration with a Critic.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Machine Learning, 2011

Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits.

[BibT_eX]

[DOI]

Proceedings of the Algorithmic Learning Theory - 22nd International Conference, 2011

2010

Finite-sample Analysis of Bellman Residual Minimization.

[BibT_eX]

[DOI]

Odalric-Ambrym Maillard

Proceedings of the 2nd Asian Conference on Machine Learning, 2010

LSTD with Random Projections.

[BibT_eX]

[DOI]

Odalric-Ambrym Maillard

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Finite-Sample Analysis of LSTD.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Analysis of a Classification-based Policy Iteration Algorithm.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Bayesian Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2009

Reinforcement distribution in fuzzy Q-learning.

[BibT_eX]

[DOI]

Fuzzy Sets Syst., 2009

Workshop summary: On-line learning with limited feedback.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Hybrid Stochastic-Adversarial On-line Learning.

[BibT_eX]

[DOI]

Proceedings of the COLT 2009, 2009

2008

Improving Batch Reinforcement Learning Performance through Transfer of Samples.

[BibT_eX]

[DOI]

Proceedings of the STAIRS 2008, 2008

Batch Reinforcement Learning for Controlling a Mobile Wheeled Pendulum Robot.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence in Theory and Practice II, 2008

Transfer of samples in batch reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2008

On the usefulness of opponent modeling: the Kuhn Poker case study.

[BibT_eX]

[DOI]

Mario Quaresimale

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Transfer of task representation in reinforcement learning using policy-based proto-value functions.

[BibT_eX]

[DOI]

Eliseo Ferrante

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Towards Automated Bargaining in Electronic Markets: A Partially Two-Sided Competition Model.

[BibT_eX]

[DOI]

Proceedings of the Agent-Mediated Electronic Commerce and Trading Agent Design and Analysis, 2008

2007

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Piecewise constant reinforcement learning for robotic applications.

[BibT_eX]

Proceedings of the ICINCO 2007, 2007

Reinforcement learning in extensive form games with incomplete information: the bargaining case study.

[BibT_eX]

[DOI]

Enrique Munoz de Cote

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions.

[BibT_eX]

[DOI]

Proceedings of the AI*IA 2007: Artificial Intelligence and Human-Oriented Computing, 2007

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game.

[BibT_eX]

[DOI]

Enrique Munoz de Cote

Fabio Dercole

Proceedings of the Adaptive Agents and Multi-Agent Systems III. Adaptation and Multi-Agent Learning, 2007

2006

Incremental Skill Acquisition for Self-motivated Learning Animats.

[BibT_eX]

[DOI]

Proceedings of the From Animals to Animats 9, 2006

Learning to cooperate in multi-agent social dilemmas.

[BibT_eX]

[DOI]

Enrique Munoz de Cote