Olivier Buffet

Iadine Chadès

Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, 2020

Reinforcement Learning.

Olivier Pietquin

Paul Weng

A Guided Tour of Artificial Intelligence Research: Volume I: Knowledge Representation, 2020

2018

rho-POMDPs have Lipschitz-Continuous epsilon-Optimal Value Functions.

Mathieu Fehr

Vincent Thomas

Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Act in Continuous Dec-POMDPs.

Proceedings of the Journées Francophone Planification, 2018

Recherche heuristique pour jeux stochastiques (à somme nulle).

Abdallah Saffidine

Vincent Thomas

Proceedings of the Journées Francophone Planification, 2018

Learning to Act in Decentralized Partially Observable MDPs.

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

Prise de décision séquentielle dans l'incertain : Exploiter la structure et rester dans le cadre.

, 2017

2016

Intersections intelligentes pour le contrôle de véhicules sans pilote. Coordination locale et optimisation globale.

Rev. d'Intelligence Artif., 2016

Goal Probability Analysis in Probabilistic Planning: Exploring and Enhancing the State of the Art.

Marcel Steinmetz

J. Artif. Intell. Res., 2016

Revisiting Goal Probability Analysis in Probabilistic Planning.

Marcel Steinmetz

Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling, 2016

2015

Structural Results for Cooperative Decentralized Control Models.

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Exploiting Separability in Multiagent Planning with Continuous-State MDPs (Extended Abstract).

Christopher Amato

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

2014

Towards the Usage of Advanced Behavioral Simulations for Simultaneous Tracking and Activity Recognition.

Proceedings of the STAIRS 2014, 2014

Error-Bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Decentralized traffic management: A synchronization-based intersection control.

Proceedings of the International Conference on Advanced Logistics and Transport, 2014

Tracking multiple interacting targets using a joint probabilistic Data Association filter.

Proceedings of the 17th International Conference on Information Fusion, 2014

Stop-Free Strategies for Traffic Networks: Decentralized On-line Optimization.

Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Simultaneous Tracking and Activity Recognition (STAR) using Advanced Agent-Based Behavioral Simulations.

Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Learning Pruning Rules for Heuristic Search Planning.

Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Simulation-based behavior tracking of pedestrians in partially observed indoor environments.

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Exploiting separability in multiagent planning with continuous-state MDPs.

Christopher Amato

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013

Introduction.

Bruno Zanuttini

Guillaume Laurent

Rev. d'Intelligence Artif., 2013

Les POMDP font de meilleurs hackers: Tenir compte de l'incertitude dans les tests de penetration.

Carlos Sarraute

CoRR, 2013

Penetration Testing == POMDP Solving?

Carlos Sarraute

CoRR, 2013

Reactive Coordination Rules for Traffic Optimization in Road Sharing Problems.

Proceedings of the Highlights on Practical Applications of Agents and Multi-Agent Systems, 2013

Adaptive Management of Migratory Birds Under Sea Level Rise.

Proceedings of the IJCAI 2013, 2013

Optimally Solving Dec-POMDPs as Continuous-State MDPs.

Christopher Amato

Proceedings of the IJCAI 2013, 2013

2012

Cooperative Behaviors for the Self-Regulation of Autonomous Vehicles in Space Sharing Conflicts.

Proceedings of the IEEE 24th International Conference on Tools with Artificial Intelligence, 2012

Near-Optimal BRL using Optimistic Local Transitions.

Mauricio Araya-López

Vincent Thomas

Proceedings of the 29th International Conference on Machine Learning, 2012

POMDPs Make Better Hackers: Accounting for Uncertainty in Penetration Testing.

Carlos Sarraute

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

MOMDPs: A Solution for Modelling Adaptive Management Problems.

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

Optimal Priority Assignment Algorithms for Probabilistic Real-Time Systems.

Dorin Maxim

Luca Santinelli

Liliana Cucu-Grosjean

Robert I. Davis

Proceedings of the 19th International Conference on Real-Time and Network Systems, 2011

Active Learning of MDP Models.

Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

2010

A POMDP Extension with Belief-dependent Rewards.

Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, 6-9 December 2010

From "I Like" to "I Prefer" in Collaborative Filtering.

Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010

A Closer Look at MOMDPs.

Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010

Influence of different execution models on patrolling ant behaviors: from agents to robots.

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009

The factored policy-gradient planner.

Artif. Intell., 2009

Self-Organization of Patrolling-Ant Algorithms.

Proceedings of the Third IEEE International Conference on Self-Adaptive and Self-Organizing Systems, 2009

Global Multiprocessor Real-Time Scheduling as a Constraint Satisfaction Problem.

Liliana Cucu-Grosjean

Proceedings of the ICPPW 2009, 2009

2008

Theoretical Study of Ant-based Algorithms for Multi-Agent Patrolling.

Proceedings of the ECAI 2008, 2008

2007

Policy-Gradients for PSRs and POMDPs.

Owen Thomas

Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Shaping multi-agent systems with gradient reinforcement learning.

Auton. Agents Multi Agent Syst., 2007

Factored Planning Using Decomposition Trees.

Proceedings of the IJCAI 2007, 2007

FF + FPG: Guiding a Policy-Gradient Planner.

Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

Concurrent Probabilistic Temporal Planning with Policy-Gradients.

Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

2006

Étude de différentes combinaisons de comportements adaptatives.

Rev. d'Intelligence Artif., 2006

2005

Développement autonome des comportements de base d'un agent.

Rev. d'Intelligence Artif., 2005

Robust Planning with (L)RTDP.

IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Reachability Analysis for Uncertain SSPs.

Proceedings of the 17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2005), 2005

Planification robuste avec (L)RTDP.

Actes de CAP 05, Conférence francophone sur l'apprentissage automatique, 2005

2003

Une double approche modulaire de l'apprentissage par renforcement pour des agents intelligents adaptatifs. (A Twofold Modular Approach of Reinforcement Learning for Adaptive Intelligent Agents).

PhD thesis, 2003

Apprentissage par renforcement pour la conception de systèmes multi-agents réactifs.
