Frans A. Oliehoek

Auton. Agents Multi Agent Syst., October, 2023

An Analysis of Model-Based Reinforcement Learning From Abstracted Observations.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

What model does MuZero learn?

[BibT_eX]

[DOI]

Jinke He

Thomas M. Moerland

CoRR, 2023

Towards a Unifying Model of Rationality in Multiagent Systems.

[BibT_eX]

[DOI]

Robert Tyler Loftin

Mustafa Mert Çelikok

CoRR, 2023

What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization.

[BibT_eX]

[DOI]

Zuzanna Osika

Jazmin Zatarain Salazar

Pradeep K. Murukannaiah

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Safe Multi-agent Learning via Trapping Regions.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Safety Guarantees in Multi-agent Learning via Trapping Regions.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

An Analysis of Abstracted Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Rolf A. N. Starre

Marco Loog

CoRR, 2022

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Online Planning in POMDPs with Self-Improving Simulators.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games.

[BibT_eX]

[DOI]

Robert Tyler Loftin

Proceedings of the International Conference on Machine Learning, 2022

Multi-Agent MDP Homomorphic Networks.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Multi Robot Surveillance and Planning in Limited Communication Environments.

[BibT_eX]

[DOI]

Vibhav Inna Kedege

Ludo Stellingwerff

Proceedings of the 14th International Conference on Agents and Artificial Intelligence, 2022

Model-Based Reinforcement Learning with State Abstraction: A Survey.

[BibT_eX]

[DOI]

Rolf A. N. Starre

Marco Loog

Proceedings of the Artificial Intelligence and Machine Learning, 2022

Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning.

[BibT_eX]

[DOI]

Markus Peschl

Arkady Zgonnikov

Luciano Cavalcante Siebert

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Best-Response Bayesian Reinforcement Learning with Bayes-adaptive POMDPs for Centaurs.

[BibT_eX]

[DOI]

Mustafa Mert Çelikok

Samuel Kaski

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021

General-Sum Multi-Agent Continuous Inverse Optimal Control.

[BibT_eX]

[DOI]

Christian Neumeyer

Dariu M. Gavrila

IEEE Robotics Autom. Lett., 2021

A Sufficient Statistic for Influence in Structured Multiagent Environments.

[BibT_eX]

[DOI]

Leslie Pack Kaelbling

J. Artif. Intell. Res., 2021

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2021

ReproducedPapers.org: Openly Teaching and Structuring Machine Learning Reproducibility.

[BibT_eX]

[DOI]

Proceedings of the Reproducible Research in Pattern Recognition, 2021

Learning Complex Policy Distribution with CEM Guided Adversarial Hypernetwork.

[BibT_eX]

[DOI]

Shi Yuan Tang

Athirai A. Irissappane

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Environment Shift Games: Are Multiple Agents the Solution, and not the Problem, to Non-Stationarity?

[BibT_eX]

[DOI]

Alexander Mey

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Loss Bounds for Approximate Influence-Based Abstraction.

[BibT_eX]

[DOI]

Elena Congeduti

Alexander Mey

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Difference Rewards Policy Gradients.

[BibT_eX]

[DOI]

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Abstraction-Guided Policy Recovery from Expert Demonstrations.

[BibT_eX]

[DOI]

Canmanie T. Ponnambalam

Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, 2021

2020

Analog Circuit Design with Dyna-Style Reinforcement Learning.

[BibT_eX]

[DOI]

Wook Lee

CoRR, 2020

Maximizing Information Gain in Partially Observable Environments via Prediction Reward.

[BibT_eX]

[DOI]

CoRR, 2020

Diversity in Action: General-Sum Multi-Agent Continuous Inverse Optimal Control.

[BibT_eX]

[DOI]

Christian Muench

Dariu M. Gavrila

CoRR, 2020

Mimicking Evolution with Reinforcement Learning.

[BibT_eX]

[DOI]

João P. Abrantes

Arnaldo J. Abrantes

CoRR, 2020

A Research Agenda for Hybrid Intelligence: Augmenting Human Intellect With Collaborative, Adaptive, Responsible, and Explainable Artificial Intelligence.

[BibT_eX]

[DOI]

Computer, 2020

MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Multi-agent active perception with prediction rewards.

[BibT_eX]

[DOI]

Mikko Lauri

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Influence-Augmented Online Planning for Complex Environments.

[BibT_eX]

[DOI]

Jinke He

Miguel Suau

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Sensor Data for Human Activity Recognition: Feature Representation and Benchmarking.

[BibT_eX]

[DOI]

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Decentralized MCTS via Learned Teammate Models.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Maximizing Information Gain in Partially Observable Environments via Prediction Rewards.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Plannable Approximations to MDP Homomorphisms: Equivariance under Actions.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Influence-aware Memory for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Miguel Suau

Elena Congeduti

Rolf Starre

CoRR, 2019

Learning From Demonstration in the Wild.

[BibT_eX]

[DOI]

Feryal M. P. Behbahani

Proceedings of the International Conference on Robotics and Automation, 2019

Bayesian RL in Factored POMDPs.

[BibT_eX]

[DOI]

Sammie Katt

Chris Amato

Proceedings of the 31st Benelux Conference on Artificial Intelligence (BNAIC 2019) and the 28th Belgian Dutch Conference on Machine Learning (Benelearn 2019), 2019

Bayesian Reinforcement Learning in Factored POMDPs.

[BibT_eX]

[DOI]

Sammie Katt

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

Exploiting submodular value functions for scaling up active perception.

[BibT_eX]

[DOI]

Auton. Robots, 2018

Reports on the 2018 AAAI Spring Symposium Series.

[BibT_eX]

[DOI]

Haitham Bou-Ammar

Elizabeth F. Churchill

AI Mag., 2018

Interactive Learning and Decision Making: Foundations, Insights & Challenges.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Beyond Local Nash Equilibria for Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence - 30th Benelux Conference, 2018

Model-Based Reinforcement Learning under Periodical Observability.

[BibT_eX]

[DOI]

Richard Klíma

Karl Tuyls

Proceedings of the 2018 AAAI Spring Symposia, 2018

2017

The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2017

GANGs: Generative Adversarial Network Games.

[BibT_eX]

[DOI]

CoRR, 2017

Real-Time Resource Allocation for Tracking Systems.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Learning in POMDPs with Monte Carlo Tree Search.

[BibT_eX]

[DOI]

Sammie Katt

Proceedings of the 34th International Conference on Machine Learning, 2017

Decentralised Online Planning for Multi-Robot Warehouse Commissioning.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

LiftUpp: Support to Develop Learner Performance.

[BibT_eX]

[DOI]

John Christopher Jones

Proceedings of the Artificial Intelligence in Education - 18th International Conference, 2017

Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

A Concise Introduction to Decentralized POMDPs

[BibT_eX]

[DOI]

Springer Briefs in Intelligent Systems, Springer, ISBN: 978-3-319-28929-8, 2016

Probably Approximately Correct Greedy Maximization.

[BibT_eX]

[DOI]

CoRR, 2016

Reports of the AAAI 2016 Spring Symposium Series.

[BibT_eX]

[DOI]

AI Mag., 2016

The 2015 AAAI Fall Symposium Series Reports.

[BibT_eX]

[DOI]

AI Mag., 2016

PAC Greedy Maximization with Efficient Bounds on Information Gain for Sensor Selection.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information.

[BibT_eX]

[DOI]

Auke J. Wiggers

Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Probably Approximately Correct Greedy Maximization: (Extended Abstract).

[BibT_eX]

[DOI]

Mathijs Michiel de Weerdt

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Energy- and Cost-Efficient Pumping Station Control.

[BibT_eX]

[DOI]

Timon V. Kanters

Michael Kaisers

Stan R. van den Bosch

Joep Grispen

Jeroen Hermans

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

A Scalable Framework to Choose Sellers in E-Marketplaces Using POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Computing Convex Coverage Sets for Faster Multi-objective Coordination.

[BibT_eX]

[DOI]

Diederik Marijn Roijers

J. Artif. Intell. Res., 2015

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version).

[BibT_eX]

[DOI]

CoRR, 2015

Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs (Extended Version).

[BibT_eX]

[DOI]

Philipp Robbel

Mykel J. Kochenderfer

CoRR, 2015

Influence-Optimistic Local Values for Multiagent Planning - Extended Version.

[BibT_eX]

[DOI]

CoRR, 2015

Scaling POMDPs For Selecting Sellers in E-markets-Extended Version.

[BibT_eX]

[DOI]

CoRR, 2015

Point-Based Planning for Multi-Objective POMDPs.

[BibT_eX]

[DOI]

Diederik Marijn Roijers

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Secure Routing in Wireless Sensor Networks via POMDPs.

[BibT_eX]

[DOI]

Partha Sarathi Dutta

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Influence-Optimistic Local Values for Multiagent Planning.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs.

[BibT_eX]

[DOI]

Philipp Robbel

Mykel J. Kochenderfer

Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

The MADP Toolbox: An Open-Source Library for Planning and Learning in (Multi-)Agent Systems.

[BibT_eX]

[DOI]

Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Exploiting Submodular Value Functions for Faster Dynamic Sensor Selection.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Scalable Planning and Learning for Multiagent POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Multi-Source Entity Resolution for Genealogical Data.

[BibT_eX]

[DOI]

Julia Efremova

Bijan Ranjbar Sahraei

Proceedings of the Population Reconstruction, 2015

2014

Linear support for multi-objective coordination graphs.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

A POMDP based approach to optimally select sellers in electronic marketplaces.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty.

[BibT_eX]

[DOI]

Diederik Marijn Roijers

Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

2013

Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2013

Sufficient Plan-Time Statistics for Decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2013, 2013

Multi-objective variable elimination for collaborative graphical games.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Approximate solutions for factored Dec-POMDPs with many agents.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Computing Convex Coverage Sets for Multi-objective Coordination Graphs.

[BibT_eX]

[DOI]

Proceedings of the Algorithmic Decision Theory - Third International Conference, 2013

2012

Exploiting Structure in Cooperative Bayesian Games.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Heuristic search of multiagent influence space.

[BibT_eX]

[DOI]

Leslie Pack Kaelbling

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Tree-based pruning for multiagent POMDPs with delayed communication.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Influence-Based Abstraction for Multiagent Systems.

[BibT_eX]

[DOI]

Leslie Pack Kaelbling

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Reinforcement Learning, 2012

2011

Exploiting Agent and Type Independence in Collaborative Graphical Bayesian Games

[BibT_eX]

[DOI]

CoRR, 2011

Scaling Up Optimal Heuristic Search in Dec-POMDPs via Incremental Expansion.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2011, 2011

2010

A Decision-Theoretic Approach to Collaboration: Principal Description Methods and Efficient Heuristic Approximations.

[BibT_eX]

[DOI]

Arnoud Visser

Proceedings of the Interactive Collaborative Information Systems, 2010

Heuristic search for identical payoff Bayesian games.

[BibT_eX]

[DOI]

Jilles Steeve Dibangoye

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009

Lossless clustering of histories in decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2008

Optimal and Approximate Q-value Functions for Decentralized POMDPs.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2008

The Cross-Entropy Method for Policy Search in Decentralized POMDPs.

[BibT_eX]

[DOI]

Julian F. P. Kooij

Informatica (Slovenia), 2008

Exploiting locality of interaction in factored Dec-POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Multiagent Planning Under Uncertainty with Stochastic Communication Delays.

[BibT_eX]

[DOI]

Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, 2008

2007

A Cross-Entropy Approach to Solving Dec-POMDPs.

[BibT_eX]

[DOI]

Julian F. P. Kooij

Proceedings of the Advances in Intelligent and Distributed Computing, 2007

Q-value functions for decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Q-value Heuristics for Approximate Solutions of Dec-POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Game Theoretic and Decision Theoretic Agents, 2007

2006

The parallel Nash Memory for asymmetric games.

[BibT_eX]

[DOI]

Edwin D. de Jong

Proceedings of the Genetic and Evolutionary Computation Conference, 2006

2005

Coevolutionary Nash in poker games.

[BibT_eX]