Frans A. Oliehoek

Orcid: 0000-0003-4372-5055

Affiliations:
  • Delft University of Technology, The Netherlands


According to our database1, Frans A. Oliehoek authored at least 114 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Policy Space Response Oracles: A Survey.
CoRR, 2024

When Do Off-Policy and On-Policy Policy Gradient Methods Align?
CoRR, 2024

Explaining Learned Reward Functions with Counterfactual Trajectories.
CoRR, 2024

2023
Teacher-apprentices RL (TARL): leveraging complex policy distribution through generative adversarial hypernetwork in reinforcement learning.
Auton. Agents Multi Agent Syst., October, 2023

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL.
CoRR, 2023

What model does MuZero learn?
CoRR, 2023

Towards a Unifying Model of Rationality in Multiagent Systems.
CoRR, 2023

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments.
CoRR, 2023

What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Safe Multi-agent Learning via Trapping Regions.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Safety Guarantees in Multi-agent Learning via Trapping Regions.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
An Analysis of Abstracted Model-Based Reinforcement Learning.
CoRR, 2022

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Online Planning in POMDPs with Self-Improving Simulators.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems.
Proceedings of the International Conference on Machine Learning, 2022

On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games.
Proceedings of the International Conference on Machine Learning, 2022

Multi-Agent MDP Homomorphic Networks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Multi Robot Surveillance and Planning in Limited Communication Environments.
Proceedings of the 14th International Conference on Agents and Artificial Intelligence, 2022

Model-Based Reinforcement Learning with State Abstraction: A Survey.
Proceedings of the Artificial Intelligence and Machine Learning, 2022

Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Best-Response Bayesian Reinforcement Learning with Bayes-adaptive POMDPs for Centaurs.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
General-Sum Multi-Agent Continuous Inverse Optimal Control.
IEEE Robotics Autom. Lett., 2021

A Sufficient Statistic for Influence in Structured Multiagent Environments.
J. Artif. Intell. Res., 2021

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning.
Auton. Agents Multi Agent Syst., 2021

ReproducedPapers.org: Openly Teaching and Structuring Machine Learning Reproducibility.
Proceedings of the Reproducible Research in Pattern Recognition, 2021

Learning Complex Policy Distribution with CEM Guided Adversarial Hypernetwork.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Environment Shift Games: Are Multiple Agents the Solution, and not the Problem, to Non-Stationarity?
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Loss Bounds for Approximate Influence-Based Abstraction.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Difference Rewards Policy Gradients.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Abstraction-Guided Policy Recovery from Expert Demonstrations.
Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, 2021

2020
Analog Circuit Design with Dyna-Style Reinforcement Learning.
CoRR, 2020

Maximizing Information Gain in Partially Observable Environments via Prediction Reward.
CoRR, 2020

Diversity in Action: General-Sum Multi-Agent Continuous Inverse Optimal Control.
CoRR, 2020

Mimicking Evolution with Reinforcement Learning.
CoRR, 2020

A Research Agenda for Hybrid Intelligence: Augmenting Human Intellect With Collaborative, Adaptive, Responsible, and Explainable Artificial Intelligence.
Computer, 2020

MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Multi-agent active perception with prediction rewards.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Influence-Augmented Online Planning for Complex Environments.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Sensor Data for Human Activity Recognition: Feature Representation and Benchmarking.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Decentralized MCTS via Learned Teammate Models.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Maximizing Information Gain in Partially Observable Environments via Prediction Rewards.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Plannable Approximations to MDP Homomorphisms: Equivariance under Actions.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Influence-aware Memory for Deep Reinforcement Learning.
CoRR, 2019

Learning From Demonstration in the Wild.
Proceedings of the International Conference on Robotics and Automation, 2019

Bayesian RL in Factored POMDPs.
Proceedings of the 31st Benelux Conference on Artificial Intelligence (BNAIC 2019) and the 28th Belgian Dutch Conference on Machine Learning (Benelearn 2019), 2019

Bayesian Reinforcement Learning in Factored POMDPs.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Exploiting submodular value functions for scaling up active perception.
Auton. Robots, 2018

Reports on the 2018 AAAI Spring Symposium Series.
AI Mag., 2018

Interactive Learning and Decision Making: Foundations, Insights & Challenges.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Beyond Local Nash Equilibria for Adversarial Networks.
Proceedings of the Artificial Intelligence - 30th Benelux Conference, 2018

Model-Based Reinforcement Learning under Periodical Observability.
Proceedings of the 2018 AAAI Spring Symposia, 2018

2017
The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems.
J. Mach. Learn. Res., 2017

GANGs: Generative Adversarial Network Games.
CoRR, 2017

Real-Time Resource Allocation for Tracking Systems.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Learning in POMDPs with Monte Carlo Tree Search.
Proceedings of the 34th International Conference on Machine Learning, 2017

Decentralised Online Planning for Multi-Robot Warehouse Commissioning.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

LiftUpp: Support to Develop Learner Performance.
Proceedings of the Artificial Intelligence in Education - 18th International Conference, 2017

Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
A Concise Introduction to Decentralized POMDPs
Springer Briefs in Intelligent Systems, Springer, ISBN: 978-3-319-28929-8, 2016

Probably Approximately Correct Greedy Maximization.
CoRR, 2016

Reports of the AAAI 2016 Spring Symposium Series.
AI Mag., 2016

The 2015 AAAI Fall Symposium Series Reports.
AI Mag., 2016

PAC Greedy Maximization with Efficient Bounds on Information Gain for Sensor Selection.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Probably Approximately Correct Greedy Maximization: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Energy- and Cost-Efficient Pumping Station Control.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

A Scalable Framework to Choose Sellers in E-Marketplaces Using POMDPs.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Computing Convex Coverage Sets for Faster Multi-objective Coordination.
J. Artif. Intell. Res., 2015

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version).
CoRR, 2015

Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs (Extended Version).
CoRR, 2015

Influence-Optimistic Local Values for Multiagent Planning - Extended Version.
CoRR, 2015

Scaling POMDPs For Selecting Sellers in E-markets-Extended Version.
CoRR, 2015

Point-Based Planning for Multi-Objective POMDPs.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Secure Routing in Wireless Sensor Networks via POMDPs.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Influence-Optimistic Local Values for Multiagent Planning.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

The MADP Toolbox: An Open-Source Library for Planning and Learning in (Multi-)Agent Systems.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Exploiting Submodular Value Functions for Faster Dynamic Sensor Selection.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Scalable Planning and Learning for Multiagent POMDPs.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Multi-Source Entity Resolution for Genealogical Data.
Proceedings of the Population Reconstruction, 2015

2014
Linear support for multi-objective coordination graphs.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

A POMDP based approach to optimally select sellers in electronic marketplaces.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty.
Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

2013
Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs.
J. Artif. Intell. Res., 2013

Sufficient Plan-Time Statistics for Decentralized POMDPs.
Proceedings of the IJCAI 2013, 2013

Multi-objective variable elimination for collaborative graphical games.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Approximate solutions for factored Dec-POMDPs with many agents.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Computing Convex Coverage Sets for Multi-objective Coordination Graphs.
Proceedings of the Algorithmic Decision Theory - Third International Conference, 2013

2012
Exploiting Structure in Cooperative Bayesian Games.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Heuristic search of multiagent influence space.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Tree-based pruning for multiagent POMDPs with delayed communication.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Influence-Based Abstraction for Multiagent Systems.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Decentralized POMDPs.
Proceedings of the Reinforcement Learning, 2012

2011
Exploiting Agent and Type Independence in Collaborative Graphical Bayesian Games
CoRR, 2011

Scaling Up Optimal Heuristic Search in Dec-POMDPs via Incremental Expansion.
Proceedings of the IJCAI 2011, 2011

2010
A Decision-Theoretic Approach to Collaboration: Principal Description Methods and Efficient Heuristic Approximations.
Proceedings of the Interactive Collaborative Information Systems, 2010

Heuristic search for identical payoff Bayesian games.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009
Lossless clustering of histories in decentralized POMDPs.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2008
Optimal and Approximate Q-value Functions for Decentralized POMDPs.
J. Artif. Intell. Res., 2008

The Cross-Entropy Method for Policy Search in Decentralized POMDPs.
Informatica (Slovenia), 2008

Exploiting locality of interaction in factored Dec-POMDPs.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Multiagent Planning Under Uncertainty with Stochastic Communication Delays.
Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, 2008

2007
A Cross-Entropy Approach to Solving Dec-POMDPs.
Proceedings of the Advances in Intelligent and Distributed Computing, 2007

Q-value functions for decentralized POMDPs.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Q-value Heuristics for Approximate Solutions of Dec-POMDPs.
Proceedings of the Game Theoretic and Decision Theoretic Agents, 2007

2006
The parallel Nash Memory for asymmetric games.
Proceedings of the Genetic and Evolutionary Computation Conference, 2006

2005
Coevolutionary Nash in poker games.
Proceedings of the BNAIC 2005, 2005


  Loading...