Matthijs T. J. Spaan

Affiliations:
  • Delft University of Technology, Netherlands


According to our database1, Matthijs T. J. Spaan authored at least 98 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications.
CoRR, 2024

When Do Off-Policy and On-Policy Policy Gradient Methods Align?
CoRR, 2024

2023
Safety-constrained reinforcement learning with a distributional safety critic.
Mach. Learn., March, 2023

Diverse Projection Ensembles for Distributional Reinforcement Learning.
CoRR, 2023

The Role of Diverse Replay for Generalisation in Reinforcement Learning.
CoRR, 2023

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL.
CoRR, 2023

Scalable Safe Policy Improvement via Monte Carlo Tree Search.
Proceedings of the International Conference on Machine Learning, 2023

Reinforcement Learning by Guided Safe Exploration.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Planning with Uncertainty: Deep Exploration in Model-Based Reinforcement Learning.
CoRR, 2022

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems.
Proceedings of the International Conference on Machine Learning, 2022

Abstraction-Refinement for Hierarchical Probabilistic Models.
Proceedings of the Computer Aided Verification - 34th International Conference, 2022

Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms.
J. Artif. Intell. Res., 2021

Safe Policies for Factored Partially Observable Stochastic Games.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021

AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Abstraction-Guided Policy Recovery from Expert Demonstrations.
Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, 2021

WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A Unified Decision-Theoretic Model for Information Gathering and Communication Planning.
Proceedings of the 29th IEEE International Conference on Robot and Human Interactive Communication, 2020

Improved Power Flow Methods for DC Grids.
Proceedings of the 29th IEEE International Symposium on Industrial Electronics, 2020

Decentralized Combinatorial Auctions for Dynamic and Large-Scale Collaborative Vehicle Routing.
Proceedings of the Computational Logistics - 11th International Conference, 2020

2019
Point-Based Value Iteration for Finite-Horizon POMDPs.
J. Artif. Intell. Res., 2019

Structure Learning for Safe Policy Improvement.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Safe Policy Improvement with Baseline Bootstrapping in Factored Environments.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Column Generation Algorithms for Constrained POMDPs.
J. Artif. Intell. Res., 2018

Exploiting submodular value functions for scaling up active perception.
Auton. Robots, 2018

Stability and Decentralized Control of Plug-and-Play DC Distribution Grids.
IEEE Access, 2018

Fleet Management for Pickup and Delivery Problems with Multiple Locations and Preferences.
Proceedings of the Dynamics in Logistics, 2018

Improving Offline Value-Function Approximations for POMDPs by Reducing Discount Factors.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Capacity-aware Sequential Recommendations.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Bootstrapping LPs in Value Iteration for Multi-Objective and Partially Observable MDPs.
Proceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling, 2018

Preallocation and Planning Under Stochastic Resource Constraints.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems.
J. Mach. Learn. Res., 2017

Accelerated Vector Pruning for Optimal POMDP Solvers.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Bounding the Probability of Resource Constraint Violations in Multi-Agent MDPs.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Traffic flow optimization: A reinforcement learning approach.
Eng. Appl. Artif. Intell., 2016

The 2015 AAAI Fall Symposium Series Reports.
AI Mag., 2016

Planning Under Uncertainty for Aggregated Electric Vehicle Charging with Renewable Energy Supply.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Decoupling a Resource Constraint Through Fictitious Play in Multi-Agent Sequential Decision Making.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Planning under Uncertainty for Aggregated Electric Vehicle Charging Using Markov Decision Processes.
Proceedings of the AI for Smart Grids and Smart Buildings, 2016

Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version).
CoRR, 2015

Influence-Optimistic Local Values for Multiagent Planning - Extended Version.
CoRR, 2015

Decision-theoretic planning under uncertainty with information rewards for active cooperative perception.
Auton. Agents Multi Agent Syst., 2015

Planning under Uncertainty with Weighted State Scenarios.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Improving Value Function Approximation in Factored POMDPs by Exploiting Model Structure.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Influence-Optimistic Local Values for Multiagent Planning.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

The MADP Toolbox: An Open-Source Library for Planning and Learning in (Multi-)Agent Systems.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Best-Response Planning of Thermostatically Controlled Loads under Power Constraints.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
A testbed for autonomous robot surveillance.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty.
Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

Point-Based POMDP Solving with Factored Value Function Approximation.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs.
J. Artif. Intell. Res., 2013

Decentralized multi-robot cooperation with auctioned POMDPs.
Int. J. Robotics Res., 2013

Coordinating maintenance planning under uncertainty: (demonstration).
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Approximate solutions for factored Dec-POMDPs with many agents.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Multiagent POMDPs with asynchronous execution.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

A Flexible Approach to Modeling Unpredictable Events in MDPs.
Proceedings of the Twenty-Third International Conference on Automated Planning and Scheduling, 2013

Planning under Uncertainty for Coordinating Infrastructural Maintenance.
Proceedings of the Twenty-Third International Conference on Automated Planning and Scheduling, 2013

GSMDPs for Multi-Robot Sequential Decision-Making.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Bayesian-Game-Based Fuzzy Reinforcement Learning Control for Decentralized POMDPs.
IEEE Trans. Comput. Intell. AI Games, 2012

Exploiting Structure in Cooperative Bayesian Games.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Tree-based pruning for multiagent POMDPs with delayed communication.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Partially Observable Markov Decision Processes.
Proceedings of the Reinforcement Learning, 2012

2011
Exploiting Agent and Type Independence in Collaborative Graphical Bayesian Games
CoRR, 2011

Efficient Offline Communication Policies for Factored Multiagent POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Scaling Up Optimal Heuristic Search in Dec-POMDPs via Incremental Expansion.
Proceedings of the IJCAI 2011, 2011

Fuzzy reinforcement learning control for decentralized partially observable Markov decision processes.
Proceedings of the FUZZ-IEEE 2011, 2011

QueryPOMDP: POMDP-Based Communication in Multiagent Systems.
Proceedings of the Multi-Agent Systems - 9th European Workshop, 2011

2010
Decentralized Sensor Fusion for Ubiquitous Networking Robotics in Urban Areas.
Sensors, 2010

Active cooperative perception in network robot systems using POMDPs.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Fault-tolerant probabilistic sensor fusion for Multi-Agent Systems.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Multirobot coordination by auctioning POMDPs.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

A Bayesian game based adaptive fuzzy controller for multiagent POMDPs.
Proceedings of the FUZZ-IEEE 2010, 2010

Heuristic search for identical payoff Bayesian games.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009
Signal Processing Advances in Robots and Autonomy.
EURASIP J. Adv. Signal Process., 2009

Decision-theoretic robot guidance for active cooperative perception.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

ISROBOTNET: A testbed for sensor and robot network systems.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Lossless clustering of histories in decentralized POMDPs.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

A Decision-Theoretic Approach to Dynamic Sensor Selection in Camera Networks.
Proceedings of the 19th International Conference on Automated Planning and Scheduling, 2009

2008
Optimal and Approximate Q-value Functions for Decentralized POMDPs.
J. Artif. Intell. Res., 2008

Interaction-driven Markov games for decentralized multiagent planning under uncertainty.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Exploiting locality of interaction in factored Dec-POMDPs.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Multiagent Planning Under Uncertainty with Stochastic Communication Delays.
Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, 2008

2006
Point-Based Value Iteration for Continuous POMDPs.
J. Mach. Learn. Res., 2006

Decentralized planning under uncertainty for teams of communicating agents.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

2005
Non-communicative multi-robot coordination in dynamic environments.
Robotics Auton. Syst., 2005

Perseus: Randomized Point-based Value Iteration for POMDPs.
J. Artif. Intell. Res., 2005

Real World Multi-agent Systems: Information Sharing, Coordination and Planning.
Proceedings of the Logic, 2005

Robot Planning in Partially Observable Continuous Domains.
Proceedings of the Robotics: Science and Systems I, 2005

Planning with Continuous Actions in Partially Observable Environments.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

2004
A Point-based POMDP Algorithm for Robot Planning.
Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

2002
Team Coordination among Robotic Soccer Players.
Proceedings of the RoboCup 2002: Robot Soccer World Cup VI, 2002

2001
Clockwork Orange: The Dutch RoboSoccer Team.
Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001


  Loading...