Jakob N. Foerster

CoRR, 2022

Learning to Coordinate with Humans using Action Features.

[BibT_eX]

[DOI]

CoRR, 2022

Proximal Learning With Opponent-Learning Awareness.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Equivariant Networks for Zero-Shot Coordination.

[BibT_eX]

[DOI]

Darius Muglich

Elise van der Pol

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Influencing Long-Term Behavior in Multiagent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Grounding Aleatoric Uncertainty for Unsupervised Environment Design.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Self-Explaining Deviations for Coordination.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Off-Team Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Discovered Policy Optimisation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

COLA: Consistent Learning with Opponent-Learning Awareness.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Communicating via Markov Decision Processes.

[BibT_eX]

[DOI]

Samuel Sokota

Proceedings of the International Conference on Machine Learning, 2022

Evolving Curricula with Regret-Based Environment Design.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Generalized Beliefs for Cooperative AI.

[BibT_eX]

[DOI]

Darius Muglich

Luisa M. Zintgraf

Proceedings of the International Conference on Machine Learning, 2022

Model-Free Opponent Shaping.

[BibT_eX]

[DOI]

Christopher Lu

Timon Willi

Proceedings of the International Conference on Machine Learning, 2022

Mirror Learning: A Unifying Framework of Policy Optimisation.

[BibT_eX]

[DOI]

Jakub Grudzien Kuba

Proceedings of the International Conference on Machine Learning, 2022

A Fine-Tuning Approach to Belief State Modeling.

[BibT_eX]

[DOI]

Jakob Nicolaus Foerster

Noam Brown

Proceedings of the Tenth International Conference on Learning Representations, 2022

Lyapunov Exponents for Diversity in Differentiable Games.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Centralized Model and Exploration Policy for Multi-Agent RL.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021

Reinforcement learning enhanced quantum-inspired algorithm for combinatorial optimization.

[BibT_eX]

[DOI]

Mach. Learn. Sci. Technol., 2021

Don't Sweep your Learning Rate under the Rug: A Closer Look at Cross-modal Transfer of Pretrained Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Implicit Communication as Minimum Entropy Coupling.

[BibT_eX]

[DOI]

Samuel Sokota

CoRR, 2021

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings.

[BibT_eX]

[DOI]

CoRR, 2021

Quasi-Equivalence Discovery for Zero-Shot Emergent Communication.

[BibT_eX]

[DOI]

CoRR, 2021

Off-Belief Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Neural Pseudo-Label Optimism for the Bank Loan Problem.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Replay-Guided Adversarial Environment Design.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

K-level Reasoning for Zero-Shot Coordination in Hanabi.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A New Formalism, Method and Open Issues for Zero-Shot Coordination.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Trajectory Diversity for Zero-Shot Coordination.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Off-Belief Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Trajectory Diversity for Zero-Shot Coordination.

[BibT_eX]

[DOI]

Andrei Lupu

Hengyuan Hu

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Tabish Rashid

Mikayel Samvelyan

J. Mach. Learn. Res., 2020

Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations.

[BibT_eX]

[DOI]

CoRR, 2020

The Struggles of Feature-Based Explanations: Shapley Values vs. Minimal Sufficient Subsets.

[BibT_eX]

[DOI]

CoRR, 2020

The Hanabi challenge: A new frontier for AI research.

[BibT_eX]

[DOI]

Artif. Intell., 2020

Compositionality and Capacity in Emergent Languages.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian.

[BibT_eX]

[DOI]

Alexander Peysakhovich

Aldo Pacchiano

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

"Other-Play" for Zero-Shot Coordination.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

On the interaction between supervision and self-play in emergent communication.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Hengyuan Hu

Proceedings of the 8th International Conference on Learning Representations, 2020

Capacity, Bandwidth, and Compositionality in Emergent Language Learning.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Improving Policies via Search in Cooperative Partially Observable Games.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Exploratory Combinatorial Optimization with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Differentiable Game Mechanics.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2019

Robust Domain Randomization for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Can I Trust the Explainer? Verifying Post-hoc Explanatory Methods.

[BibT_eX]

[DOI]

CoRR, 2019

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Multi-Agent Common Knowledge Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Survey of Reinforcement Learning Informed by Natural Language.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Stable Opponent Shaping in Differentiable Games.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Seeded self-play for language learning.

[BibT_eX]

[DOI]

Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019

The StarCraft Multi-Agent Challenge.

[BibT_eX]

[DOI]

Mikayel Samvelyan

Tabish Rashid

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

On the Pitfalls of Measuring Emergent Communication.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

Deep multi-agent reinforcement learning.

[BibT_eX]

[DOI]

PhD thesis, 2018

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Tabish Rashid

Mikayel Samvelyan

Proceedings of the 35th International Conference on Machine Learning, 2018

DiCE: The Infinitely Differentiable Monte Carlo Estimator.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

The Mechanics of n-Player Differentiable Games.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Learning with Opponent-Learning Awareness.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Pommerman: A Multi-Agent Playground.

[BibT_eX]

[DOI]

Proceedings of the Joint Proceedings of the AIIDE 2018 Workshops co-located with 14th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE 2018), 2018

Counterfactual Multi-Agent Policy Gradients.

[BibT_eX]

[DOI]

Triantafyllos Afouras

Nantas Nardelli