Thomas W. Anthony

Affiliations:

DeepMind, UK

According to our database, Thomas W. Anthony authored at least 21 papers between 2017 and 2023.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2023

Learning to play against any mixture of opponents.

Max Olan Smith

Thomas W. Anthony

Michael P. Wellman

Frontiers Artif. Intell., February, 2023

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning.

Trans. Mach. Learn. Res., 2023

Strategic Knowledge Transfer.

Max Olan Smith

Thomas W. Anthony

Michael P. Wellman

J. Mach. Learn. Res., 2023

Evaluating Agents using Social Choice Theory.

CoRR, 2023

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas.

Edgar A. Duéñez-Guzmán

CoRR, 2023

2022

Figure Data for the paper "Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning".

Dataset, October, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.

CoRR, 2022

Developing, evaluating and scaling learning agents in multi-agent environments.

AI Commun., 2022

Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent.

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021

Expert iteration

Thomas W. Anthony

PhD thesis, 2021

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization.

Julien Pérolat

Rémi Munos

Jean-Baptiste Lespiau

Proceedings of the 38th International Conference on Machine Learning, 2021

Iterative Empirical Game Solving via Single Policy Best Response.

Max Olan Smith

Thomas Anthony

Michael P. Wellman

Proceedings of the 9th International Conference on Learning Representations, 2021

On the role of planning in model-based deep reinforcement learning.

Jessica B. Hamrick

Abram L. Friesen

Feryal M. P. Behbahani

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Learning to Play against Any Mixture of Opponents.

CoRR, 2020

Learning to Play No-Press Diplomacy with Best Response Policy Iteration.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Smooth markets: A basic mechanism for organizing gradient-based learners.

David Balduzzi

Wojciech M. Czarnecki

Proceedings of the 8th International Conference on Learning Representations, 2020

Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games.

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

OpenSpiel: A Framework for Reinforcement Learning in Games.

CoRR, 2019

Policy Gradient Search: Online Planning and Expert Iteration without Search Trees.

CoRR, 2019

2017

Thinking Fast and Slow with Deep Learning and Tree Search.

Thomas Anthony

Zheng Tian

David Barber

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
