Thomas W. Anthony

Affiliations:
  • DeepMind, UK


According to our database1, Thomas W. Anthony authored at least 20 papers between 2017 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Learning to play against any mixture of opponents.
Frontiers Artif. Intell., February, 2023

Strategic Knowledge Transfer.
J. Mach. Learn. Res., 2023

Evaluating Agents using Social Choice Theory.
CoRR, 2023

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas.
CoRR, 2023

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning.
CoRR, 2023

2022
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments.
CoRR, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.
CoRR, 2022

Developing, evaluating and scaling learning agents in multi-agent environments.
AI Commun., 2022

Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Iterative Empirical Game Solving via Single Policy Best Response.
Proceedings of the 9th International Conference on Learning Representations, 2021

On the role of planning in model-based deep reinforcement learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Learning to Play against Any Mixture of Opponents.
CoRR, 2020

Learning to Play No-Press Diplomacy with Best Response Policy Iteration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Smooth markets: A basic mechanism for organizing gradient-based learners.
Proceedings of the 8th International Conference on Learning Representations, 2020

Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
OpenSpiel: A Framework for Reinforcement Learning in Games.
CoRR, 2019

Policy Gradient Search: Online Planning and Expert Iteration without Search Trees.
CoRR, 2019

2017
Thinking Fast and Slow with Deep Learning and Tree Search.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017


  Loading...