Stephen McAleer

Orcid: 0000-0003-0118-6874

According to our database1, Stephen McAleer authored at least 51 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Policy Space Response Oracles: A Survey.
CoRR, 2024

Scalable Mechanism Design for Multi-Agent Path Finding.
CoRR, 2024

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Automated Design of Affine Maximizer Mechanisms in Dynamic Settings.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
AI Alignment: A Comprehensive Survey.
CoRR, 2023

Llemma: An Open Language Model For Mathematics.
CoRR, 2023

Confronting Reward Model Overoptimization with Constrained RLHF.
CoRR, 2023

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games.
CoRR, 2023

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations.
CoRR, 2023

Steering No-Regret Learners to Optimal Equilibria.
CoRR, 2023

ASP: Learn a Universal Neural Solver!
CoRR, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.
CoRR, 2023

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning.
CoRR, 2023

Algorithms and Complexity for Computing Nash Equilibria in Adversarial Team Games.
Proceedings of the 24th ACM Conference on Economics and Computation, 2023

Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Space Diversity for Non-Transitive Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Language Models can Solve Computer Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Regret-Minimizing Double Oracle for Extensive-Form Games.
Proceedings of the International Conference on Machine Learning, 2023

A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems.
Proceedings of the International Conference on Machine Learning, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.
Proceedings of the International Conference on Machine Learning, 2023

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Online Double Oracle.
Trans. Mach. Learn. Res., 2022

Game Theoretic Rating in N-player general-sum games with Equilibria.
CoRR, 2022

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments.
CoRR, 2022

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games.
CoRR, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
CoRR, 2022

Learning Risk-Averse Equilibria in Multi-Agent Systems.
CoRR, 2022

Anytime PSRO for Two-Player Zero-Sum Games.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks.
Proceedings of the International Conference on Machine Learning, 2022

Proving Theorems using Incremental Learning and Hindsight Experience Replay.
Proceedings of the International Conference on Machine Learning, 2022

Independent Natural Policy Gradient always converges in Markov Potential Games.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Target Entropy Annealing for Discrete Soft Actor-Critic.
CoRR, 2021

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates.
CoRR, 2021

Improving Social Welfare While Preserving Autonomy via a Pareto Mediator.
CoRR, 2021

Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games.
CoRR, 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games.
CoRR, 2021

A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks.
CoRR, 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Auto-Curricula in Two-Player Zero-Sum Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Deep machine learning-assisted multiphoton microscopy to reduce light exposure and expedite imaging.
CoRR, 2020

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Highly Accurate Machine Fault Diagnosis Using Deep Transfer Learning.
IEEE Trans. Ind. Informatics, 2019

Solving the Rubik's cube with deep reinforcement learning and search.
Nat. Mach. Intell., 2019

ColosseumRL: A Framework for Multiagent Reinforcement Learning in N-Player Games.
CoRR, 2019

Curiosity-Driven Multi-Criteria Hindsight Experience Replay.
CoRR, 2019

Solving the Rubik's Cube with Approximate Policy Iteration.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Solving the Rubik's Cube Without Human Knowledge.
CoRR, 2018


  Loading...