Gregory Farquhar

According to our database1, Gregory Farquhar authored at least 25 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients.
CoRR, 2022

Model-Value Inconsistency as a Signal for Epistemic Uncertainty.
Proceedings of the International Conference on Machine Learning, 2022

2021
Proper Value Equivalence.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Consistent Models and Values.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Transient Non-stationarity and Generalisation in Deep Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning.
J. Mach. Learn. Res., 2020

Weighted QMIX: Expanding Monotonic Value Function Factorisation.
CoRR, 2020

The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning.
CoRR, 2020

Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Growing Action Spaces.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning.
CoRR, 2019

Multi-Agent Common Knowledge Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Survey of Reinforcement Learning Informed by Natural Language.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs.
Proceedings of the 36th International Conference on Machine Learning, 2019

The StarCraft Multi-Agent Challenge.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

DiCE: The Infinitely Differentiable Monte-Carlo Estimator.
Proceedings of the 6th International Conference on Learning Representations, 2018

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

Counterfactual Multi-Agent Policy Gradients.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning.
CoRR, 2017

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning.
CoRR, 2017

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017


  Loading...