John Aslanides

According to our database1, John Aslanides authored at least 17 papers between 2017 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Fine-Tuning Language Models via Epistemic Neural Networks.
CoRR, 2022

Improving alignment of dialogue agents via targeted human judgements.
CoRR, 2022

Teaching language models to support answers with verified quotes.
CoRR, 2022

Fine-tuning language models to find agreement among humans with diverse preferences.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Red Teaming Language Models with Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Scaling Language Models: Methods, Analysis & Insights from Training Gopher.
CoRR, 2021

2020
Acme: A Research Framework for Distributed Reinforcement Learning.
CoRR, 2020

Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning.
CoRR, 2020

Behaviour Suite for Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

A General Approach to Fairness with Optimal Transport.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
TF-Replicator: Distributed Machine Learning for Researchers.
CoRR, 2019

When to use parametric models in reinforcement learning?
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Randomized Prior Functions for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Generalised Discount Functions applied to a Monte-Carlo AImu Implementation.
CoRR, 2017

AIXIjs: A Software Demo for General Reinforcement Learning.
CoRR, 2017

Universal Reinforcement Learning Algorithms: Survey and Experiments.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Generalised Discount Functions applied to a Monte-Carlo AI u Implementation.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017


  Loading...