Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the 28th International Joint Conference on Artificial Intelligence, 2019

Modeling AGI Safety Frameworks with Causal Influence Diagrams.

[BibT_eX]

[DOI]

Tom Everitt

Ramana Kumar

Victoria Krakovna

Shane Legg

Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the 28th International Joint Conference on Artificial Intelligence, 2019

2018

Scaling shared model governance via model splitting.

[BibT_eX]

[DOI]

CoRR, 2018

Scalable agent alignment via reward modeling: a research direction.

[BibT_eX]

[DOI]

CoRR, 2018

Modeling Friends and Foes.

[BibT_eX]

[DOI]

Pedro A. Ortega

Shane Legg

CoRR, 2018

Measuring and avoiding side effects using relative reachability.

[BibT_eX]

[DOI]

CoRR, 2018

Agents and Devices: A Relative Definition of Agency.

[BibT_eX]

[DOI]

Laurent Orseau

Simon McGregor McGill

Shane Legg

CoRR, 2018

Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents.

[BibT_eX]

[DOI]

Joel Z. Leibo

Cyprien de Masson d'Autume

Antonio García Castañeda

CoRR, 2018

Reward learning from human preferences and demonstrations in Atari.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Noisy Networks For Exploration.

[BibT_eX]

[DOI]

Meire Fortunato

Mohammad Gheshlaghi Azar

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

AI Safety Gridworlds.

[BibT_eX]

[DOI]

CoRR, 2017

Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017.

[BibT_eX]

[DOI]

Danilo Jimenez Rezende

Adam Santoro

Tom Schaul

Christopher Summerfield

CoRR, 2017

Symmetric Decomposition of Asymmetric Games.

[BibT_eX]

[DOI]

CoRR, 2017

Noisy Networks for Exploration.

[BibT_eX]

[DOI]

Meire Fortunato

Mohammad Gheshlaghi Azar

CoRR, 2017

Reinforcement Learning with a Corrupted Reward Channel.

[BibT_eX]

[DOI]

CoRR, 2017

Deep Reinforcement Learning from Human Preferences.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Reinforcement Learning with a Corrupted Reward Channel.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Soft-Bayes: Prod for Mixtures of Experts with Log-Loss.

[BibT_eX]

[DOI]

Laurent Orseau

Tor Lattimore

Shane Legg

Proceedings of the International Conference on Algorithmic Learning Theory, 2017

2016

DeepMind Lab.

[BibT_eX]

[DOI]

CoRR, 2016

2015

Human-level control through deep reinforcement learning.

[BibT_eX]

[DOI]

Nat., 2015

Massively Parallel Methods for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Vedavyas Panneershelvam

CoRR, 2015

Letter to the Editor: Research Priorities for Robust and Beneficial Artificial Intelligence: An Open Letter.

[BibT_eX]

[DOI]

AI Mag., 2015

2014

From academia to industry: The story of Google DeepMind.

[BibT_eX]

[DOI]

Shane Legg

Proceedings of the 2014 Imperial College Computing Student Workshop, 2014

2011

An Approximation of the Universal Intelligence Measure.

[BibT_eX]

[DOI]

Shane Legg

Joel Veness

Proceedings of the Algorithmic Probability and Friends. Bayesian Prediction and Artificial Intelligence, 2011

2007

Algorithmic probability.

[BibT_eX]

[DOI]

Marcus Hutter

Shane Legg

Paul M. B. Vitányi

Scholarpedia, 2007

Universal Intelligence: A Definition of Machine Intelligence.

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter

Minds Mach., 2007

Temporal Difference Updating without a Learning Rate.

[BibT_eX]

[DOI]

Marcus Hutter

Shane Legg

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

2006

Fitness uniform optimization.

[BibT_eX]

[DOI]

Marcus Hutter

Shane Legg

IEEE Trans. Evol. Comput., 2006

A Formal Measure of Machine Intelligence

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter

CoRR, 2006

Is There an Elegant Universal Theory of Prediction?

[BibT_eX]

[DOI]

Shane Legg

Proceedings of the Algorithmic Learning Theory, 17th International Conference, 2006

Tests of Machine Intelligence.

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter

Proceedings of the 50 Years of Artificial Intelligence, 2006

A Collection of Definitions of Intelligence.

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter

Proceedings of the Advances in Artificial General Intelligence: Concepts, Architectures and Algorithms, 2006

2005

A Universal Measure of Intelligence for Artificial Agents.

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter

Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Fitness uniform deletion: a simple way to preserve diversity.

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter

Proceedings of the Genetic and Evolutionary Computation Conference, 2005

2004

Tournament versus fitness uniform selection.

[BibT_eX]

[DOI]

Shane Legg

Marcus Hutter