Proceedings of the Workshop on Artificial Intelligence Safety 2021 co-located with the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021

2020

Pitfalls of Learning a Reward Function Online.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2018

Counterfactual equivalence for POMDPs, and underlying deterministic environments.

Stuart Armstrong

CoRR, 2018

Occam's razor is insufficient to infer the preferences of irrational agents.

Stuart Armstrong

Sören Mindermann

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017

'Indifference' methods for managing agent rewards.

Stuart Armstrong

CoRR, 2017

Impossibility of deducing preferences and rationality from human policy.

Stuart Armstrong

Sören Mindermann

CoRR, 2017

Good and safe uses of AI Oracles.

Stuart Armstrong

CoRR, 2017

Low Impact Artificial Intelligences.

Stuart Armstrong

Benjamin Levinstein

CoRR, 2017

2016

Racing to the precipice: a model of artificial intelligence development.

Stuart Armstrong

Nick Bostrom

Carl Shulman

AI Soc., 2016

Safely Interruptible Agents.

Laurent Orseau

Stuart Armstrong

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Ghost Hunter - An Augmented Reality Ghost Busting Game.

Stuart Armstrong

Kyle Morrand

Proceedings of the Virtual, Augmented and Mixed Reality, 2016

2015

Corrigibility.

Proceedings of the Artificial Intelligence and Ethics, 2015

Motivated Value Selection for Artificial Agents.

Stuart Armstrong

Proceedings of the Artificial Intelligence and Ethics, 2015

2014

The errors, insights and lessons of famous AI predictions - and what they mean for the future.

Stuart Armstrong

Kaj Sotala

Seán Sáirséal Ó hÉigeartaigh

J. Exp. Theor. Artif. Intell., 2014

2012

Thinking Inside the Box: Controlling and Using an Oracle AI.

Stuart Armstrong

Anders Sandberg

Nick Bostrom

Minds Mach., 2012

2011

Risks and Mitigation Strategies for Oracle AI.

Stuart Armstrong

Proceedings of the Philosophy and Theory of Artificial Intelligence, 2011

Training and learning for crisis management using a virtual simulation/gaming environment.

Warren E. Walker

Jordan Giddings

Stuart Armstrong

Cogn. Technol. Work., 2011

Anthropic decision theory.

Stuart Armstrong

CoRR, 2011
