Stuart Armstrong

Orcid: 0000-0002-1385-3278

According to our database1, Stuart Armstrong authored at least 23 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
CoinRun: Solving Goal Misgeneralisation.
CoRR, 2023

Concept Extrapolation: A Conceptual Primer.
CoRR, 2023

2022
Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI.
CoRR, 2022

The dangers in algorithms learning humans' values and irrationalities.
CoRR, 2022

Missing Mechanisms of Manipulation in the EU AI Act.
Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, 2022

2021
Chess as a Testing Grounds for the Oracle Approach to AI Safety.
Proceedings of the Workshop on Artificial Intelligence Safety 2021 co-located with the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021

2020
Pitfalls of Learning a Reward Function Online.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2018
Counterfactual equivalence for POMDPs, and underlying deterministic environments.
CoRR, 2018

Occam's razor is insufficient to infer the preferences of irrational agents.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
'Indifference' methods for managing agent rewards.
CoRR, 2017

Impossibility of deducing preferences and rationality from human policy.
CoRR, 2017

Good and safe uses of AI Oracles.
CoRR, 2017

Low Impact Artificial Intelligences.
CoRR, 2017

2016
Racing to the precipice: a model of artificial intelligence development.
AI Soc., 2016

Safely Interruptible Agents.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Ghost Hunter - An Augmented Reality Ghost Busting Game.
Proceedings of the Virtual, Augmented and Mixed Reality, 2016

2015
Corrigibility.
Proceedings of the Artificial Intelligence and Ethics, 2015

Motivated Value Selection for Artificial Agents.
Proceedings of the Artificial Intelligence and Ethics, 2015

2014
The errors, insights and lessons of famous AI predictions - and what they mean for the future.
J. Exp. Theor. Artif. Intell., 2014

2012
Thinking Inside the Box: Controlling and Using an Oracle AI.
Minds Mach., 2012

2011
Risks and Mitigation Strategies for Oracle AI.
Proceedings of the Philosophy and Theory of Artificial Intelligence, 2011

Training and learning for crisis management using a virtual simulation/gaming environment.
Cogn. Technol. Work., 2011

Anthropic decision theory
CoRR, 2011


  Loading...