Joar Skalse

Orcid: 0009-0005-0848-7188

According to our database, Joar Skalse authored at least 20 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number of three.
Erdős number of three.

Bibliography

2026

Partial identifiability and misspecification in inverse reinforcement learning.

,

Alessandro Abate

Artif. Intell., 2026

2025

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret.

,

,

Alessandro Abate

,

,

,

Joar Max Viktor Skalse

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Partial Identifiability in Inverse Reinforcement Learning for Agents with Non-Exponential Discounting.

Joar Max Viktor Skalse

,

Alessandro Abate

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems.

David Dalrymple

,

,

,

,

,

Sanjit A. Seshia

,

Steve Omohundro

,

Christian Szegedy

,

,

,

Alessandro Abate

,

,

Clark W. Barrett

,

,

,

Jeannette M. Wing

,

Joshua B. Tenenbaum

CoRR, 2024

On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning.

Rohan Subramani

,

Marcus Williams

,

,

,

Charlie Griffin

,

Joar Max Viktor Skalse

Proceedings of the Twelfth International Conference on Learning Representations, 2024

STARC: A General Framework For Quantifying Differences Between Reward Functions.

Joar Max Viktor Skalse

,

,

Sumeet Ramesh Motwani

,

,

,

Alessandro Abate

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification.

Joar Max Viktor Skalse

,

Alessandro Abate

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Goodhart's Law in Reinforcement Learning.

Jacek Karwowski

,

,

,

Klaus Kiendlhofer

,

Charlie Griffin

,

Joar Max Viktor Skalse

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

On the limitations of Markovian rewards to express multi-objective, risk-sensitive, and modal tasks.

,

Alessandro Abate

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning.

Joar Max Viktor Skalse

,

Matthew Farrugia-Roberts

,

,

Alessandro Abate

,

Proceedings of the International Conference on Machine Learning, 2023

Misspecification in Inverse Reinforcement Learning.

,

Alessandro Abate

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Defining and Characterizing Reward Hacking.

,

Nikolaus H. R. Howe

,

Dmitrii Krasheninnikov

,

CoRR, 2022

Defining and Characterizing Reward Gaming.

,

Nikolaus H. R. Howe

,

Dmitrii Krasheninnikov

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Lexicographic Multi-Objective Reinforcement Learning.

,

,

Charlie Griffin

,

Alessandro Abate

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021

Is SGD a Bayesian sampler? Well, almost.

,

Guillermo Valle Pérez

,

,

J. Mach. Learn. Res., 2021

A General Counterexample to Any Decision Theory and Some Responses.

CoRR, 2021

Reinforcement Learning in Newcomblike Environments.

,

Linda Linsefors

,

Caspar Oesterheld

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Safety Properties of Inductive Logic Programming.

,

,

Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

2019

Neural networks are a priori biased towards Boolean functions with low entropy.

,

,

Guillermo Valle Pérez

,

David Martínez-Rubio

,

Vladimir Mikulik

,

CoRR, 2019

Risks from Learned Optimization in Advanced Machine Learning Systems.

,

Chris van Merwijk

,

Vladimir Mikulik

,

,

Scott Garrabrant

CoRR, 2019
