Sydney Levine

Orcid: 0000-0003-3688-3290

According to our database1, Sydney Levine authored at least 26 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Resource Rational Contractualism Should Guide AI Alignment.
CoRR, June, 2025

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas.
CoRR, May, 2025

Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models.
CoRR, February, 2025

Resource-Rational Virtual Bargaining for Moral Judgment: Toward a Probabilistic Cognitive Model.
Top. Cogn. Sci., 2025

Investigating machine moral judgement through the Delphi experiment.
Nat. Mac. Intell., 2025

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgements Based on Empirical Data (Extended Abstract).
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Language Model Alignment in Multilingual Trolley Problems.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can Language Models Reason about Individualistic Human Values and Preferences?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data.
Auton. Agents Multi Agent Syst., December, 2024

Imagining and building wise machines: The centrality of AI metacognition.
CoRR, 2024

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation.
CoRR, 2024

Intuitions of Compromise: Utilitarianism vs. Contractualism.
CoRR, 2024

Multilingual Trolley Problems for Language Models.
CoRR, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
When it's not out of line to get out of line: Principles of universalizability, welfare, and harm.
Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.
CoRR, 2022

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgement Based on Empirical Data.
CoRR, 2022

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Competing perspectives on building ethical AI: psychological, philosophical, and computational approaches.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Flexibility in Moral Cognition: When is it okay to break the rules?
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

2021
Engineering and reverse-engineering morality.
Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021

2019
What if everybody did that?: Universalization as a mechanism of moral decision-making.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

2018
Blaming humans in autonomous vehicle accidents: Shared responsibility across levels of automation.
CoRR, 2018

The Mental Representation of Human Action.
Cogn. Sci., 2018

The Cognitive Mechanisms of Contractualist Moral Decision-Making.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2015
Inference of Intention and Permissibility in Moral Decision Making.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015


  Loading...