Sydney Levine

Orcid: 0000-0003-3688-3290

According to our database, Sydney Levine authored at least 37 papers between 2015 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

PluriHarms: Benchmarking the Full Spectrum of Human Judgments on AI Harm.

CoRR, January, 2026

2025

Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of Value.

CoRR, December, 2025

MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes.

Udari Madhushani Sehwag

CoRR, October, 2025

Resource Rational Contractualism Should Guide AI Alignment.

CoRR, June, 2025

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas.

CoRR, May, 2025

Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models.

CoRR, February, 2025

Resource-Rational Virtual Bargaining for Moral Judgment: Toward a Probabilistic Cognitive Model.

Top. Cogn. Sci., 2025

Investigating machine moral judgement through the Delphi experiment.

Nat. Mac. Intell., 2025

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgements Based on Empirical Data (Extended Abstract).

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Language Model Alignment in Multilingual Trolley Problems.

Fernando Gonzalez Adauto

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

The trade-off between rule-based thinking and mutual benefit in tacit coordination.

Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

I Know I Should: Normative Competence From Biology To AI.

Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

Can Language Models Reason about Individualistic Human Values and Preferences?

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data.

Auton. Agents Multi Agent Syst., December, 2024

Imagining and building wise machines: The centrality of AI metacognition.

CoRR, 2024

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation.

CoRR, 2024

Intuitions of Compromise: Utilitarianism vs. Contractualism.

Jared Moore

Yejin Choi

Sydney Levine

CoRR, 2024

Multilingual Trolley Problems for Language Models.

Fernando Gonzalez Adauto

CoRR, 2024

Resource-rational moral judgment.

Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Moral flexibility in applying queuing norms can be explained by contractualist principles and game-theoretic considerations.

Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Perceptions of Compromise: Comparing Consequentialist and Contractualist Accounts.

Jared Moore

Sydney Levine

Yejin Choi

Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Who is responsible for collective action?

Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Neuro-Symbolic Models of Human Moral Judgment.

Joseph Kwon

Josh Tenenbaum

Sydney Levine

Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

When it's not out of line to get out of line: Principles of universalizability, welfare, and harm.

Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.

CoRR, 2022

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgement Based on Empirical Data.

CoRR, 2022

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.

Zhijing Jin

Sydney Levine

Fernando Gonzalez Adauto

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Competing perspectives on building ethical AI: psychological, philosophical, and computational approaches.

Sydney Levine

Zhijing Jin

Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Flexibility in Moral Cognition: When is it okay to break the rules?

Joseph Kwon

Josh Tenenbaum

Sydney Levine

Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

2021

Engineering and reverse-engineering morality.

Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021

2019

What if everybody did that?: Universalization as a mechanism of moral decision-making.

Proceedings of the 41st Annual Meeting of the Cognitive Science Society, 2019

2018

Blaming humans in autonomous vehicle accidents: Shared responsibility across levels of automation.

Jean-François Bonnefon

Iyad Rahwan

CoRR, 2018

The Mental Representation of Human Action.

Sydney Levine

Alan M. Leslie

John Mikhail

Cogn. Sci., 2018

The Cognitive Mechanisms of Contractualist Moral Decision-Making.

Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2015

Inference of Intention and Permissibility in Moral Decision Making.

Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015
