Sydney Levine

ORCID: 0000-0003-3688-3290

According to our database, Sydney Levine authored at least 37 papers between 2015 and 2026.

Bibliography

2026
PluriHarms: Benchmarking the Full Spectrum of Human Judgments on AI Harm.
CoRR, January, 2026

2025
Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of Value.
CoRR, December, 2025

MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes.
CoRR, October, 2025

Resource Rational Contractualism Should Guide AI Alignment.
CoRR, June, 2025

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas.
CoRR, May, 2025

Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models.
CoRR, February, 2025

Resource-Rational Virtual Bargaining for Moral Judgment: Toward a Probabilistic Cognitive Model.
Top. Cogn. Sci., 2025

Investigating machine moral judgement through the Delphi experiment.
Nat. Mac. Intell., 2025

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgements Based on Empirical Data (Extended Abstract).
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Language Model Alignment in Multilingual Trolley Problems.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

The trade-off between rule-based thinking and mutual benefit in tacit coordination.
Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

I Know I Should: Normative Competence From Biology To AI.
Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

Can Language Models Reason about Individualistic Human Values and Preferences?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data.
Auton. Agents Multi Agent Syst., December, 2024

Imagining and building wise machines: The centrality of AI metacognition.
CoRR, 2024

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation.
CoRR, 2024

Intuitions of Compromise: Utilitarianism vs. Contractualism.
CoRR, 2024

Multilingual Trolley Problems for Language Models.
CoRR, 2024

Resource-rational moral judgment.
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Moral flexibility in applying queuing norms can be explained by contractualist principles and game-theoretic considerations.
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Perceptions of Compromise: Comparing Consequentialist and Contractualist Accounts.
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Who is responsible for collective action?
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Neuro-Symbolic Models of Human Moral Judgment.
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
When it's not out of line to get out of line: Principles of universalizability, welfare, and harm.
Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.
CoRR, 2022

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgement Based on Empirical Data.
CoRR, 2022

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Competing perspectives on building ethical AI: psychological, philosophical, and computational approaches.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Flexibility in Moral Cognition: When is it okay to break the rules?
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

2021
Engineering and reverse-engineering morality.
Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021

2019
What if everybody did that?: Universalization as a mechanism of moral decision-making.
Proceedings of the 41st Annual Meeting of the Cognitive Science Society, 2019

2018
Blaming humans in autonomous vehicle accidents: Shared responsibility across levels of automation.
CoRR, 2018

The Mental Representation of Human Action.
Cogn. Sci., 2018

The Cognitive Mechanisms of Contractualist Moral Decision-Making.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2015
Inference of Intention and Permissibility in Moral Decision Making.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015
