We stand with Ukraine

We stand with Ukraine

Matija Franklin

Orcid: 0000-0003-1846-8907

According to our database¹, Matija Franklin authored at least 37 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Positive Alignment: Artificial Intelligence for Human Flourishing.

[DOI]

Ruben Laukkonen

,

Sébastien Krier

,

,

Shamil Chandaria

,

Morten Kringelbach

,

,

,

,

,

Matija Franklin

,

,

Stephanie C. Y. Chan

,

,

,

,

CoRR, May, 2026

Architecting Trust in Artificial Epistemic Agents.

[DOI]

,

Stephanie C. Y. Chan

,

Matija Franklin

,

,

,

Roberta Fischli

,

,

CoRR, March, 2026

Intelligent AI Delegation.

[DOI]

,

Matija Franklin

,

CoRR, February, 2026

2025

Distributional AGI Safety.

[DOI]

,

Matija Franklin

,

,

Sébastien Krier

,

CoRR, December, 2025

Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of Value.

[DOI]

CoRR, December, 2025

Virtual Agent Economies.

[DOI]

,

Matija Franklin

,

,

,

William A. Cunningham

,

,

CoRR, September, 2025

Resource Rational Contractualism Should Guide AI Alignment.

[DOI]

,

Matija Franklin

,

,

Secil Yanik Guyot

,

,

,

,

Joshua B. Tenenbaum

,

Noah D. Goodman

,

,

CoRR, June, 2025

Multi-Agent Risks from Advanced AI.

[DOI]

CoRR, February, 2025

Defense Against the Dark Prompts: Mitigating Best-of-N Jailbreaking with Prompt Evaluation.

[DOI]

Stuart Armstrong

,

Matija Franklin

,

,

Rebecca Gormann

CoRR, February, 2025

Model-Free RL Agents Demonstrate System 1-Like Intentionality.

[DOI]

,

Matija Franklin

CoRR, January, 2025

AI Governance through Markets.

[DOI]

Philip Moreira Tomei

,

,

Matija Franklin

CoRR, January, 2025

LMUNIT: Fine-grained Evaluation with Natural Language Unit Tests.

[DOI]

Jon Saad-Falcon

,

,

William Berrios

,

Nandita Shankar Naik

,

Matija Franklin

,

,

Amanpreet Singh

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024

Beyond Preferences in AI Alignment.

[DOI]

,

,

Matija Franklin

,

CoRR, 2024

The Ethics of Advanced AI Assistants.

[DOI]

,

Arianna Manzini

,

,

Lisa Anne Hendricks

,

,

,

,

,

,

Mikel Rodriguez

,

Seliem El-Sayed

,

,

,

,

,

A. Stevie Bergman

,

,

,

,

Juan Mateos-Garcia

,

Laura Weidinger

,

,

,

,

,

,

,

Victoria Krakovna

,

John Oliver Siy

,

Zeb Kurth-Nelson

,

Amanda McCroskery

,

,

,

Murray Shanahan

,

,

,

,

Yetunde Ibitoye

,

,

,

Sébastien Krier

,

Alexander Reese

,

Sims Witherspoon

,

,

,

,

Matija Franklin

,

Josh A. Goldstein

,

,

,

,

,

Meredith Ringel Morris

,

,

Blaise Agüera y Arcas

,

,

CoRR, 2024

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI.

[DOI]

CoRR, 2024

Are autonomous vehicles blamed differently?

[DOI]

Darko Stojilovic

,

Matija Franklin

,

Bertram F. Malle

,

Carlos Fernandez-Basso

,

,

David A. Lagnado

Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

2023

A Proposal for a Definition of General Purpose Artificial Intelligence Systems.

[DOI]

Carlos Ignacio Gutierrez

,

Anthony Aguirre

,

,

Claire C. Boine

,

Matija Franklin

Digit. Soc., December, 2023

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI.

[DOI]

Ross Gruetzemacher

,

,

,

Christy Manning

,

,

,

José Hernández-Orallo

,

,

Matija Franklin

,

Clíodhna Ní Ghuidhir

,

,

,

Toby D. Pilditch

,

CoRR, 2023

Science Communications for Explainable Artificial Intelligence.

[DOI]

,

Matija Franklin

CoRR, 2023

Strengthening the EU AI Act: Defining Key Terms on AI Manipulation.

[DOI]

Matija Franklin

,

Philip Moreira Tomei

,

Rebecca Gormann

CoRR, 2023

Concept Extrapolation: A Conceptual Primer.

[DOI]

Matija Franklin

,

Rebecca Gormann

,

,

Stuart Armstrong

CoRR, 2023

General Purpose Artificial Intelligence Systems as Group Agents.

[DOI]

Matija Franklin

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Who Is to Blame? Responsibility Attribution in AI Systems vs Human Agents in the Field of Air Crashes.

[DOI]

Jesica Gómez-Sánchez

,

,

Matija Franklin

,

Carlos Fernandez-Basso

,

David A. Lagnado

Proceedings of the Flexible Query Answering Systems - 15th International Conference, 2023

An Unsupervised Approach to Extracting Knowledge from the Relationships Between Blame Attribution on Twitter.

[DOI]

Matija Franklin

,

Trisevgeni Papakonstantinou

,

,

Carlos Fernandez-Basso

,

David A. Lagnado

Proceedings of the Flexible Query Answering Systems - 15th International Conference, 2023

Enhancing Wearable Technologies for Dementia Care: A Cognitive Architecture Approach.

[DOI]

Matija Franklin

,

David A. Lagnado

,

,

,

Proceedings of the Explainable and Transparent AI and Multi-Agent Systems, 2023

Blame attribution in human-AI and human-only systems: Crowdsourcing judgments from Twitter.

[DOI]

Matija Franklin

,

Trisevgeni Papakonstantinou

,

,

Carlos Fernandez-Basso

,

David A. Lagnado

Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022

The Influence of Explainable Artificial Intelligence: Nudging Behaviour or Boosting Capability?

[DOI]

Matija Franklin

CoRR, 2022

Solutions to preference manipulation in recommender systems require knowledge of meta-preferences.

[DOI]

,

Matija Franklin

CoRR, 2022

Preference Change in Persuasive Robotics.

[DOI]

Matija Franklin

,

CoRR, 2022

Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI.

[DOI]

Matija Franklin

,

,

Rebecca Gormann

,

Stuart Armstrong

CoRR, 2022

Human-AI Interaction Paradigm for Evaluating Explainable Artificial Intelligence.

[DOI]

Matija Franklin

,

David A. Lagnado

Proceedings of the HCI International 2022 Posters, 2022

A Method to Check that Participants Really are Imagining Artificial Minds When Ascribing Mental States.

[DOI]

,

Matija Franklin

Proceedings of the HCI International 2022 - Late Breaking Posters, 2022

Missing Mechanisms of Manipulation in the EU AI Act.

[DOI]

Matija Franklin

,

,

Rebecca Gormann

,

Stuart Armstrong

Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, 2022

Explanations that backfire: Explainable artificial intelligence can cause information overload.

[DOI]

Aidah Nakakande Ferguson

,

Matija Franklin

,

David A. Lagnado

Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Causal Framework of Artificial Autonomous Agent Responsibility.

[DOI]

Matija Franklin

,

,

,

David A. Lagnado

Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

The Problem of Behaviour and Preference Manipulation in AI Systems.

[DOI]

,

Matija Franklin

Proceedings of the Workshop on Artificial Intelligence Safety 2022 (SafeAI 2022) co-located with the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022), 2022

2021

Designing Memory Aids for Dementia Patients using Earables.

[DOI]

Matija Franklin

,

David A. Lagnado

,

,

,

Proceedings of the UbiComp/ISWC '21: 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2021 ACM International Symposium on Wearable Computers, 2021

Loading...