Matija Franklin

Orcid: 0000-0003-1846-8907

According to our database1, Matija Franklin authored at least 37 papers between 2021 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Positive Alignment: Artificial Intelligence for Human Flourishing.
CoRR, May, 2026

Architecting Trust in Artificial Epistemic Agents.
CoRR, March, 2026

Intelligent AI Delegation.
CoRR, February, 2026

2025
Distributional AGI Safety.
CoRR, December, 2025

Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of Value.
CoRR, December, 2025

Virtual Agent Economies.
CoRR, September, 2025

Resource Rational Contractualism Should Guide AI Alignment.
CoRR, June, 2025

Multi-Agent Risks from Advanced AI.
CoRR, February, 2025

Defense Against the Dark Prompts: Mitigating Best-of-N Jailbreaking with Prompt Evaluation.
CoRR, February, 2025

Model-Free RL Agents Demonstrate System 1-Like Intentionality.
CoRR, January, 2025

AI Governance through Markets.
CoRR, January, 2025

LMUNIT: Fine-grained Evaluation with Natural Language Unit Tests.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Beyond Preferences in AI Alignment.
CoRR, 2024

The Ethics of Advanced AI Assistants.
CoRR, 2024

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI.
CoRR, 2024

Are autonomous vehicles blamed differently?
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

2023
A Proposal for a Definition of General Purpose Artificial Intelligence Systems.
Digit. Soc., December, 2023

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI.
CoRR, 2023

Science Communications for Explainable Artificial Intelligence.
CoRR, 2023

Strengthening the EU AI Act: Defining Key Terms on AI Manipulation.
CoRR, 2023

Concept Extrapolation: A Conceptual Primer.
CoRR, 2023

General Purpose Artificial Intelligence Systems as Group Agents.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Who Is to Blame? Responsibility Attribution in AI Systems vs Human Agents in the Field of Air Crashes.
Proceedings of the Flexible Query Answering Systems - 15th International Conference, 2023

An Unsupervised Approach to Extracting Knowledge from the Relationships Between Blame Attribution on Twitter.
Proceedings of the Flexible Query Answering Systems - 15th International Conference, 2023

Enhancing Wearable Technologies for Dementia Care: A Cognitive Architecture Approach.
Proceedings of the Explainable and Transparent AI and Multi-Agent Systems, 2023

Blame attribution in human-AI and human-only systems: Crowdsourcing judgments from Twitter.
Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022
The Influence of Explainable Artificial Intelligence: Nudging Behaviour or Boosting Capability?
CoRR, 2022

Solutions to preference manipulation in recommender systems require knowledge of meta-preferences.
CoRR, 2022

Preference Change in Persuasive Robotics.
CoRR, 2022

Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI.
CoRR, 2022

Human-AI Interaction Paradigm for Evaluating Explainable Artificial Intelligence.
Proceedings of the HCI International 2022 Posters, 2022

A Method to Check that Participants Really are Imagining Artificial Minds When Ascribing Mental States.
Proceedings of the HCI International 2022 - Late Breaking Posters, 2022

Missing Mechanisms of Manipulation in the EU AI Act.
Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, 2022

Explanations that backfire: Explainable artificial intelligence can cause information overload.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Causal Framework of Artificial Autonomous Agent Responsibility.
Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

The Problem of Behaviour and Preference Manipulation in AI Systems.
Proceedings of the Workshop on Artificial Intelligence Safety 2022 (SafeAI 2022) co-located with the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022), 2022

2021
Designing Memory Aids for Dementia Patients using Earables.
Proceedings of the UbiComp/ISWC '21: 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2021 ACM International Symposium on Wearable Computers, 2021


  Loading...