Thilo Hagendorff
Orcid: 0000-0002-4633-2153
According to our database1,
Thilo Hagendorff authored at least 41 papers
between 2017 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
CoRR, May, 2026
"Dark Triad" Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior.
CoRR, March, 2026
Emergently Misaligned Language Models Show Behavioral Self-Awareness That Shifts With Subsequent Realignment.
CoRR, February, 2026
Compromising Honesty and Harmlessness in Language Models via Covert Deception Attacks.
Trans. Mach. Learn. Res., 2026
2025
Speciesism in AI: Evaluating Discrimination Against Animals in Large Language Models.
CoRR, August, 2025
CoRR, July, 2025
PRIDE - Parameter-Efficient Reduction of Identity Discrimination for Equality in LLMs.
CoRR, July, 2025
Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models.
CoRR, April, 2025
CoRR, February, 2025
Trans. Mach. Learn. Res., 2025
2024
Minds Mach., December, 2024
J. Exp. Theor. Artif. Intell., November, 2024
CoRR, 2024
A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions.
CoRR, 2024
2023
Digit. Soc., December, 2023
AI Ethics, May, 2023
Ethical considerations and statistical analysis of industry involvement in machine learning research.
AI Soc., February, 2023
AI Ethics, February, 2023
Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT.
Nat. Comput. Sci., 2023
CoRR, 2023
Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods.
CoRR, 2023
CoRR, 2023
Speciesist bias in AI: how AI applications perpetuate discrimination and unfair outcomes against animals.
AI Ethics, 2023
2022
CoRR, 2022
Why we need biased AI - How including cognitive and ethical machine biases can enhance AI systems.
CoRR, 2022
2021
Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning.
Minds Mach., 2021
Forbidden knowledge in machine learning reflections on the limits of research and publication.
AI Soc., 2021
2020
Minds Mach., 2020
Ethical behavior in humans and machines - Evaluating training data quality for beneficial machine learning.
CoRR, 2020
The Big Picture: Ethical Considerations and Statistical Analysis of Industry Involvement in Machine Learning Research.
CoRR, 2020
2019
Ethics Inf. Technol., 2019
2017