Michael Hahn

Orcid: 0000-0003-4828-4834

Affiliations:
  • Saarland University, Department of Language Science and Technology, Saarbrücken, Germany
  • Stanford University, Linguistics Department, CA, USA (former, PhD 2022)
  • University of Edinburgh, School of Informatics, UK (former)
  • University of Tübingen, Germany (former)


According to our database1, Michael Hahn authored at least 53 papers between 2011 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning.
CoRR, May, 2026

Barriers to Universal Reasoning With Transformers (And How to Overcome Them).
CoRR, April, 2026

On the Ability of Transformers to Verify Plans.
CoRR, March, 2026

Understanding the Emergence of Seemingly Useless Features in Next-Token Predictors.
CoRR, March, 2026

Discovering Interpretable Algorithms by Decompiling Transformers to RASP.
CoRR, February, 2026

Systematicity between Forms and Meanings across Languages Supports Efficient Communication.
CoRR, January, 2026

Provably Learning Attention with Queries.
CoRR, January, 2026

System-Mediated Attention Imbalances Make Vision-Language Models Say Yes.
CoRR, January, 2026

When Does LoRA Reuse Work? Theoretical Limits and Mechanisms for Recycling LoRAs Without Data Access.
Trans. Mach. Learn. Res., 2026

Tug-of-war between idioms' figurative and literal interpretations in LLMs.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Softmax Transformers are Turing-Complete.
CoRR, November, 2025

Benefits and Limitations of Communication in Multi-Agent Reasoning.
CoRR, October, 2025

The Bayesian Origin of the Probability Weighting Function in Human Representation of Probabilities.
CoRR, October, 2025

Decomposing Representation Space into Interpretable Subspaces with Unsupervised Learning.
CoRR, August, 2025

Position: Pause Recycling LoRAs and Prioritize Mechanisms to Uncover Limits and Effectiveness.
CoRR, June, 2025

Tug-of-war between idiom's figurative and literal meanings in LLMs.
CoRR, June, 2025

Born a Transformer - Always a Transformer?
CoRR, May, 2025

Contextualize-then-Aggregate: Circuits for In-Context Learning in Gemma-2 2B.
CoRR, April, 2025

Emergent Stack Representations in Modeling Counter Languages Using Transformers.
CoRR, February, 2025

Born a Transformer - Always a Transformer? On the Effect of Pretraining on Architectural Abilities.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

A Formal Framework for Understanding Length Generalization in Transformers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Language models can learn implicit multi-hop reasoning, but only if they have lots of training data.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Linguistic Structure from a Bottleneck on Sequential Information Processing.
CoRR, 2024

The Expressive Capacity of State Space Models: A Formal Language Perspective.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InversionView: A General-Purpose Method for Reading Information from Neural Activations.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Separations in the Representational Capabilities of Transformers and Recurrent Architectures.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Information Locality in the Processing of Classifier-Noun Dependencies in Mandarin Chinese.
Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024

More frequent verbs are associated with more diverse valency frames: Efficient principles at the lexicon-grammar interface.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Why are Sensitive Functions Hard for Transformers?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
A Cross-Linguistic Pressure for Uniform Information Density in Word Order.
Trans. Assoc. Comput. Linguistics, 2023

A Theory of Emergent In-Context Learning as Implicit Structure Induction.
CoRR, 2023

How Do Syntactic Statistics and Semantic Plausibility Modulate Local Coherence Effects.
Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022
Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality.
CoRR, 2022

Modeling Fixation Behavior in Reading with Character-level Neural Attention.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Explaining patterns of fusion in morphological paradigms using the memory-surprisal tradeoff.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

2021
Sensitivity as a Complexity Measure for Sequence Classification Tasks.
Trans. Assoc. Comput. Linguistics, 2021

An Information-Theoretic Characterization of Morphological Fusion.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Theoretical Limitations of Self-Attention in Neural Sequence Models.
Trans. Assoc. Comput. Linguistics, 2020

RNNs can generate bounded hierarchical languages with optimal memory.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Tabula nearly rasa: Probing the linguistic knowledge of character-level neural language models trained on unsegmented text.
Trans. Assoc. Comput. Linguistics, 2019

Estimating Predictive Rate-Distortion Curves via Neural Variational Inference.
Entropy, 2019

Character-based Surprisal as a Model of Human Reading in the Presence of Errors.
CoRR, 2019

Character-based Surprisal as a Model of Reading Difficulty in the Presence of Errors.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

2018
Modeling Task Effects in Human Reading with Neural Attention.
CoRR, 2018

Wreath Products of Distributive Forest Algebras.
Proceedings of the 33rd Annual ACM/IEEE Symposium on Logic in Computer Science, 2018

An Information-Theoretic Explanation of Adjective Ordering Preferences.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2016
Modeling Human Reading with Neural Attention.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
Henkin semantics for reasoning with natural language.
J. Lang. Model., 2015

Visibly Counter Languages and the Structure of NC<sup>1</sup>.
Proceedings of the Mathematical Foundations of Computer Science 2015, 2015

2013
CoMeT: Integrating different levels of linguistic modeling for meaning assessment.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

2012
Evaluating the Meaning of Answers to Reading Comprehension Questions: A Semantics-Based Approach.
Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, 2012

2011
On deriving semantic representations from dependencies: A Practical approach for evaluating meaning in learner corpora.
Proceedings of the Computational Dependency Theory [papers from the International Conference on Dependency Linguistics, 2011


  Loading...