We stand with Ukraine

We stand with Ukraine

Valentin Hofmann

Orcid: 0000-0001-6603-3428

According to our database¹, Valentin Hofmann authored at least 40 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Greater accessibility can amplify discrimination in generative AI.

[DOI]

Carolin Holtermann

,

,

,

Valentin Hofmann

,

Katharina von der Wense

,

CoRR, March, 2026

Can Large Language Models Generalize Procedures Across Representations?

[DOI]

,

Valentin Hofmann

,

,

,

,

Anthony G. Cohn

,

Janet B. Pierrehumbert

CoRR, February, 2026

Demographic Probing of Large Language Models Lacks Construct Validity.

[DOI]

,

Neil K. R. Seghal

,

Niyati Malhotra

,

Víctor Orozco-Olvera

,

Ana María Muñoz Boudet

,

Lakshmi Subramanian

,

Sharath Chandra Guntuku

,

Valentin Hofmann

CoRR, January, 2026

IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance.

[DOI]

,

,

Valentin Hofmann

,

Kobi Hackenburg

,

Valentina Pyatkin

,

,

Trans. Assoc. Comput. Linguistics, 2026

2025

Bolmo: Byteifying the Next Generation of Language Models.

[DOI]

Benjamin Minixhofer

,

,

Tomasz Limisiewicz

,

,

Luke Zettlemoyer

,

,

Edoardo M. Ponti

,

,

Valentin Hofmann

CoRR, December, 2025

Measuring what Matters: Construct Validity in Large Language Model Benchmarks.

[DOI]

CoRR, November, 2025

Fluid Language Model Benchmarking.

[DOI]

Valentin Hofmann

,

,

,

,

,

,

,

,

Hannaneh Hajishirzi

,

CoRR, September, 2025

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation.

[DOI]

,

Valentin Hofmann

,

,

,

,

Hannaneh Hajishirzi

,

,

CoRR, August, 2025

BLAB: Brutally Long Audio Bench.

[DOI]

Orevaoghene Ahia

,

Martijn Bartelds

,

,

,

Valentin Hofmann

,

,

Shuyue Stella Li

,

Vishal Puttagunta

,

Mofetoluwa Adeyemi

,

Charishma Buchireddy

,

,

,

Shinji Watanabe

,

,

,

CoRR, May, 2025

SuperBPE: Space Travel for Language Models.

[DOI]

,

Jonathan Hayase

,

Valentin Hofmann

,

,

,

CoRR, March, 2025

Large Language Models Discriminate Against Speakers of German Dialects.

[DOI]

,

Carolin Holtermann

,

Valentin Hofmann

,

,

Katharina von der Wense

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Aligned but Blind: Alignment Increases Implicit Bias by Reducing Awareness of Race.

[DOI]

,

,

Valentin Hofmann

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks.

[DOI]

,

,

Emanuele La Malfa

,

Valentin Hofmann

,

Adrian de Wynter

,

,

,

Michael J. Wooldridge

,

Janet B. Pierrehumbert

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

AI generates covertly racist decisions about people based on their dialect.

[DOI]

Valentin Hofmann

,

Pratyusha Ria Kalluri

,

,

Nat., September, 2024

Geographic Adaptation of Pretrained Language Models.

[DOI]

Valentin Hofmann

,

,

Nikola Ljubesic

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Trans. Assoc. Comput. Linguistics, 2024

Derivational Morphology Reveals Analogical Generalization in Large Language Models.

[DOI]

Valentin Hofmann

,

Leonie Weissweiler

,

David R. Mortensen

,

Hinrich Schütze

,

Janet B. Pierrehumbert

CoRR, 2024

One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks.

[DOI]

,

,

Emanuele La Malfa

,

Valentin Hofmann

,

Adrian de Wynter

,

,

,

Michael J. Wooldridge

,

CoRR, 2024

Dialect prejudice predicts AI decisions about people's character, employability, and criminality.

[DOI]

Valentin Hofmann

,

Pratyusha Ria Kalluri

,

,

CoRR, 2024

Paloma: A Benchmark for Evaluating Language Model Fit.

[DOI]

,

,

Valentin Hofmann

,

,

Ananya Harsh Jha

,

,

,

Evan Pete Walsh

,

,

,

Dirk Groeneveld

,

,

Hanna Hajishirzi

,

,

Kyle Richardson

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization.

[DOI]

Orevaoghene Ahia

,

,

,

Valentin Hofmann

,

Tomasz Limisiewicz

,

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning.

[DOI]

,

Emanuele La Malfa

,

Valentin Hofmann

,

Elle Michelle Yang

,

Anthony G. Cohn

,

Janet B. Pierrehumbert

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models.

[DOI]

,

Valentin Hofmann

,

Valentina Pyatkin

,

,

,

Hinrich Schütze

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Explaining pretrained language models' understanding of linguistic structures using construction grammar.

[DOI]

Leonie Weissweiler

,

Valentin Hofmann

,

Abdullatif Köksal

,

Hinrich Schütze

Frontiers Artif. Intell., February, 2023

Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.

[DOI]

Leonie Weissweiler

,

Valentin Hofmann

,

Anjali Kantharuban

,

,

,

,

,

Atharva Kulkarni

,

Abhishek Vijayakumar

,

,

Hinrich Schütze

,

,

David R. Mortensen

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse.

[DOI]

Valentin Hofmann

,

Hinrich Schütze

,

Janet B. Pierrehumbert

Dataset, January, 2022

Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity.

[DOI]

Valentin Hofmann

,

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse.

[DOI]

Valentin Hofmann

,

Hinrich Schütze

,

Janet B. Pierrehumbert

Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Proceedings of the International Conference on Machine Learning, 2022

The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative.

[DOI]

Leonie Weissweiler

,

Valentin Hofmann

,

Abdullatif Köksal

,

Hinrich Schütze

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

CaMEL: Case Marker Extraction without Labels.

[DOI]

Leonie Weissweiler

,

Valentin Hofmann

,

Masoud Jalili Sabet

,

Hinrich Schütze

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers.

[DOI]

Valentin Hofmann

,

Hinrich Schütze

,

Janet B. Pierrehumbert

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

2021

Modeling Ideological Agenda Setting and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

CoRR, 2021

Superbizarre Is Not Superb: Improving BERT's Interpretations of Complex Words with Derivational Morphology.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

CoRR, 2021

Dynamic Contextualized Word Embeddings.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Generating Derivational Morphology with BERT.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

CoRR, 2020

DagoBERT: Generating Derivational Morphology with a Pretrained Language Model.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

A Graph Auto-encoder Model of Derivational Morphology.

[DOI]

Valentin Hofmann

,

Hinrich Schütze

,

Janet B. Pierrehumbert

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Predicting the Growth of Morphological Families from Social and Linguistic Factors.

[DOI]

Valentin Hofmann

,

Janet B. Pierrehumbert

,

Hinrich Schütze

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Loading...