Valentin Hofmann

Orcid: 0000-0001-6603-3428

According to our database1, Valentin Hofmann authored at least 23 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Dialect prejudice predicts AI decisions about people's character, employability, and criminality.
CoRR, 2024

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models.
CoRR, 2024

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning.
CoRR, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
CoRR, 2024

2023
Explaining pretrained language models' understanding of linguistic structures using construction grammar.
Frontiers Artif. Intell., February, 2023

Paloma: A Benchmark for Evaluating Language Model Fit.
CoRR, 2023

Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse.
Dataset, January, 2022

Geographic Adaptation of Pretrained Language Models.
CoRR, 2022

Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse.
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology.
Proceedings of the International Conference on Machine Learning, 2022

The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

CaMEL: Case Marker Extraction without Labels.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

2021
Modeling Ideological Agenda Setting and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity.
CoRR, 2021

Superbizarre Is Not Superb: Improving BERT's Interpretations of Complex Words with Derivational Morphology.
CoRR, 2021

Dynamic Contextualized Word Embeddings.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Generating Derivational Morphology with BERT.
CoRR, 2020

DagoBERT: Generating Derivational Morphology with a Pretrained Language Model.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

A Graph Auto-encoder Model of Derivational Morphology.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Predicting the Growth of Morphological Families from Social and Linguistic Factors.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020


  Loading...