Simon Ostermann

Orcid: 0000-0002-0899-0657

Affiliations:
  • DFKI, Saarbrücken, Germany
  • Saarland University, Saarbrücken, Germany


According to our database, Simon Ostermann authored at least 66 papers between 2013 and 2026.

Bibliography

2026
The Latin Substrate: How Language Models Represent and Mediate Script Choice.
CoRR, May, 2026

DFKI-MLT at SemEval-2026 TASK 7: Steering Multilingual Models Towards Cultural Knowledge.
CoRR, May, 2026

Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection.
CoRR, May, 2026

Judge Circuits.
CoRR, May, 2026

Enhancing Multilingual Counterfactual Generation through Alignment-as-Preference Optimization.
CoRR, May, 2026

DualFact+: A Multimodal Fact Verification Framework for Procedural Video Understanding.
CoRR, April, 2026

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models.
CoRR, April, 2026

Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms.
CoRR, April, 2026

From Weights to Activations: Is Steering the Next Frontier of Adaptation?
CoRR, April, 2026

ReasonXL: Shifting LLM Reasoning Language Without Sacrificing Performance.
CoRR, April, 2026

CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark.
CoRR, January, 2026

Can Large Language Models Still Explain Themselves? Investigating the Impact of Quantization on Self-Explanations.
CoRR, January, 2026

Assessing Web Search Credibility and Response Groundedness in Chat Assistants.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models.
CoRR, October, 2025

Cross-Prompt Encoder for Low-Performing Languages.
CoRR, August, 2025

The AI Language Proficiency Monitor - Tracking the Progress of LLMs on Multilingual Benchmarks.
CoRR, July, 2025

OpenFActScore: Open-Source Atomic Evaluation of Factuality in Text Generation.
CoRR, July, 2025

On Multilingual Encoder Language Model Compression for Low-Resource Languages.
CoRR, May, 2025

Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability.
CoRR, May, 2025

AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models.
CoRR, March, 2025

SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval.
Dataset, March, 2025

The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs.
CoRR, February, 2025

Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution.
CoRR, January, 2025

Saarland-Groningen at NADI 2025 Shared Task: Effective Dialectal Arabic Speech Processing under Data Constraints.
Proceedings of The Third Arabic Natural Language Processing Conference: ArabicNLP 2025, 2025

SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval.
Proceedings of the 19th International Workshop on Semantic Evaluation, 2025

Task Prompt Vectors: Effective Initialization Through Multi-task Soft Prompt Transfer.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track and Applied Data Science Track, 2025

Soft Language Prompts for Language Transfer.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Building Common Ground in Dialogue: A Survey.
Proceedings of the 2nd Language Understanding in the Human-Machine Era Workshop, 2025

Cross-Prompt Encoder for Low-Performing Languages.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Multilingual Political Views of Large Language Models: Identification and Steering.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Modular Arithmetic: Language Models Solve Math Digit by Digit.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Multilingual Datasets for Custom Input Extraction and Explanation Requests Parsing in Conversational XAI Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Large Language Models for Multilingual Previously Fact-Checked Claim Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

dfkinit2b at CheckThat! 2025: Leveraging LLMs and Ensemble of Methods for Multilingual Claim Normalization.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Only for the Unseen Languages, Say the Llamas: On the Efficacy of Language Adapters for Cross-lingual Transfer in English-centric LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2025

Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2025

2024
LowREm: A Repository of Word Embeddings for 87 Low-Resource Languages Enhanced with Multilingual Graph Knowledge.
CoRR, 2024

Probing Context Localization of Polysemous Words in Pre-trained Language Model Sub-Layers.
CoRR, 2024

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning.
CoRR, 2024

Generative Large Language Models in Automated Fact-Checking: A Survey.
CoRR, 2024

Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters.
CoRR, 2024

HybridBERT - Making BERT Pretraining More Efficient Through Hybrid Mixture of Attention Mechanisms.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2024

UoM-DFKI submission to the low resource shared task.
Proceedings of the 21st International Conference on Spoken Language Translation, 2024

A Comparison of Different Tokenization Methods for the Georgian Language.
Proceedings of the 7th International Conference on Natural Language and Speech Processing, 2024

CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MMAR: Multilingual and Multimodal Anaphora Resolution in Instructional Videos.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

KI-basierte Analyse von E-Portfolios.
Proceedings of the DELFI 2024, 2024


DFKI-MLST at DialAM-2024 Shared Task: System Description.
Proceedings of the 11th Workshop on Argument Mining, ArgMining 2024, Bangkok, Thailand, 2024

2023
Where exactly does contextualization in a PLM happen?
CoRR, 2023

Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Investigating the Encoding of Words in BERT's Neurons Using Feature Textualization.
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2023

2019
MCScript2.0: A Machine Comprehension Corpus Focused on Script Events and Participants.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

2018
SemEval-2018 Task 11: Machine Comprehension Using Commonsense Knowledge.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Mapping Texts to Scripts: An Entailment Study.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

MCScript: A Novel Dataset for Assessing Machine Comprehension Using Script Knowledge.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017
Aligning Script Events with Narrative Texts.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

2016
InScript: Narrative texts annotated with script information.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
Annotating Entailment Relations for Shortanswer Questions.
Proceedings of the 2nd Workshop on Natural Language Processing Techniques for Educational Applications, 2015

2014
CSGS: Adapting a Short Answer Scoring System for Multiple-choice Reading Comprehension Exercises.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

2013
Ingredients and Recipe for a Robust Mobile Speech-Enabled Cooking Assistant for German.
Proceedings of the KI 2013: Advances in Artificial Intelligence, 2013

