Philipp Mondorf

According to our database¹, Philipp Mondorf authored at least 18 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Reasoning that Travels: Dissecting How Chain-of-Thought Transfers Across Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

LPDS: Evaluating LLM Robustness Through Logic-Preserving Difficulty Scaling.

[BibT_eX]

[DOI]

CoRR, May, 2026

Tracing Uncertainty in Language Model "Reasoning".

[BibT_eX]

[DOI]

CoRR, May, 2026

LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models.

[BibT_eX]

[DOI]

Brian Rabern

Philipp Mondorf

Barbara Plank

CoRR, February, 2026

If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models.

[BibT_eX]

[DOI]

Jasmin Orth

Philipp Mondorf

Barbara Plank

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Language Models Learn Universal Representations of Numbers and Here's Why You Should Care.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Unravelling the Mechanisms of Manipulating Numbers in Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods.

[BibT_eX]

[DOI]

CoRR, October, 2025

Grokking ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior.

[BibT_eX]

[DOI]

CoRR, May, 2025

Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality.

[BibT_eX]

[DOI]

CoRR, April, 2025

Reason to Rote: Rethinking Memorization in Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models.

[BibT_eX]

[DOI]

Philipp Mondorf

Sondre Wold

Barbara Plank

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

2024

Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination.

[BibT_eX]

[DOI]

CoRR, 2024

Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models - A Survey.

[BibT_eX]

[DOI]

Philipp Mondorf

Barbara Plank

CoRR, 2024

Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models.

[BibT_eX]

[DOI]

Philipp Mondorf

Barbara Plank

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning.

[BibT_eX]

[DOI]

Philipp Mondorf

Barbara Plank

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Philipp Mondorf

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...