Stephan Wäldchen

Orcid: 0000-0001-7629-7021

According to our database1, Stephan Wäldchen authored at least 14 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
A Family of LLMs Liberated from Static Vocabularies.
CoRR, March, 2026

Aleph-Alpha-GermanWeb: Improving German-language LLM pre-training with model-based data curation and synthetic data generation.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Measuring and Guiding Monosemanticity.
CoRR, June, 2025

2024
Interpretability Guarantees with Merlin-Arthur Classifiers.
Dataset, February, 2024

Interpretability Guarantees with Merlin-Arthur Classifiers.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Hardness of Deceptive Certificate Selection.
Proceedings of the Explainable Artificial Intelligence, 2023

2022
Towards explainable artificial intelligence: interpreting neural network classifiers with probabilistic prime implicants.
PhD thesis, 2022

Merlin-Arthur Classifiers: Formal Interpretability with Interactive Black Boxes.
CoRR, 2022

Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four.
Proceedings of the International Conference on Machine Learning, 2022

A Complete Characterisation of ReLU-Invariant Distributions.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
The Computational Complexity of Understanding Binary Classifier Decisions.
J. Artif. Intell. Res., 2021

2019
A Rate-Distortion Framework for Explaining Neural Network Decisions.
CoRR, 2019

The Computational Complexity of Understanding Network Decisions.
CoRR, 2019

Unmasking Clever Hans Predictors and Assessing What Machines Really Learn.
CoRR, 2019


  Loading...