Stefan Heimersheim
According to our database, Stefan Heimersheim authored at least 18 papers between 2023 and 2026.
Bibliography
2026
CoRR, February, 2026
Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution.
CoRR, February, 2026
2025
CoRR, July, 2025
Transformers Don't Need LayerNorm at Inference Time: Scaling LayerNorm Removal to GPT-2 XL and the Implications for Mechanistic Interpretability.
CoRR, July, 2025
Interpretability in Parameter Space: Minimizing Mechanistic Description Length with Attribution-based Parameter Decomposition.
CoRR, January, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
2024
Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs.
CoRR, 2024
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks.
CoRR, 2024
2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023