David Chanin

Orcid: 0009-0004-6619-5469

According to our database1, David Chanin authored at least 12 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SynthSAEBench: Evaluating Sparse Autoencoders on Scalable Realistic Synthetic Data.
CoRR, February, 2026

Biases in the Blind Spot: Detecting What LLMs Fail to Mention.
CoRR, February, 2026

2025
Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders.
CoRR, August, 2025

Feature Hedging: Correlated Features Break Narrow Sparse Autoencoders.
CoRR, May, 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability.
CoRR, March, 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders.
CoRR, 2024

Analyzing the Generalization and Reliability of Steering Vectors.
CoRR, 2024

Analysing the Generalisation and Reliability of Steering Vectors.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Identifying Linear Relational Concepts in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
Open-source Frame Semantic Parsing.
CoRR, 2023

Neuro-symbolic Commonsense Social Reasoning.
CoRR, 2023


  Loading...