David Chanin

Orcid: 0009-0004-6619-5469

According to the csauthors.net database, David Chanin authored at least 14 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2026

Are Sparse Autoencoder Benchmarks Reliable?

CoRR, May, 2026

SynthSAEBench: Evaluating Sparse Autoencoders on Scalable Realistic Synthetic Data.

,

Adrià Garriga-Alonso

CoRR, February, 2026

Biases in the Blind Spot: Detecting What LLMs Fail to Mention.

Iván Arcuschin

,

,

Adrià Garriga-Alonso

,

Oana-Maria Camburu

CoRR, February, 2026

2025

Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders.

,

Adrià Garriga-Alonso

CoRR, August, 2025

Feature Hedging: Correlated Features Break Narrow Sparse Autoencoders.

,

,

Adrià Garriga-Alonso

CoRR, May, 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability.

,

,

,

,

Joseph Isaac Bloom

,

,

,

,

Callum McDougall

,

,

Matthew Wearden

,

,

,

CoRR, March, 2025

A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders.

,

James Wilken-Smith

,

,

Hardik Bhatnagar

,

Satvik Golechha

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability.

,

,

,

,

Joseph Isaac Bloom

,

,

,

,

Callum McDougall

,

,

,

Matthew Wearden

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders.

,

James Wilken-Smith

,

,

Hardik Bhatnagar

,

CoRR, 2024

Analyzing the Generalization and Reliability of Steering Vectors.

,

,

,

Dimitrios Kanoulas

,

,

Adrià Garriga-Alonso

,

CoRR, 2024

Analysing the Generalisation and Reliability of Steering Vectors.

,

,

,

,

Dimitrios Kanoulas

,

Adrià Garriga-Alonso

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Identifying Linear Relational Concepts in Large Language Models.

,

,

Oana-Maria Camburu

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023

Open-source Frame Semantic Parsing.

CoRR, 2023

Neuro-symbolic Commonsense Social Reasoning.

,

CoRR, 2023
