Joseph Isaac Bloom

According to our database¹, Joseph Isaac Bloom authored at least 8 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Building Better Deception Probes Using Targeted Instruction Pairs.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

Auditing Games for Sandbagging.

[BibT_eX]

[DOI]

CoRR, December, 2025

ContextBench: Modifying Contexts for Targeted Latent Activation.

[BibT_eX]

[DOI]

CoRR, June, 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability.

[BibT_eX]

[DOI]

CoRR, March, 2025

Open Problems in Mechanistic Interpretability.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Sparse Autoencoders Do Not Find Canonical Units of Analysis.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Interpreting Attention Layer Outputs with Sparse Autoencoders.

[BibT_eX]

[DOI]

CoRR, 2024

Joseph Isaac Bloom

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...