Sarah Schwettmann

According to our database¹, Sarah Schwettmann authored at least 20 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

ADAG: Automatically Describing Attribution Graphs.

[BibT_eX]

[DOI]

CoRR, April, 2026

Language Model Circuits Are Sparse in the Neuron Basis.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

Predictive Concept Decoders: Training Scalable End-to-End Interpretability Assistants.

[BibT_eX]

[DOI]

CoRR, December, 2025

Establishing Best Practices for Building Rigorous Agentic Benchmarks.

[BibT_eX]

[DOI]

CoRR, July, 2025

The Singapore Consensus on Global AI Safety Research Priorities.

[BibT_eX]

[DOI]

Vidhisha Balachandran

Bryan Low Kian Hsiang

CoRR, June, 2025

Line of Sight: On Linear Representations in VLLMs.

[BibT_eX]

[DOI]

CoRR, June, 2025

Establishing Best Practices in Building Rigorous Agentic Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Eliciting Language Model Behaviors with Investigator Agents.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Automatic Discovery of Visual Circuits.

[BibT_eX]

[DOI]

CoRR, 2024

A Multimodal Automated Interpretability Agent.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Nearest Neighbor Normalization Improves Multimodal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

An Alternative to Regulation: The Case for Public AI.

[BibT_eX]

[DOI]

CoRR, 2023

A Function Interpretation Benchmark for Evaluating Interpretability Methods.

[BibT_eX]

[DOI]

CoRR, 2023

Multimodal Neurons in Pretrained Text-Only Transformers.

[BibT_eX]

[DOI]

Sarah Schwettmann

Neil Chowdhury

Antonio Torralba

CoRR, 2023

FIND: A Function Description Benchmark for Evaluating Interpretability Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multimodal Neurons in Pretrained Text-Only Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Natural Language Descriptions of Deep Visual Features.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Toward a Visual Concept Vocabulary for GAN Latent Space.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Latent Compass: Creation by Navigation.

[BibT_eX]

[DOI]

Sarah Schwettmann

Hendrik Strobelt

Mauro Martino

CoRR, 2020

2018

Evidence for an Intuitive Physics Engine in the Human Brain.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

Sarah Schwettmann

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...