Sarah Schwettmann

According to our database, Sarah Schwettmann authored at least 20 papers between 2018 and 2026.

Bibliography

2026
ADAG: Automatically Describing Attribution Graphs.
CoRR, April, 2026

Language Model Circuits Are Sparse in the Neuron Basis.
CoRR, January, 2026

2025
Predictive Concept Decoders: Training Scalable End-to-End Interpretability Assistants.
CoRR, December, 2025

Establishing Best Practices for Building Rigorous Agentic Benchmarks.
CoRR, July, 2025

The Singapore Consensus on Global AI Safety Research Priorities.
CoRR, June, 2025

Line of Sight: On Linear Representations in VLLMs.
CoRR, June, 2025

Establishing Best Practices in Building Rigorous Agentic Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Eliciting Language Model Behaviors with Investigator Agents.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Automatic Discovery of Visual Circuits.
CoRR, 2024

A Multimodal Automated Interpretability Agent.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Nearest Neighbor Normalization Improves Multimodal Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
An Alternative to Regulation: The Case for Public AI.
CoRR, 2023

A Function Interpretation Benchmark for Evaluating Interpretability Methods.
CoRR, 2023

Multimodal Neurons in Pretrained Text-Only Transformers.
CoRR, 2023

FIND: A Function Description Benchmark for Evaluating Interpretability Methods.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multimodal Neurons in Pretrained Text-Only Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Natural Language Descriptions of Deep Visual Features.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Toward a Visual Concept Vocabulary for GAN Latent Space.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Latent Compass: Creation by Navigation.
CoRR, 2020

2018
Evidence for an Intuitive Physics Engine in the Human Brain.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

