We stand with Ukraine

We stand with Ukraine

Dana Arad

According to our database¹, Dana Arad authored at least 12 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Mechanisms of Prompt-Induced Hallucination in Vision-Language Models.

[DOI]

,

Michal Golovanevsky

,

,

Yonatan Belinkov

,

Ritambhara Singh

,

Carsten Eickhoff

,

CoRR, January, 2026

CRISP: Persistent Concept Unlearning via Sparse Autoencoders.

[DOI]

,

,

,

,

Yonatan Belinkov

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Findings of the BlackboxNLP 2025 Shared Task: Localizing Circuits and Causal Variables in Language Models.

[DOI]

,

Yonatan Belinkov

,

,

,

,

,

,

CoRR, November, 2025

BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge Selection.

[DOI]

,

,

,

,

,

Gal Kesten-Pomeranz

,

Yonatan Belinkov

CoRR, October, 2025

HACK: Hallucinations Along Certainty and Knowledge Axes.

[DOI]

,

Jonathan Herzig

,

,

,

,

,

,

Gabriel Stanovsky

,

,

Yonatan Belinkov

CoRR, October, 2025

Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs.

[DOI]

,

,

Yossi Gandelsman

,

Yonatan Belinkov

CoRR, June, 2025

MIB: A Mechanistic Interpretability Benchmark.

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SAEs Are Good for Steering - If You Select the Right Features.

[DOI]

,

,

Yonatan Belinkov

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

ReFACT: Updating Text-to-Image Models by Editing the Text Encoder.

[DOI]

,

,

Yonatan Belinkov

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Predicting Fact Contributions from Query Logs with Machine Learning.

[DOI]

,

,

Proceedings of the Proceedings 27th International Conference on Extending Database Technology, 2024

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines.

[DOI]

,

,

,

,

Yonatan Belinkov

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2022

LearnShapley: Learning to Predict Rankings of Facts Contribution Based on Query Logs.

[DOI]

,

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Loading...