Dana Arad

According to our database, Dana Arad authored at least 12 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Mechanisms of Prompt-Induced Hallucination in Vision-Language Models.
CoRR, January, 2026

2025
Findings of the BlackboxNLP 2025 Shared Task: Localizing Circuits and Causal Variables in Language Models.
CoRR, November, 2025

BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge Selection.
CoRR, October, 2025

HACK: Hallucinations Along Certainty and Knowledge Axes.
CoRR, October, 2025

CRISP: Persistent Concept Unlearning via Sparse Autoencoders.
CoRR, August, 2025

Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs.
CoRR, June, 2025


SAEs Are Good for Steering - If You Select the Right Features.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
ReFACT: Updating Text-to-Image Models by Editing the Text Encoder.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Predicting Fact Contributions from Query Logs with Machine Learning.
Proceedings of the 27th International Conference on Extending Database Technology, 2024

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2022
LearnShapley: Learning to Predict Rankings of Facts Contribution Based on Query Logs.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

