Shauli Ravfogel

Orcid: 0000-0001-8442-9311

According to our database, Shauli Ravfogel authored at least 58 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Can LLMs Introspect? A Reality Check.

,

,

Shauli Ravfogel

CoRR, May, 2026

Geometric Factual Recall in Transformers.

Shauli Ravfogel

,

,

,

CoRR, May, 2026

The Truthfulness Spectrum Hypothesis.

Zhuofan Josh Ying

,

Shauli Ravfogel

,

Nikolaus Kriegeskorte

,

CoRR, February, 2026

Discrete Diffusion Models Exploit Asymmetry to Solve Lookahead Planning Tasks.

,

Shauli Ravfogel

,

,

CoRR, February, 2026

From Directions to Regions: Decomposing Activations in Language Models via Local Geometry.

,

,

,

Shauli Ravfogel

,

,

CoRR, February, 2026

2025

State over Tokens: Characterizing the Role of Reasoning Tokens.

,

,

Shauli Ravfogel

,

CoRR, December, 2025

Beyond Single Embeddings: Capturing Diverse Targets with Multi-Query Retrieval.

,

,

Shauli Ravfogel

,

CoRR, November, 2025

Emergence of Linear Truth Encodings in Language Models.

Shauli Ravfogel

,

,

,

,

CoRR, October, 2025

IQ Test for LLMs: An Evaluation Framework for Uncovering Core Skills in LLMs.

,

Amir David Nissan Cohen

,

,

Shauli Ravfogel

,

CoRR, July, 2025

The Medium Is Not the Message: Deconfounding Text Embeddings via Linear Concept Erasure.

,

,

Shauli Ravfogel

,

Mrinmaya Sachan

,

,

Alexander Miserlis Hoyle

CoRR, July, 2025

Preserving Task-Relevant Information Under Linear Concept Removal.

Floris Holstege

,

Shauli Ravfogel

,

CoRR, June, 2025

RELIC: Evaluating Compositional Instruction Following via Language Recognition.

,

,

,

Shauli Ravfogel

,

William Merrill

,

CoRR, June, 2025

A Practical Method for Generating String Counterfactuals.

,

,

,

Shauli Ravfogel

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Gumbel Counterfactual Generation From Language Models.

Shauli Ravfogel

,

,

Vésteinn Snæbjarnarson

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Intrinsic Test of Unlearning Using Parametric Knowledge Traces.

,

,

,

Shauli Ravfogel

,

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

The Medium Is Not the Message: Deconfounding Document Embeddings via Linear Concept Erasure.

,

,

Shauli Ravfogel

,

Mrinmaya Sachan

,

,

Alexander Miserlis Hoyle

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

Diversity Over Quantity: A Lesson From Few Shot Relation Classification.

Amir David Nissan Cohen

,

Shauli Ravfogel

,

Shaltiel Shmidman

,

CoRR, 2024

Counterfactual Generation from Language Models.

Shauli Ravfogel

,

,

Vésteinn Snæbjarnarson

,

CoRR, 2024

GRADE: Quantifying Sample Diversity in Text-to-Image Models.

,

,

Shauli Ravfogel

,

,

CoRR, 2024

Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces.

,

,

Shauli Ravfogel

,

,

CoRR, 2024

Language Imbalance Can Boost Cross-lingual Generalisation.

,

Shauli Ravfogel

,

,

,

CoRR, 2024

What Changed? Converting Representational Interventions to Natural Language.

,

,

,

Shauli Ravfogel

CoRR, 2024

MiMiC: Minimally Modified Counterfactuals in the Representation Space.

,

Shauli Ravfogel

,

Jonathan Herzig

,

,

,

Ponnurangam Kumaraguru

CoRR, 2024

On Affine Homotopy between Language Encoders.

,

Reda Boumasmoud

,

,

,

,

,

Shauli Ravfogel

,

Mrinmaya Sachan

,

Bernhard Schölkopf

,

Mennatallah El-Assady

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Representation Surgery: Theory and Practice of Affine Steering.

,

Shauli Ravfogel

,

Jonathan Herzig

,

,

,

Ponnurangam Kumaraguru

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Concept Erasure for Language-invariant Dense Retrieval.

,

,

Shauli Ravfogel

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Visual Comparison of Language Model Adaptation.

Rita Sevastjanova

,

,

Shauli Ravfogel

,

,

Mennatallah El-Assady

IEEE Trans. Vis. Comput. Graph., 2023

The Curious Case of Hallucinatory Unanswerablity: Finding Truths in the Hidden States of Over-Confident Large Language Models.

,

,

,

,

Shauli Ravfogel

CoRR, 2023

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations.

,

,

,

Shauli Ravfogel

,

Mrinmaya Sachan

,

Bernhard Schölkopf

,

CoRR, 2023

Retrieving Texts based on Abstract Descriptions.

Shauli Ravfogel

,

Valentina Pyatkin

,

Amir David Nissan Cohen

,

Avshalom Manevich

,

CoRR, 2023

Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment.

,

,

Daniel Glickman

,

Shauli Ravfogel

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LEACE: Perfect linear concept erasure in closed form.

,

David Schneider-Joseph

,

Shauli Ravfogel

,

,

,

Stella Biderman

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models.

,

,

,

,

Shauli Ravfogel

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers.

,

Shauli Ravfogel

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Conformal Nucleus Sampling.

Shauli Ravfogel

,

,

Jacob Goldberger

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Linear Guardedness and its Implications.

Shauli Ravfogel

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation.

,

,

Shauli Ravfogel

,

Dietrich Klakow

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions.

,

,

Shauli Ravfogel

,

,

Abhilasha Ravichander

,

,

Yonatan Belinkov

,

Hinrich Schütze

,

CoRR, 2022

Analyzing Gender Representation in Multilingual Models.

,

Shauli Ravfogel

,

Proceedings of the 7th Workshop on Representation Learning for NLP, 2022

Linear Adversarial Concept Erasure.

Shauli Ravfogel

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Adversarial Concept Erasure in Kernel Space.

Shauli Ravfogel

,

Francisco Vargas

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models.

,

Shauli Ravfogel

,

Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models.

,

,

Shauli Ravfogel

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

2021

Amnesic Probing: Behavioral Explanation With Amnesic Counterfactuals.

,

Shauli Ravfogel

,

,

Trans. Assoc. Comput. Linguistics, 2021

Erratum: Measuring and Improving Consistency in Pretrained Language Models.

,

,

Shauli Ravfogel

,

Abhilasha Ravichander

,

,

Hinrich Schütze

,

Trans. Assoc. Comput. Linguistics, 2021

Measuring and Improving Consistency in Pretrained Language Models.

,

,

Shauli Ravfogel

,

Abhilasha Ravichander

,

,

Hinrich Schütze

,

Trans. Assoc. Comput. Linguistics, 2021

Ab Antiquo: Neural Proto-language Reconstruction.

,

Shauli Ravfogel

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Contrastive Explanations for Model Interpretability.

,

Swabha Swayamdipta

,

Shauli Ravfogel

,

,

,

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Counterfactual Interventions Reveal the Causal Effect of Relative Clause Representations on Agreement Prediction.

Shauli Ravfogel

,

,

,

Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Neural Extractive Search.

Shauli Ravfogel

,

Hillel Taub-Tabib

,

Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

When Bert Forgets How To POS: Amnesic Probing of Linguistic Properties and MLM Predictions.

,

Shauli Ravfogel

,

,

CoRR, 2020

Unsupervised Distillation of Syntactic Information from Contextualized Word Representations.

Shauli Ravfogel

,

,

Jacob Goldberger

,

Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT.

,

Shauli Ravfogel

,

,

Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection.

Shauli Ravfogel

,

,

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

The Extraordinary Failure of Complement Coercion Crowdsourcing.

,

Victoria Basmova

,

Shauli Ravfogel

,

,

Proceedings of the First Workshop on Insights from Negative Results in NLP, 2020

2019

Ab Antiquo: Proto-language Reconstruction with RNNs.

,

Shauli Ravfogel

,

CoRR, 2019

Studying the Inductive Biases of RNNs with Synthetic Variations of Natural Languages.

Shauli Ravfogel

,

,

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018

Can LSTM Learn to Capture Agreement? The Case of Basque.

Shauli Ravfogel

,

,

Francis M. Tyers

Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018