Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Towards Interpreting Visual Information Processing in Vision-Language Models.

[BibT_eX]

[DOI]

Clement Neo

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Automated Interpretability with Output-Centric Feature Descriptions.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Eliciting Textual Descriptions from Representations of Continuous Prompts.

[BibT_eX]

[DOI]

Daniela Gottesman

Mor Geva

Dana Ramati

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Inferring Functionality of Attention Heads from their Parameters.

[BibT_eX]

[DOI]

Amit Elhelo

Mor Geva

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Evaluating the Ripple Effects of Knowledge Editing in Language Models.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2024

CoverBench: A Challenging Benchmark for Complex Claim Verification.

[BibT_eX]

[DOI]

CoRR, 2024

When Can Transformers Count to n?

[BibT_eX]

[DOI]

CoRR, 2024

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty.

[BibT_eX]

[DOI]

CoRR, 2024

Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces.

[BibT_eX]

[DOI]

CoRR, 2024

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

The Hidden Language of Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?

[BibT_eX]

[DOI]

Gal Yona

Roee Aharoni

Mor Geva

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Estimating Knowledge in Large Language Models Without Generating a Single Token.

[BibT_eX]

[DOI]

Daniela Gottesman

Mor Geva

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Jump to Conclusions: Short-Cutting Transformers with Linear Transformations.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers.

[BibT_eX]

[DOI]

Gal Yona

Roee Aharoni

Mor Geva

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Do Large Language Models Latently Perform Multi-Hop Reasoning?

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

The Hidden Space of Transformer Language Adapters.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giambattista Parascandolo

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

A Comprehensive Evaluation of Tool-Assisted Generation Strategies.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

In-Context Learning Creates Task Vectors.

[BibT_eX]

[DOI]

Roee Hendel

Mor Geva

Amir Globerson

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Dissecting Recall of Factual Associations in Auto-Regressive Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

LM vs LM: Detecting Factual Errors via Cross Examination.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Understanding Transformer Memorization Recall Through Idioms.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Crawling The Internal Knowledge-Base of Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Complex Reasoning in Natural Languag.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

Analyzing Transformers in Embedding Space.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition.

[BibT_eX]

[DOI]

Mor Geva

Tomer Wolfson

Jonathan Berant

Trans. Assoc. Comput. Linguistics, 2022

Inferring Implicit Relations with Language Models.

[BibT_eX]

[DOI]

Uri Katz

Mor Geva

Jonathan Berant

CoRR, 2022

Inferring Implicit Relations in Complex Questions with Language Models.

[BibT_eX]

[DOI]

Uri Katz

Mor Geva

Jonathan Berant

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models.

[BibT_eX]

[DOI]

Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SCROLLS: Standardized CompaRison Over Long Language Sequences.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

Robust and Interpretable Machine Reasoning Over Text

[BibT_eX]

[DOI]

Mor Geva

PhD thesis, 2021

Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2021

Transformer Feed-Forward Layers Are Key-Value Memories.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

What's in Your Head? Emergent Behaviour in Multi-Task Transformer Models.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Break It Down: A Question Understanding Benchmark.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2020

Injecting Numerical Reasoning Skills into Language Models.

[BibT_eX]

[DOI]

Mor Geva

Ankit Gupta

Jonathan Berant

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets.

[BibT_eX]

[DOI]

Mor Geva

Yoav Goldberg

Jonathan Berant

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018

Emergence of Communication in an Interactive World with Consistent Speakers.

[BibT_eX]

[DOI]

Ben Bogin

Mor Geva

Jonathan Berant

CoRR, 2018

Learning to Search in Long Documents Using Document Structure.

[BibT_eX]

[DOI]

Mor Geva