Mor Geva

Orcid: 0000-0001-9529-6315

According to our database, Mor Geva authored at least 63 papers between 2017 and 2025.

Bibliography

2025
MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents.
CoRR, August, 2025

Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics.
CoRR, August, 2025

Universal Jailbreak Suffixes Are Strong Attention Hijackers.
CoRR, June, 2025

How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?
CoRR, June, 2025

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization.
CoRR, June, 2025

Precise In-Parameter Concept Erasure in Large Language Models.
CoRR, May, 2025

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas.
CoRR, March, 2025

Preventing Rogue Agents Improves Multi-Agent Collaboration.
CoRR, February, 2025

Open Problems in Mechanistic Interpretability.
CoRR, January, 2025

Open Problems in Machine Unlearning for AI Safety.
CoRR, January, 2025

Language Models Encode Numbers Using Digit Representations in Base 10.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Towards Interpreting Visual Information Processing in Vision-Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Automated Interpretability with Output-Centric Feature Descriptions.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Eliciting Textual Descriptions from Representations of Continuous Prompts.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Inferring Functionality of Attention Heads from their Parameters.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Evaluating the Ripple Effects of Knowledge Editing in Language Models.
Trans. Assoc. Comput. Linguistics, 2024

CoverBench: A Challenging Benchmark for Complex Claim Verification.
CoRR, 2024

When Can Transformers Count to n?
CoRR, 2024

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty.
CoRR, 2024

Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces.
CoRR, 2024

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

The Hidden Language of Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Estimating Knowledge in Large Language Models Without Generating a Single Token.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Jump to Conclusions: Short-Cutting Transformers with Linear Transformations.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Do Large Language Models Latently Perform Multi-Hop Reasoning?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

The Hidden Space of Transformer Language Adapters.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
Trans. Mach. Learn. Res., 2023

A Comprehensive Evaluation of Tool-Assisted Generation Strategies.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

In-Context Learning Creates Task Vectors.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Dissecting Recall of Factual Associations in Auto-Regressive Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

LM vs LM: Detecting Factual Errors via Cross Examination.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Understanding Transformer Memorization Recall Through Idioms.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Crawling The Internal Knowledge-Base of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Complex Reasoning in Natural Language.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

Analyzing Transformers in Embedding Space.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition.
Trans. Assoc. Comput. Linguistics, 2022

Inferring Implicit Relations with Language Models.
CoRR, 2022

Inferring Implicit Relations in Complex Questions with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SCROLLS: Standardized CompaRison Over Long Language Sequences.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Robust and Interpretable Machine Reasoning Over Text.
PhD thesis, 2021

Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies.
Trans. Assoc. Comput. Linguistics, 2021

Transformer Feed-Forward Layers Are Key-Value Memories.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

What's in Your Head? Emergent Behaviour in Multi-Task Transformer Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Break It Down: A Question Understanding Benchmark.
Trans. Assoc. Comput. Linguistics, 2020

Injecting Numerical Reasoning Skills into Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Emergence of Communication in an Interactive World with Consistent Speakers.
CoRR, 2018

Learning to Search in Long Documents Using Document Structure.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Evaluating Semantic Parsing against a Simple Web-based Question Answering Model.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

