Yanai Elazar

Orcid: 0009-0000-8138-4533

According to our database1, Yanai Elazar authored at least 56 papers between 2018 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LLM-Generated or Human-Written? Comparing Review and Non-Review Papers on ArXiv.
CoRR, January, 2026

2025
Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior.
CoRR, October, 2025

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens.
CoRR, April, 2025

Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases.
CoRR, February, 2025

How Many Images Does It Take? Estimating Imitation Thresholds in Text-to-Image Models.
Trans. Mach. Learn. Res., 2025

On Linear Representations and Pretraining Data Frequency in Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025


Calibrating Large Language Models with Sample Consistency.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
A Survey on Data Selection for Language Models.
Trans. Mach. Learn. Res., 2024

GRADE: Quantifying Sample Diversity in Text-to-Image Models.
CoRR, 2024

How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation Threshold.
CoRR, 2024

Paloma: A Benchmark for Evaluating Language Model Fit.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

The Bias Amplification Paradox in Text-to-Image Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Backtracking Mathematical Reasoning of Language Models to the Pretraining Data.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

What's In My Big Data?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Detection and Measurement of Syntactic Templates in Generated Text.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


Estimating the Causal Effect of Early ArXiving on Paper Acceptance.
Proceedings of the Causal Learning and Reasoning, 2024



2023
A taxonomy and review of generalization research in NLP.
Nat. Mac. Intell., October, 2023

At Your Fingertips: Extracting Piano Fingering Instructions from Videos.
CoRR, 2023

CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Text-based NP Enrichment.
Trans. Assoc. Comput. Linguistics, 2022

State-of-the-art generalisation research in NLP: a taxonomy and review.
CoRR, 2022

Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions.
CoRR, 2022

Lexical Generalization Improves with Larger Models and Longer Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Revisiting Few-shot Relation Classification: Evaluation Data and Classification Schemes.
Trans. Assoc. Comput. Linguistics, 2021

Amnesic Probing: Behavioral Explanation With Amnesic Counterfactuals.
Trans. Assoc. Comput. Linguistics, 2021

Erratum: Measuring and Improving Consistency in Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2021

Measuring and Improving Consistency in Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2021

Back to Square One: Bias Detection, Training and Commonsense Disentanglement in the Winograd Schema.
CoRR, 2021

Contrastive Explanations for Model Interpretability.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
oLMpics - On what Language Model Pre-training Captures.
Trans. Assoc. Comput. Linguistics, 2020

When Bert Forgets How To POS: Amnesic Probing of Linguistic Properties and MLM Predictions.
CoRR, 2020

Evaluating NLP Models via Contrast Sets.
CoRR, 2020


Do Language Embeddings capture Scales?
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

Unsupervised Distillation of Syntactic Information from Contextualized Word Representations.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

The Extraordinary Failure of Complement Coercion Crowdsourcing.
Proceedings of the First Workshop on Insights from Negative Results in NLP, 2020

2019
Where's My Head? Definition, Dataset and Models for Numeric Fused-Heads Identification and Resolution.
Trans. Assoc. Comput. Linguistics, 2019

Privacy and Fairness in Recommender Systems via Adversarial Training of User Representations.
Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods, 2019

Adversarial Removal of Demographic Attributes Revisited.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

How Large Are Lions? Inducing Distributions over Quantitative Attributes.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Privacy-Adversarial User Representations in Recommender Systems.
CoRR, 2018

Adversarial Removal of Demographic Attributes from Text Data.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018


  Loading...