Eugene Yang

CoRR, May, 2026

CoverageBench: Evaluating Information Coverage across Tasks and Domains.

[BibT_eX]

[DOI]

CoRR, March, 2026

Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage.

[BibT_eX]

[DOI]

CoRR, March, 2026

Overview of the TREC 2025 RAGTIME Track.

[BibT_eX]

[DOI]

CoRR, February, 2026

NeuCLIRTech: Chinese Monolingual and Cross-Language Information Retrieval Evaluation in a Challenging Domain.

[BibT_eX]

[DOI]

CoRR, February, 2026

WSDM CUP 2026: Multilingual Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

RoutIR: Fast Serving of Retrieval Pipelines for Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-reasoning Rerankers.

[BibT_eX]

[DOI]

Saron Samuel

Benjamin Van Durme

Proceedings of the Advances in Information Retrieval, 2026

Investigating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

LANCER: LLM Reranking for Nugget Coverage.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Incorporating Q&A Nuggets Into Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2026

Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction.

[BibT_eX]

[DOI]

Debashish Chakraborty

Proceedings of the Advances in Information Retrieval, 2026

2025

NeuCLIRBench: A Modern Evaluation Collection for Monolingual, Cross-Language, and Multilingual Information Retrieval.

[BibT_eX]

[DOI]

CoRR, November, 2025

Seeing Through the MiRAGE: Evaluating Multimodal Retrieval Augmented Generation.

[BibT_eX]

[DOI]

CoRR, October, 2025

Augmenting Researchy Questions with Sub-question Judgments.

[BibT_eX]

[DOI]

CoRR, October, 2025

Evaluating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries.

[BibT_eX]

[DOI]

CoRR, October, 2025

Topic-Specific Classifiers are Better Relevance Judges than Prompted LLMs.

[BibT_eX]

[DOI]

CoRR, October, 2025

Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector.

[BibT_eX]

[DOI]

CoRR, October, 2025

Auto-ARGUE: LLM-Based Report Generation Evaluation.

[BibT_eX]

[DOI]

Yu Hou

CoRR, September, 2025

Linguistic Nepotism: Trading-off Quality for Language Preference in Multilingual RAG.

[BibT_eX]

[DOI]

CoRR, September, 2025

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

HLTCOE at LiveRAG: GPT-Researcher using ColBERT retrieval.

[BibT_eX]

[DOI]

CoRR, June, 2025

Rank-K: Test-Time Reasoning for Listwise Reranking.

[BibT_eX]

[DOI]

CoRR, May, 2025

WikiVideo: Article Generation from Multiple Videos.

[BibT_eX]

[DOI]

CoRR, April, 2025

Rank1: Test-Time Compute for Reranking in Information Retrieval.

[BibT_eX]

[DOI]

CoRR, February, 2025

Neural Lexical Search with Learned Sparse Retrieval.

[BibT_eX]

[DOI]

Siddharth A. K. Singh

Thong Nguyen

Yibin Lei

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Nugget-based Annotation Protocol and Tool For Evaluating Long-form Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

System Comparison Using Automated Generation of Relevance Judgements in Multiple Languages.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

MMMORRF: Multimodal Multilingual MOdularized Reciprocal Rank Fusion.

[BibT_eX]

[DOI]

Saron Samuel

Dan DeGenaro

Jimena Guallar-Blasco

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Variations in Relevance Judgments and the Shelf Life of Test Collections.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Generate-Distill: Training Cross-Language IR Models with Synthetically-Generated Data.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

A Reproducibility Study of LLM Setwise Reranker with Heapsort.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

CLERC: A Dataset for U. S. Legal Case Retrieval and Retrieval-Augmented Analysis Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MURR: Model Updating with Regularized Replay for Searching a Document Stream.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Eval4RAG: Workshop on Evaluation of Retrieval-Augmented Generation Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

mFollowIR: A Multilingual Benchmark for Instruction Following in Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2025

Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval.

[BibT_eX]

[DOI]

Jimena Guallar-Blasco

Alexander Martin

Benjamin Van Durme

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Report on the Collab-a-Thon at ECIR 2024.

[BibT_eX]

[DOI]

SIGIR Forum, June, 2024

Report on the Search Futures Workshop at ECIR 2024.

[BibT_eX]

[DOI]

SIGIR Forum, June, 2024

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval.

[BibT_eX]

[DOI]

Jimena Guallar-Blasco

CoRR, 2024

CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

HLTCOE at TREC 2024 NeuCLIR Track.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third Text REtrieval Conference, 2024

Overview of the TREC 2024 NeuCLIR Track.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third Text REtrieval Conference, 2024

Distillation for Multilingual Information Retrieval.

[BibT_eX]

[DOI]

Dawn J. Lawrie

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Language Fairness in Multilingual Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Contextualization with SPLADE for High Recall Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

On the Evaluation of Machine-Generated Reports.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

High Recall Retrieval Via Technology-Assisted Review.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

Beyond the Bar: Generative AI as a Transformative Component in Legal Document Review.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Big Data, 2024

2023

Synthetic Cross-language Information Retrieval Training Data.

[BibT_eX]

[DOI]

CoRR, 2023

HLTCOE at TREC 2023 NeuCLIR Track.

[BibT_eX]

[DOI]

Dawn J. Lawrie

Proceedings of the Thirty-Second Text REtrieval Conference Proceedings (TREC 2023), 2023

Overview of the TREC 2023 NeuCLIR Track.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second Text REtrieval Conference Proceedings (TREC 2023), 2023

Neural Methods for Cross-Language Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

BLADE: Combining Vocabulary Pruning and Intermediate Pretraining for Scaleable Neural CLIR.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

HC3: A Suite of Test Collections for CLIR Evaluation over Informal Text.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Extending Translate-Train for ColBERT-X to African Language CLIR.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of FIRE 2023, 2023

Neural Approaches to Multilingual Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2023

2022

Parameter-efficient Zero-shot Transfer for Cross-Language Dense Retrieval with Adapters.

[BibT_eX]

[DOI]

CoRR, 2022

Multilingual ColBERT-X.

[BibT_eX]

[DOI]

CoRR, 2022

HLTCOE at TREC 2022 NeuCLIR Track.

[BibT_eX]

[DOI]

Dawn J. Lawrie

Proceedings of the Thirty-First Text REtrieval Conference, 2022

Overview of the TREC 2022 NeuCLIR Track.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First Text REtrieval Conference, 2022

C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval.

[BibT_eX]

[DOI]

Suraj Nair

Ramraj Chandradevan

Rebecca Iglesias-Flores

Douglas W. Oard

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

TARexp: A Python Framework for Technology-Assisted Review Experiments.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Learning to Enrich Query Representation with Pseudo-Relevance Feedback for Cross-lingual Retrieval.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

ECIR 2022 Tutorial: Technology-Assisted Review for High Recall Retrieval.

[BibT_eX]

[DOI]

Jeremy Pickens

Proceedings of the Advances in Information Retrieval, 2022

Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

HC4: A New Suite of Test Collections for Ad Hoc CLIR.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2022

Learning a Sparse Representation Model for Neural CLIR.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Design of Experimental Search & Information REtrieval Systems, 2022

2021

ToxCCIn: Toxic Content Classification with Interpretability.

[BibT_eX]

[DOI]

Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

Heuristic stopping rules for technology-assisted review.

[BibT_eX]

[DOI]

Proceedings of the DocEng '21: ACM Symposium on Document Engineering 2021, 2021

On minimizing cost in legal document review workflows.

[BibT_eX]

[DOI]

Proceedings of the DocEng '21: ACM Symposium on Document Engineering 2021, 2021

TAR on Social Media: A Framework for Online Content Moderation.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Design of Experimental Search & Information REtrieval Systems, 2021

Certifying One-Phase Technology-Assisted Reviews.

[BibT_eX]

[DOI]