Luca Soldaini

Orcid: 0000-0001-6998-9863

Affiliations:
  • Amazon Alexa, CA, USA


According to our database1, Luca Soldaini authored at least 57 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions.
CoRR, 2024

KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
CoRR, 2024

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters.
CoRR, 2024

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
Report on the SIGIR 2023 Session on Diversity, Equity and Inclusivity.
SIGIR Forum, December, 2023

Paloma: A Benchmark for Evaluating Language Model Fit.
CoRR, 2023

Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders.
CoRR, 2023

What's In My Big Data?
CoRR, 2023

The Surveillance AI Pipeline.
CoRR, 2023

Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms.
CoRR, 2023

A Controllable QA-based Framework for Decontextualization.
CoRR, 2023

Queer In AI: A Case Study in Community-Led Participatory AI.
CoRR, 2023

The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces.
CoRR, 2023

The Semantic Scholar Open Data Platform.
CoRR, 2023

One-Shot Labeling for Automatic Relevance Estimation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Scim: Intelligent Skimming Support for Scientific Papers.
Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023


A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Embedding Recycling for Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms.
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

2022
Exploring the Challenges of Open Domain Multi-Document Summarization.
CoRR, 2022

Overview of the TREC 2022 NeuCLIR Track.
Proceedings of the Thirty-First Text REtrieval Conference, 2022

Paragraph-based Transformer Pre-training for Multi-Sentence Inference.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Knowledge Transfer from Answer Ranking to Answer Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering.
CoRR, 2021

Modeling Context in Answer Sentence Selection Systems on a Latency Budget.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Answer Generation for Retrieval-based Question Answering Systems.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Improving Spoken Language Understanding By Exploiting ASR N-best Hypotheses.
CoRR, 2020

Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Teaching a New Dog Old Tricks: Resurrecting Multilingual Retrieval Using Zero-Shot Learning.
Proceedings of the Advances in Information Retrieval, 2020

Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

The Cascade Transformer: an Application for Efficient Answer Sentence Selection.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Overcoming low-utility facets for complex answer retrieval.
Inf. Retr. J., 2019

2018
The Knowledge and Language Gap in Medical Information Seeking.
SIGIR Forum, 2018

Characterizing Question Facets for Complex Answer Retrieval.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

GU IRLAB at SemEval-2018 Task 7: Tree-LSTMs for Scientific Relation Classification.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

SMHD: a Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Relation Extraction for Protein-protein Interactions Affected by Mutations.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions.
Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses.
Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

2017
Learning to reformulate long queries for clinical decision support.
J. Assoc. Inf. Sci. Technol., 2017

Inferring Individual Attributes from Search Engine Queries and Auxiliary Information.
Proceedings of the 26th International Conference on World Wide Web, 2017

Learning to Rank for Consumer Health Search: A Semantic Approach.
Proceedings of the Advances in Information Retrieval, 2017

Denoising Clinical Notes for Medical Literature Retrieval with Convolutional Neural Model.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
Enhancing web search in the medical domain via query clarification.
Inf. Retr. J., 2016

Team GU-IRLAB at CLEF eHealth 2016: Task 3.
Proceedings of the Working Notes of CLEF 2016, 2016

2015
Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Retrieving Medical Literature for Clinical Decision Support.
Proceedings of the Advances in Information Retrieval, 2015

2014
Query Reformulation for Clinical Decision Support Search.
Proceedings of The Twenty-Third Text REtrieval Conference, 2014

On clinical decision support.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014


  Loading...