Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images.

[BibT_eX]

[DOI]

Sami Baral

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OLMoE: Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Intent-aware Schema Generation and Refinement for Literature Review Tables.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Human-AI Collaboration: How AIs Augment Human Teammates.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), 2025

RouterRetriever: Routing over a Mixture of Expert Embedding Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Accelerating Scientific Paper Skimming with Augmented Intelligence Through Customizable Faceted Highlights.

[BibT_eX]

[DOI]

ACM Trans. Interact. Intell. Syst., December, 2024

The Semantic Reader Project.

[BibT_eX]

[DOI]

Yoganand Chandrasekhar

Commun. ACM, October, 2024

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.

[BibT_eX]

[DOI]

CoRR, 2024

Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations.

[BibT_eX]

[DOI]

CoRR, 2024

LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions.

[BibT_eX]

[DOI]

CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2024

RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models.

[BibT_eX]

[DOI]

CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Evaluating Language Model Math Reasoning via Grounding in Educational Curricula.

[BibT_eX]

[DOI]

CoRR, 2024

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature.

[BibT_eX]

[DOI]

CoRR, 2024

FABLES: Evaluating faithfulness and content selection in book-length summarization.

[BibT_eX]

[DOI]

CoRR, 2024

Paloma: A Benchmark for Evaluating Language Model Fit.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DataComp-LM: In search of the next generation of training sets for language models.

[BibT_eX]

[DOI]

Khyathi Raghavi Chandu

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

BooookScore: A systematic exploration of book-length summarization in the era of LLMs.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

One Thousand and One Pairs: A "novel" challenge for long-context language models.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience.

[BibT_eX]

[DOI]

Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OLMo: Accelerating the Science of Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

LIMEADE: From AI Explanations to Advice Taking.

[BibT_eX]

[DOI]

Benjamin Charles Germain Lee

Doug Downey

Kyle Lo

Daniel S. Weld

ACM Trans. Interact. Intell. Syst., December, 2023

Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing.

[BibT_eX]

[DOI]

ACM Trans. Comput. Hum. Interact., October, 2023

Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders.

[BibT_eX]

[DOI]

CoRR, 2023

The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices.

[BibT_eX]

[DOI]

CoRR, 2023

Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation.

[BibT_eX]

[DOI]

CoRR, 2023

A Controllable QA-based Framework for Decontextualization.

[BibT_eX]

[DOI]

CoRR, 2023

Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction.

[BibT_eX]

[DOI]

CoRR, 2023

Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks.

[BibT_eX]

[DOI]

CoRR, 2023

The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces.

[BibT_eX]

[DOI]

Yoganand Chandrasekhar

CoRR, 2023

The Semantic Scholar Open Data Platform.

[BibT_eX]

[DOI]

CoRR, 2023

Scim: Intelligent Skimming Support for Scientific Papers.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023

A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.

[BibT_eX]

[DOI]

Yoganand Chandrasekhar

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Decomposing Complex Queries for Tip-of-the-tongue Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context.

[BibT_eX]

[DOI]

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2022

Exploring the Challenges of Open Domain Multi-Document Summarization.

[BibT_eX]

[DOI]

CoRR, 2022

Data Governance in the Age of Large-Scale Data-Driven Language Technology.

[BibT_eX]

[DOI]

CoRR, 2022

Scim: Intelligent Faceted Highlights for Interactive, Multi-Pass Skimming of Scientific Papers.

[BibT_eX]

[DOI]

CoRR, 2022

Infrastructure for Rapid Open Knowledge Network Development.

[BibT_eX]

[DOI]

AI Mag., 2022

Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset.

[BibT_eX]

[DOI]

Albert Villanova del Moral

Teven Le Scao

Leandro von Werra

Chenghao Mou

Eduardo González Ponferrada

Angelina McMillan-Major

David Ifeoluwa Adelani

Alexandra Sasha Luccioni

Yacine Jernite

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MultiVerS: Improving scientific claim verification with weak supervision and full-document context.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Data Governance in the Age of Large-Scale Data-Driven Language Technology.

[BibT_eX]

[DOI]

Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

SciFact-Open: Towards open-domain scientific claim verification.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts.

[BibT_eX]

[DOI]

Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Overview of the Third Workshop on Scholarly Document Processing.

[BibT_eX]

[DOI]

Drahomira Herrmannova

Petr Knoth

Kyle Lo

Philipp Mayr

Michal Shmueli-Scheuer

Anita de Waard

Lucy Lu Wang

Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Exploring the Role of Local and Global Explanations in Recommender Systems.

[BibT_eX]

[DOI]

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Generating Scientific Claims for Zero-Shot Scientific Fact Checking.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Searching for scientific evidence in a pandemic: An overview of TREC-COVID.

[BibT_eX]

[DOI]

J. Biomed. Informatics, 2021

Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2021

LongChecker: Improving scientific claim verification by modeling full-abstract context.

[BibT_eX]

[DOI]

CoRR, 2021

Overview and Insights from the SciVer Shared Task on Scientific Claim Verification.

[BibT_eX]

[DOI]

David Wadden

Kyle Lo

CoRR, 2021

Incorporating Visual Layout Structures for Scientific Text Classification.

[BibT_eX]

[DOI]

CoRR, 2021

Text mining approaches for dealing with the rapidly expanding literature on COVID-19.

[BibT_eX]

[DOI]

Lucy Lu Wang

Kyle Lo

Briefings Bioinform., 2021

FLEX: Unifying Evaluation for Few-Shot NLP.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Discourse Understanding and Factual Consistency in Abstractive Summarization.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols.

[BibT_eX]

[DOI]

Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Explaining Relationships Between Scientific Documents.

[BibT_eX]

[DOI]

Kelvin Luu

Xinyi Wu

Rik Koncel-Kedziorski

Kyle Lo

Isabel Cachola

Noah A. Smith

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

TREC-COVID: constructing a pandemic information retrieval test collection.

[BibT_eX]

[DOI]

SIGIR Forum, 2020

TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19.

[BibT_eX]

[DOI]

J. Am. Medical Informatics Assoc., 2020

Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature.

[BibT_eX]

[DOI]

Frontiers Res. Metrics Anal., 2020

CORD-19: The Covid-19 Open Research Dataset.

[BibT_eX]

[DOI]

CoRR, 2020

Explanation-Based Tuning of Opaque Machine Learners with Application to Paper Recommendation.

[BibT_eX]

[DOI]

Benjamin Charles Germain Lee

Kyle Lo

Doug Downey

Daniel S. Weld

CoRR, 2020

Citation Text Generation.

[BibT_eX]

[DOI]

Kelvin Luu

Rik Koncel-Kedziorski

Kyle Lo

Isabel Cachola

Noah A. Smith

CoRR, 2020

Fact or Fiction: Verifying Scientific Claims.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions.

[BibT_eX]

[DOI]

Proceedings of the First Workshop on Scholarly Document Processing, 2020

TLDR: Extreme Summarization of Scientific Documents.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

The COVID-19 Open Research Dataset - Abstract.

[BibT_eX]

[DOI]

Lucy Lu Wang

Kyle Lo

Proceedings of the Workshop on Semantic Indexing and Information Retrieval for Health from heterogeneous content types and languages co-located with 42nd European Conference on Information Retrieval, 2020

S2ORC: The Semantic Scholar Open Research Corpus.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

GORC: A large contextual citation graph of academic papers.

[BibT_eX]

[DOI]

CoRR, 2019

Cooperative Generator-Discriminator Networks for Abstractive Summarization with Narrative Flow.

[BibT_eX]

[DOI]

CoRR, 2019

SciBERT: Pretrained Contextualized Embeddings for Scientific Text.

[BibT_eX]

[DOI]

Iz Beltagy

Arman Cohan

Kyle Lo

CoRR, 2019

Combining Distant and Direct Supervision for Neural Relation Extraction.

[BibT_eX]

[DOI]

Iz Beltagy

Kyle Lo

Waleed Ammar

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

SciBERT: A Pretrained Language Model for Scientific Text.

[BibT_eX]

[DOI]

Iz Beltagy

Kyle Lo

Arman Cohan

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018

Improving Distant Supervision with Maxpooled Attention and Sentence-Level Supervision.

[BibT_eX]

[DOI]

Iz Beltagy

Kyle Lo

Waleed Ammar

CoRR, 2018

Citation Count Analysis for Papers with Preprints.

[BibT_eX]

[DOI]

Sergey Feldman