Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions.

Orion Weller

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

SCIURus: Shared Circuits for Interpretable Uncertainty Representations in Language Models.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

ReIFE: Re-evaluating Instruction-Following Evaluation.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Understanding Reference Policies in Direct Preference Optimization.

Yixin Liu

Pengfei Liu

Arman Cohan

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Z1: Efficient Test-time Scaling with Code.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Table-R1: Inference-Time Scaling for Table Reasoning Tasks.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SciSketch: An Open-source Framework for Automated Schematic Diagram Generation in Scientific Papers.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

LimRank: Less is More for Reasoning-Intensive Information Reranking.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Judging with Many Minds: Do More Perspectives Mean Less Prejudice? On Bias Amplification and Resistance in Multi-Agent Based LLM-as-Judge.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs.

Gabrielle Kaili-May Liu

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MedTutor: A Retrieval-Augmented LLM System for Case-Based Medical Education.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CourtReasoner: Can LLM Agents Reason Like Judges?

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

mFollowIR: A Multilingual Benchmark for Instruction Following in Retrieval.

Proceedings of the Advances in Information Retrieval, 2025

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Can LLMs Identify Critical Limitations within Scientific Research? A Systematic Evaluation on AI Research Papers.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

TESS 2: A Large-Scale Generalist Diffusion Language Model.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MDCure: A Scalable Pipeline for Multi-Document Instruction-Following.

Gabrielle Kaili-May Liu

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2025

MIR: Methodology Inspiration Retrieval for Scientific Research Problems.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Physics: Benchmarking Foundation Models on University-Level Physics Problem Solving.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

LocAgent: Graph-Guided LLM Agents for Code Localization.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

RouterRetriever: Routing over a Mixture of Expert Embedding Models.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models.

Trans. Assoc. Comput. Linguistics, 2024

SurgeryLLM: a retrieval-augmented generation large language model framework for surgical decision support and workflow enhancement.

npj Digit. Medicine, 2024

ReFIT: Reranker Relevance Feedback during Inference.

IEEE Data Eng. Bull., 2024

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation.

CoRR, 2024

ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain.

CoRR, 2024

FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents.

CoRR, 2024

COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences.

CoRR, 2024

MetaMath: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models.

CoRR, 2024

RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models.

CoRR, 2024

Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation.

CoRR, 2024

Step-Back Profiling: Distilling User History for Personalized Scientific Writing.

CoRR, 2024

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature.

CoRR, 2024

MIMIR: A Streamlined Platform for Personalized Agent Tuning in Domain Expertise.

CoRR, 2024

Evaluating LLMs at Detecting Errors in LLM Responses.

Ryo Kamoi

Sarkar Snigdha Sarathi Das

Sujeeth Reddy Vummanthala

CoRR, 2024

On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization.

Lorenzo Jaime Yu Flores

Arman Cohan

CoRR, 2024

Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science.

CoRR, 2024

Yale NLP at TREC 2024: Tip-of-the-Tongue Track.

Rohan Phanse

Gabrielle Kaili-May Liu

Arman Cohan

Proceedings of the Thirty-Third Text REtrieval Conference, 2024

Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data?

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

On Learning to Summarize with Large Language Models as References.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Investigating Data Contamination in Modern Benchmarks for Large Language Models.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

NExT: Teaching Large Language Models to Reason about Code Execution.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Observable Propagation: Uncovering Feature Vectors in Transformers.

Jacob Dunefsky

Arman Cohan

Proceedings of the Forty-first International Conference on Machine Learning, 2024

MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers.

Shruti Singh

Nandan Sarkar

Arman Cohan

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

OMG-QA: Building Open-Domain Multi-Modal Generative Question Answering Systems.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Calibrating Long-form Generations From Large Language Models.

Yukun Huang

Yixin Liu

Raghuveer Thirukovalluru

Arman Cohan

Bhuwan Dhingra

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FOLIO: Natural Language Reasoning with First-Order Logic.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

TAIL: A Toolkit for Automatic and Realistic Long-Context Large Language Model Evaluation.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

Bayesian Calibration of Win Rate Estimation with LLM Evaluators.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

OpenT2T: An Open-Source Toolkit for Table-to-Text Generation.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets.

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

TESS: Text-to-Text Self-Conditioned Simplex Diffusion.

Rabeeh Karimi Mahabadi

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization.

Lorenzo Jaime Flores

Arman Cohan

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models.

Martin Riddell

Ansong Ni

Arman Cohan

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Rethinking Efficient Multilingual Text Summarization Meta-Evaluation.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

OLMo: Accelerating the Science of Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

KnowledgeFMath: A Knowledge-Intensive Math Reasoning Dataset in Finance Domains.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Financial Documents.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based Reasoning.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Observable Propagation: A Data-Efficient Approach to Uncover Feature Vectors in Transformers.

Jacob Dunefsky

Arman Cohan

CoRR, 2023

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning.

CoRR, 2023

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks.

CoRR, 2023

DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data.

CoRR, 2023

KnowledgeMath: Knowledge-Intensive Math Word Problem Solving in Finance Domains.

CoRR, 2023

Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders.

CoRR, 2023

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models.

CoRR, 2023

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

CoRR, 2023

ODSum: New Benchmarks for Open Domain Multi-Document Summarization.

CoRR, 2023

Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers.

CoRR, 2023

A Controllable QA-based Framework for Decontextualization.

CoRR, 2023

QTSumm: A New Benchmark for Query-Focused Table Summarization.

CoRR, 2023

On Learning to Summarize with Large Language Models as References.

CoRR, 2023

Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies.

CoRR, 2023

Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval.

CoRR, 2023

The Semantic Scholar Open Data Platform.

CoRR, 2023

SciRepEval: A Multi-Format Benchmark for Scientific Document Representations.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding.

Lorenzo Jaime Yu Flores

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

QTSumm: Query-Focused Summarization over Tabular Data.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Embedding Recycling for Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization.

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

OpenRT: An Open-source Framework for Reasoning Over Tabular Data.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2023

Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning.

Xiangru Tang

Arman Cohan

Mark Gerstein

Proceedings of the 5th Clinical Natural Language Processing Workshop, 2023

2022

ABNIRML: Analyzing the Behavior of Neural IR Models.

Trans. Assoc. Comput. Linguistics, 2022

Exploring the Challenges of Open Domain Multi-Document Summarization.

CoRR, 2022

MultiVerS: Improving scientific claim verification with weak supervision and full-document context.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity.

Sheshera Mysore

Arman Cohan

Tom Hope

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Long Context Question Answering via Supervised Contrastive Learning.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SciFact-Open: Towards open-domain scientific claim verification.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Overview of the First Shared Task on Multi Perspective Scientific Document Summarization (MuP).

Arman Cohan

Guy Feigenblat

Tirthankar Ghosal

Michal Shmueli-Scheuer

Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Overview of the Third Workshop on Scholarly Document Processing.

Drahomira Herrmannova

Petr Knoth

Kyle Lo

Philipp Mayr

Michal Shmueli-Scheuer

Anita de Waard

Lucy Lu Wang

Proceedings of the Third Workshop on Scholarly Document Processing, 2022

PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Improving the Generalizability of Depression Detection by Leveraging Clinical Questionnaires.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Zero- and Few-Shot NLP with Pretrained Language Models.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Generating Scientific Claims for Zero-Shot Scientific Fact Checking.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

ParsiNLU: A Suite of Language Understanding Challenges for Persian.

Trans. Assoc. Comput. Linguistics, 2021

Utilizing Evidence Spans via Sequence-Level Contrastive Learning for Long-Context Question Answering.

CoRR, 2021

LongChecker: Improving scientific claim verification by modeling full-abstract context.

CoRR, 2021

PRIMER: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization.

CoRR, 2021

Cross-Document Language Modeling.

CoRR, 2021

Simplified Data Wrangling with ir_datasets.

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

FLEX: Unifying Evaluation for Few-Shot NLP.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

CDLM: Cross-Document Language Modeling.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

On Generating Extended Summaries of Long Documents.

Sajad Sotudeh

Arman Cohan

Nazli Goharian

Proceedings of the Workshop on Scientific Document Understanding co-located with 35th AAAI Conference on Artificial Intelligence, 2021

2020

SLEDGE: A Simple Yet Effective Baseline for Coronavirus Scientific Knowledge Search.

Sean MacAvaney

Arman Cohan

Nazli Goharian

CoRR, 2020

Longformer: The Long-Document Transformer.

Iz Beltagy

Matthew E. Peters

Arman Cohan

CoRR, 2020

Fact or Fiction: Verifying Scientific Claims.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search.

Sean MacAvaney

Arman Cohan

Nazli Goharian

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

GUIR @ LongSumm 2020: Learning to Generate Long Summaries from Scientific Documents.

Sajad Sotudeh Gharebagh

Arman Cohan

Nazli Goharian

Proceedings of the First Workshop on Scholarly Document Processing, 2020

TLDR: Extreme Summarization of Scientific Documents.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Ranking Significant Discrepancies in Clinical Reports.

Proceedings of the Advances in Information Retrieval, 2020

SUPP.AI: finding evidence for supplement-drug interactions.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

SPECTER: Document-level Representation Learning using Citation-informed Transformers.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Extracting evidence of supplement-drug interactions from literature.

CoRR, 2019

SciBERT: Pretrained Contextualized Embeddings for Scientific Text.

Iz Beltagy

Arman Cohan

Kyle Lo

CoRR, 2019

CEDR: Contextualized Embeddings for Document Ranking.

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Ontology-Aware Clinical Abstractive Summarization.

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Structural Scaffolds for Citation Intent Classification in Scientific Publications.

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Pretrained Language Models for Sequential Sentence Classification.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

SciBERT: A Pretrained Language Model for Scientific Text.

Iz Beltagy

Kyle Lo

Arman Cohan

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018

Text Summarization and Categorization for Scientific and Health-Related Data.

Arman Cohan

SIGIR Forum, 2018

Scientific document summarization via citation contextualization and scientific discourse.

Arman Cohan

Nazli Goharian

Int. J. Digit. Libr., 2018

Overcoming Low-Utility Facets for Complex Answer Retrieval.

Proceedings of the Joint Proceedings of the First International Workshop on Professional Search (ProfS2018); the Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, 2018

Characterizing Question Facets for Complex Answer Retrieval.

Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

GU IRLAB at SemEval-2018 Task 7: Tree-LSTMs for Scientific Relation Classification.

Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents.

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

SMHD: a Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions.

Proceedings of the 27th International Conference on Computational Linguistics, 2018

Relation Extraction for Protein-protein Interactions Affected by Mutations.

Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions.

Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses.

Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

2017

Triaging content severity in online mental health forums.

J. Assoc. Inf. Sci. Technol., 2017

Contextualizing Citations for Scientific Summarization using Word Embeddings and Domain Knowledge.

Arman Cohan

Nazli Goharian

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

GUIR at SemEval-2017 Task 12: A Framework for Cross-Domain Clinical Temporal Information Extraction.

Sean MacAvaney

Arman Cohan

Nazli Goharian

Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Depression and Self-Harm Risk Assessment in Online Forums.

Andrew Yates

Arman Cohan

Nazli Goharian

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Neural Attention Model for Categorizing Patient Safety Events.

Proceedings of the Advances in Information Retrieval, 2017

Identifying Harm Events in Clinical Care through Medical Narratives.

Proceedings of the 8th ACM International Conference on Bioinformatics, 2017

2016

GUIR at SemEval-2016 task 12: Temporal Information Processing for Clinical Narratives.

Arman Cohan

Kevin Meurer

Nazli Goharian

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Triaging Mental Health Forum Posts.

Arman Cohan

Sydney Young

Nazli Goharian

Proceedings of the 3rd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2016

Revisiting Summarization Evaluation for Scientific Articles.

Arman Cohan

Nazli Goharian

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015

Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach.

Arman Cohan

Luca Soldaini

Nazli Goharian

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Scientific Article Summarization Using Citation-Context and Article's Discourse Structure.

Arman Cohan

Nazli Goharian

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Retrieving Medical Literature for Clinical Decision Support.

Luca Soldaini

Arman Cohan

Andrew Yates