Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

ReIFE: Re-evaluating Instruction-Following Evaluation.

Yixin Liu

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Understanding Reference Policies in Direct Preference Optimization.

Yixin Liu

Pengfei Liu

Arman Cohan

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Physics: Benchmarking Foundation Models on University-Level Physics Problem Solving.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Evaluating Mathematical Reasoning Beyond Accuracy.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences.

CoRR, 2024

Fair Abstractive Summarization of Diverse Perspectives.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

On the Role of Summary Content Units in Text Summarization Evaluation.

Leonardo F. R. Ribeiro

Khyathi Raghavi Chandu

Yufang Hou

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

On Learning to Summarize with Large Language Models as References.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Calibrating Long-form Generations From Large Language Models.

Yukun Huang

Yixin Liu

Raghuveer Thirukovalluru

Arman Cohan

Bhuwan Dhingra

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FOLIO: Natural Language Reasoning with First-Order Logic.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Rethinking Efficient Multilingual Text Summarization Meta-Evaluation.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Financial Documents.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data.

CoRR, 2023

ODSum: New Benchmarks for Open Domain Multi-Document Summarization.

CoRR, 2023

QTSumm: A New Benchmark for Query-Focused Table Summarization.

CoRR, 2023

On Learning to Summarize with Large Language Models as References.

CoRR, 2023

Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

QTSumm: Query-Focused Summarization over Tabular Data.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization.

Khyathi Raghavi Chandu

João Sedoc

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

On Improving Summarization Factual Consistency from Natural Language Feedback.

Ahmed Hassan Awadallah

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization.

Khyathi Raghavi Chandu

CoRR, 2022

FOLIO: Natural Language Reasoning with First-Order Logic.

CoRR, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.

Alexandros Papangelis

Aman Madaan

Angelina McMillan-Major

Khyathi Raghavi Chandu

Laura Perez-Beltrachini

Leonardo F. R. Ribeiro

Pawan Sasanka Ammanamanchi

CoRR, 2022

Surfer100: Generating Surveys From Web Resources, Wikipedia-style.

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

R2D2: Robust Data-to-Text with Replacement Detection.

Linyong Nan

Lorenzo Jaime Yu Flores

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Leveraging Locality in Abstractive Text Summarization.

Ahmed Hassan Awadallah

Dragomir Radev

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DataLab: A Platform for Data Analysis and Intervention.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

BRIO: Bringing Order to Abstractive Summarization.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

CLICKER: A Computational LInguistics Classification Scheme for Educational Resources.

CoRR, 2021

On Learning Text Style Transfer with Direct Rewards.

Yixin Liu

Graham Neubig

John Wieting

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

RefSum: Refactoring Neural Summarization.

Yixin Liu

Zi-Yi Dou

Pengfei Liu

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

ExplainaBoard: An Explainable Leaderboard for NLP.

Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization.

Yixin Liu

Pengfei Liu

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Yixin Liu
