Nishant Balepur

Feng Gu

Abhilasha Ravichander

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

A Good Plan is Hard to Find: Aligning Models with Preferences is Misaligned with What Helps Users.

Seraphina Goldfarb-Tarrant

Matthew Shu

Yoo Yeon Sung

Fumeng Yang

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?

CoRR, 2024

The Prompt Report: A Systematic Survey of Prompting Techniques.

CoRR, 2024

KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students.

Matthew Shu

Jordan L. Boyd-Graber

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick.

Matthew Shu

Alexander Miserlis Hoyle

Alison Robey

Seraphina Goldfarb-Tarrant

Jordan L. Boyd-Graber

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Abhilasha Ravichander

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning.

Shramay Palta

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

It's Not Easy Being Wrong: Evaluating Process of Elimination Reasoning in Large Language Models.

Shramay Palta

CoRR, 2023

Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation.

Kevin Chen-Chuan Chang

CoRR, 2023

Expository Text Generation: Imitate, Retrieve, Paraphrase.

Jie Huang

Kevin Chen-Chuan Chang

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text Fact Transfer.

Jie Huang

Kevin Chen-Chuan Chang

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DynaMiTE: Discovering Explosive Topic Evolutions with User Guidance.
