Nishant Balepur

According to our database1, Nishant Balepur authored at least 24 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DRACULA: Hunting for the Actions Users Want Deep Research Agents to Execute.
CoRR, April, 2026

Measuring User's Mental Models of Speech Translation in Human-AI Collaboration.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

BenchMarker: An Education-Inspired Toolkit for Highlighting Flaws in Multiple-Choice Benchmarks.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite.
CoRR, October, 2025

Can They Dixit? Yes they Can! Dixit as a Playground for Multimodal Language Model Capabilities.
CoRR, October, 2025

Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers.
CoRR, October, 2025

MoDS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

A Good Plan is Hard to Find: Aligning Models with Preferences is Misaligned with What Helps Users.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?
CoRR, 2024

The Prompt Report: A Systematic Survey of Prompting Techniques.
CoRR, 2024

KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
It's Not Easy Being Wrong: Evaluating Process of Elimination Reasoning in Large Language Models.
CoRR, 2023

Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation.
CoRR, 2023

Expository Text Generation: Imitate, Retrieve, Paraphrase.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text Fact Transfer.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DynaMiTE: Discovering Explosive Topic Evolutions with User Guidance.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023


  Loading...