Jack Hessel

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness.

[BibT_eX]

[DOI]

CoRR, 2024

The Art of Saying No: Contextual Noncompliance in Language Models.

[BibT_eX]

[DOI]

Faeze Brahman

Sachin Kumar

Vidhisha Balachandran

Pradeep Dasigi

Valentina Pyatkin

Abhilasha Ravichander

Sarah Wiegreffe

Nouha Dziri

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

WildChat: 1M ChatGPT Interaction Logs in the Wild.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Tailoring Self-Rationalizers with Multi-Reward Distillation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

🐱 FunQA: Towards Surprising Video Comprehension.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models.

[BibT_eX]

[DOI]

Anthony Sicilia

Hyunwoo Kim

Malihe Alikhani

Proceedings of the Findings of the Association for Computational Linguistics, 2024

OLMo: Accelerating the Science of Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging.

[BibT_eX]

[DOI]

CoRR, 2023

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use.

[BibT_eX]

[DOI]

CoRR, 2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

FunQA: Towards Surprising Video Comprehension.

[BibT_eX]

[DOI]

CoRR, 2023

Text encoders are performance bottlenecks in contrastive vision-language models.

[BibT_eX]

[DOI]

Amita Kamath

Kai-Wei Chang

CoRR, 2023

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Localized Symbolic Knowledge Distillation for Visual Commonsense Models.

[BibT_eX]

[DOI]

Jae Sung Park

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization.

[BibT_eX]

[DOI]

Rajkumar Ramamurthy

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Champagne: Learning Real-world Conversation from Large-Scale Web Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

What's "up" with vision-language models? Investigating their struggle with spatial reasoning.

[BibT_eX]

[DOI]

Amita Kamath

Kai-Wei Chang

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text encoders bottleneck compositionality in contrastive vision-language models.

[BibT_eX]

[DOI]

Amita Kamath

Kai-Wei Chang

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning.

[BibT_eX]

[DOI]

Ronan Le Bras

Gunhee Kim

Yejin Choi

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.

[BibT_eX]

[DOI]

CoRR, 2022

Multimodal Knowledge Alignment with Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

QUARK: Controllable Text Generation with Reinforced Unlearning.

[BibT_eX]

[DOI]

Yejin Choi

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Reframing Human-AI Collaboration for Generating Free-Text Explanations.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

MERLOT: Multimodal Neural Script Knowledge Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

CLIPScore: A Reference-free Evaluation Metric for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

How effective is BERT without word ordering? Implications for language understanding and data privacy.

[BibT_eX]

[DOI]

Alexandra Schofield

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents.

[BibT_eX]

[DOI]

Gregory Yauney

David Mimno

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

[BibT_eX]

[DOI]

Lillian Lee

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Something's Brewing! Early Prediction of Controversy-causing Posts from Discussion Features.

[BibT_eX]

[DOI]

Lillian Lee

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unsupervised Discovery of Multimodal Links in Multi-image, Multi-sentence Documents.

[BibT_eX]

[DOI]