Jack Hessel

Orcid: 0000-0002-4012-8979

According to our database, Jack Hessel authored at least 52 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning.
CoRR, 2024

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects.
CoRR, 2024

Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models.
CoRR, 2023

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations.
CoRR, 2023

Tailoring Self-Rationalizers with Multi-Reward Distillation.
CoRR, 2023

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging.
CoRR, 2023

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use.
CoRR, 2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models.
CoRR, 2023

FunQA: Towards Surprising Video Comprehension.
CoRR, 2023

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources.
CoRR, 2023

Text encoders are performance bottlenecks in contrastive vision-language models.
CoRR, 2023

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Localized Symbolic Knowledge Distillation for Visual Commonsense Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Champagne: Learning Real-world Conversation from Large-Scale Web Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

What's "up" with vision-language models? Investigating their struggle with spatial reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text encoders bottleneck compositionality in contrastive vision-language models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.
CoRR, 2022

Multimodal Knowledge Alignment with Reinforcement Learning.
CoRR, 2022

QUARK: Controllable Text Generation with Reinforced Unlearning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Reframing Human-AI Collaboration for Generating Free-Text Explanations.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
MERLOT: Multimodal Neural Script Knowledge Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

CLIPScore: A Reference-free Evaluation Metric for Image Captioning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

How effective is BERT without word ordering? Implications for language understanding and data privacy.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Something's Brewing! Early Prediction of Controversy-causing Posts from Discussion Features.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unsupervised Discovery of Multimodal Links in Multi-image, Multi-sentence Documents.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

2018
Quantifying the Visual Concreteness of Words and Topics in Multimodal Datasets.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

2017
Cats and Captions vs. Creators and the Clock: Comparing Multimodal Content to Context in Predicting Relative Popularity.
Proceedings of the 26th International Conference on World Wide Web, 2017

2016
Science, AskScience, and BadScience: On the Coexistence of Highly Related Communities.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

2015
What do Vegans do in their Spare Time? Latent Interest Detection in Multi-Community Networks.
CoRR, 2015

Image Representations and New Domains in Neural Image Captioning.
Proceedings of the Fourth Workshop on Vision and Language, 2015

2013
Evolving multicellularity in digital organisms through reproductive altruism.
Proceedings of the Genetic and Evolutionary Computation Conference, 2013

Using Reproductive Altruism to Evolve Multicellularity in Digital Organisms.
Proceedings of the Twelfth European Conference on the Synthesis and Simulation of Living Systems: Advances in Artificial Life, 2013

