Yejin Choi

Orcid: 0000-0003-3032-5378

Affiliations:
  • University of Washington, School of Computer Science & Engineering, Seattle, WA, USA
  • Allen Institute for Artificial Intelligence, Seattle, WA, USA
  • Stony Brook University, Department of Computer Science, Stony Brook, NY, USA
  • Cornell University, Ithaca, NY, USA (PhD 2010)


According to our database1, Yejin Choi authored at least 308 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
CoRR, 2024

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
CoRR, 2024

Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration.
CoRR, 2024

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.
CoRR, 2024

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences.
CoRR, 2024

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback.
CoRR, 2024

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing.
CoRR, 2024

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.
CoRR, 2024

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild.
CoRR, 2024

From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step.
CoRR, 2024

WildChat: 1M ChatGPT Interaction Logs in the Wild.
CoRR, 2024

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting.
CoRR, 2024

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge.
CoRR, 2024

Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits.
CoRR, 2024

RewardBench: Evaluating Reward Models for Language Modeling.
CoRR, 2024

Information-Theoretic Distillation for Reference-less Summarization.
CoRR, 2024

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs.
CoRR, 2024

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning.
CoRR, 2024

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs.
CoRR, 2024

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects.
CoRR, 2024

JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models.
CoRR, 2024

Do Membership Inference Attacks Work on Large Language Models?
CoRR, 2024

A Roadmap to Pluralistic Alignment.
CoRR, 2024

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens.
CoRR, 2024

Tuning Language Models by Proxy.
CoRR, 2024

Agent AI: Surveying the Horizons of Multimodal Interaction.
CoRR, 2024

Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

A Call for Clarity in Beam Search: How It Works and When It Stops.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Common Sense: the Dark Matter of Language and Intelligence.
Proc. VLDB Endow., 2023

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning.
CoRR, 2023

VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following.
CoRR, 2023

MacGyver: Are Large Language Models Creative Problem Solvers?
CoRR, 2023

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations.
CoRR, 2023

In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search.
CoRR, 2023

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs.
CoRR, 2023

Tailoring Self-Rationalizers with Multi-Reward Distillation.
CoRR, 2023

The Generative AI Paradox: "What It Can Create, It May Not Understand".
CoRR, 2023

Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory.
CoRR, 2023

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging.
CoRR, 2023

Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting.
CoRR, 2023

FiLM: Fill-in Language Models for Any-Order Generation.
CoRR, 2023

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement.
CoRR, 2023

Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding.
CoRR, 2023

PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning.
CoRR, 2023

Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing.
CoRR, 2023

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge.
CoRR, 2023

ArK: Augmented Reality with Knowledge Interactive Emergent Ability.
CoRR, 2023

BotPercent: Estimating Twitter Bot Populations from Groups to Crowds.
CoRR, 2023

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Localized Symbolic Knowledge Distillation for Visual Commonsense Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RealTime QA: What's the Answer Right Now?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Faith and Fate: Limits of Transformers on Compositionality.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling.
Proceedings of the International Conference on Machine Learning, 2023

Generating Sequences by Learning to Self-Correct.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Champagne: Learning Real-world Conversation from Large-Scale Web Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

BotPercent: Estimating Bot Populations in Twitter Communities.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

We're Afraid Language Models Aren't Modeling Ambiguity.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

STEER: Unified Style Transfer with Expert Reinforcement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

"You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Crystal: Introspective Reasoners Reinforced with Self-Feedback.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Penguins Don't Fly: Reasoning about Generics through Instantiations and Exceptions.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Influence Diagnostics under Self-concordance.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Commonsense Knowledge Transfer for Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created through Human-Machine Collaboration.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-text Rationales.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

REV: Information-Theoretic Evaluation of Free-Text Rationales.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
It's not Rocket Science: Interpreting Figurative Language in Narratives.
Trans. Assoc. Comput. Linguistics, 2022

MAUVE Scores for Generative Models: Theory and Practice.
CoRR, 2022

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.
CoRR, 2022

Reinforced Clarification Question Generation with Defeasibility Rewards for Disambiguating Social and Moral Situations.
CoRR, 2022

An AI Dungeon Master's Guide: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons.
CoRR, 2022

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation.
CoRR, 2022

Statistical and Computational Guarantees for Influence Diagnostics.
CoRR, 2022

Multimodal Knowledge Alignment with Reinforcement Learning.
CoRR, 2022

Benchmarking Generalization via In-Context Instructions on 1, 600+ Language Tasks.
CoRR, 2022

Beam Decoding with Controlled Patience.
CoRR, 2022

Faking Fake News for Real Fake News Detection: Propaganda-loaded Training Data Generation.
CoRR, 2022

Computational Lens on Cognition: Study Of Autobiographical Versus Imagined Stories With Large-Scale Language Models.
CoRR, 2022

Knowledge is Power: Symbolic Knowledge Distillation, Commonsense Morality, & Multimodal Script Knowledge.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

NaturalProver: Grounded Mathematical Proof Generation with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

QUARK: Controllable Text Generation with Reinforced Unlearning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Reframing Human-AI Collaboration for Generating Free-Text Explanations.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Exposing the Limits of Video-Text Models through Contrast Sets.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Transparent Human Evaluation for Image Captioning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Aligning to Social Norms and Values in Interactive Narratives.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Understanding Dataset Difficulty with <i>V</i>-Usable Information.
Proceedings of the International Conference on Machine Learning, 2022

Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Twist Decoding: Diverse Generators Guide Each Other.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

NaturalAdversaries: Can Naturalistic Adversaries Be as Effective as Artificial Adversaries?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ProsocialDialog: A Prosocial Backbone for Conversational Agents.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Probing Factually Grounded Content Transfer with Factual Ablation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Reframing Instructional Prompts to GPTk's Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Generated Knowledge Prompting for Commonsense Reasoning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Symbolic Brittleness in Sequence Models: On Systematic Generalization in Symbolic Mathematics.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
PROMPT WAYWARDNESS: The Curious Case of Discretized Interpretation of Continuous Prompts.
CoRR, 2021

Information-Theoretic Measures of Dataset Difficulty.
CoRR, 2021

Delphi: Towards Machine Ethics and Norms.
CoRR, 2021

Reframing Instructional Prompts to GPTk's Language.
CoRR, 2021

Scarecrow: A Framework for Scrutinizing Machine Text.
CoRR, 2021

Divergence Frontiers for Generative Models: Sample Complexity, Quantization Level, and Frontier Integral.
CoRR, 2021

On-the-Fly Controlled Text Generation with Experts and Anti-Experts.
CoRR, 2021

Misinfo Belief Frames: A Case Study on Covid & Climate News.
CoRR, 2021

proScript: Partially Ordered Scripts Generation via Pre-trained Language Models.
CoRR, 2021

MAUVE: Human-Machine Divergence Curves for Evaluating Open-Ended Text Generation.
CoRR, 2021

GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation.
CoRR, 2021

VinVL: Making Visual Representations Matter in Vision-Language Models.
CoRR, 2021

On-the-Fly Attention Modularization for Neural Generation.
CoRR, 2021

Understanding Few-Shot Commonsense Knowledge Models.
CoRR, 2021

WinoGrande: an adversarial winograd schema challenge at scale.
Commun. ACM, 2021

MERLOT: Multimodal Neural Script Knowledge Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

NaturalProofs: Mathematical Theorem Proving in Natural Language.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

CommonsenseQA 2.0: Exposing the Limits of AI through Gamification.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

TuringAdvice: A Generative and Dynamic Evaluation of Language Use.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

"I'm Not Mad": Commonsense Implications of Negation and Contradiction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

proScript: Partially Ordered Scripts Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Contrastive Explanations for Model Interpretability.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Surface Form Competition: Why the Highest Probability Answer Isn't Always Right.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CLIPScore: A Reference-free Evaluation Metric for Image Captioning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Conversational Multi-Hop Reasoning with Neural Commonsense Knowledge and Symbolic Logic Rules.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Challenges in Automated Debiasing for Toxic Language Detection.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Discourse Understanding and Factual Consistency in Abstractive Summarization.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

VinVL: Revisiting Visual Representations in Vision-Language Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Analyzing Commonsense Emergence in Few-shot Knowledge Models.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

TIMEDIAL: Temporal Commonsense Reasoning in Dialog.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GO FIGURE: A Meta Evaluation of Factuality in Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

On-the-Fly Attention Modulation for Neural Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

SCRUPLES: A Corpus of Community Ethical Judgments on 32, 000 Real-Life Anecdotes.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

(Comet-) Atomic 2020: On Symbolic and Neural Commonsense Knowledge Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Paragraph-level Commonsense Transformers with Recurrent Memory.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Learning to Rationalize for Nonmonotonic Reasoning with Distant Supervision.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Edited Media Understanding: Reasoning About Implications of Manipulated Images.
CoRR, 2020

Reflective Decoding: Unsupervised Paraphrasing and Abductive Reasoning.
CoRR, 2020

Visual Commonsense Graphs: Reasoning about the Dynamic Context of a Still Image.
CoRR, 2020

Evaluating Machines by their Real-World Language Use.
CoRR, 2020

Multi-View Learning for Vision-and-Language Navigation.
CoRR, 2020

Adversarial Filters of Dataset Biases.
Proceedings of the 37th International Conference on Machine Learning, 2020

The Curious Case of Neural Text Degeneration.
Proceedings of the 8th International Conference on Learning Representations, 2020

Abductive Commonsense Reasoning.
Proceedings of the 8th International Conference on Learning Representations, 2020

G-DAug: Generative Data Augmentation for Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Unsupervised Commonsense Question Answering with Self-Talk.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Thinking Like a Skeptic: Defeasible Inference in Natural Language.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Back to the Future: Unsupervised Backprop-based Decoding for Counterfactual and Abductive Commonsense Reasoning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

PowerTransformer: Unsupervised Controllable Revision for Biased Language Correction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Social Chemistry 101: Learning to Reason about Social and Moral Norms.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

VisualCOMET: Reasoning About the Dynamic Context of a Still Image.
Proceedings of the Computer Vision - ECCV 2020, 2020

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Do Neural Language Models Overcome Reporting Bias?
Proceedings of the 28th International Conference on Computational Linguistics, 2020

CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Procedural Reading Comprehension with Attribute-Aware Context Flow.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Commonsense Reasoning for Natural Language Processing.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Social Bias Frames: Reasoning about Social and Power Implications of Language.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Commonsense Knowledge Base Completion with Structural and Semantic Context.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

PIQA: Reasoning about Physical Commonsense in Natural Language.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension.
Trans. Assoc. Comput. Linguistics, 2019

Dynamic Knowledge Graph Construction for Zero-shot Commonsense Question Answering.
CoRR, 2019

Exploiting Structural and Semantic Context for Commonsense Knowledge Base Completion.
CoRR, 2019

Cooperative Generator-Discriminator Networks for Abstractive Summarization with Narrative Flow.
CoRR, 2019

Efficient Adaptation of Pretrained Transformers for Abstractive Summarization.
CoRR, 2019

The Curious Case of Neural Text Degeneration.
CoRR, 2019

SocialIQA: Commonsense Reasoning about Social Interactions.
CoRR, 2019

Defending Against Neural Fake News.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Benchmarking Hierarchical Script Knowledge.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

EARLY FUSION for Goal Directed Robotic Vision.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Social IQa: Commonsense Reasoning about Social Interactions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Counterfactual Story Reasoning and Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Robust Navigation with Language Pretraining and Stochastic Sampling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

From Recognition to Cognition: Visual Commonsense Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Do Neural Language Representations Learn Physical Commonsense?
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

HellaSwag: Can a Machine Really Finish Your Sentence?
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

The Risk of Racial Bias in Hate Speech Detection.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

COMET: Commonsense Transformers for Automatic Knowledge Graph Construction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Balancing Shared Autonomy with Human-Robot Communication.
CoRR, 2018

Neural Poetry Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Sounding Board: A User-Centric and Content-Driven Social Chatbot.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Deep Communicating Agents for Abstractive Summarization.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Discourse-Aware Neural Rewards for Coherent Text Generation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Simulating Action Dynamics with Neural Process Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Metaphor Detection in Context.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

QuAC: Question Answering in Context.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Motifs: Scene Graph Parsing With Global Context.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Event2Mind: Commonsense Inference on Events, Intents, and Reactions.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Ultra-Fine Entity Typing.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Modeling Naive Psychology of Characters in Simple Commonsense Stories.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Learning to Write with Cooperative Discriminators.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Learning Interpretable Spatial Operations in a Rich 3D Blocks World.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Detecting English Writing Styles For Non Native Speakers.
CoRR, 2017

Zero-Shot Activity Recognition with Verb Attribute Induction.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Connotation Frames of Power and Agency in Modern Films.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Dynamic Entity Representations in Neural Language Models.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Story Cloze Task: UW NLP System.
Proceedings of the 2nd Workshop on Linking Models of Lexical, 2017

The Effect of Different Writing Tasks on Linguistic Style: A Case Study of the ROC Story Cloze Task.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Multilingual Connotation Frames: A Case Study on Social Media for Targeted Sentiment Analysis and Forecast.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Neural AMR: Sequence-to-Sequence Models for Parsing and Generation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Verb Physics: Relative Physical Knowledge of Actions and Objects.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Large Scale Retrieval and Generation of Image Descriptions.
Int. J. Comput. Vis., 2016

AI's 10 to Watch.
IEEE Intell. Syst., 2016

Learning to name objects.
Commun. ACM, 2016

Sketch-to-Text Generation: Toward Contextual, Creative, and Coherent Composition.
Proceedings of the INLG 2016, 2016

Globally Coherent Text Generation with Neural Checklist Models.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Generating Topical Poetry.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Connotation Frames: A Data-Driven Investigation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Document-level Sentiment Inference with Social, Faction, and Discourse Context.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Learning Prototypical Event Structure from Photo Albums.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Predicting Entry-Level Categories.
Int. J. Comput. Vis., 2015

Connotation Frames: Typed Relations of Implied Sentiment in Predicate-Argument Structure.
CoRR, 2015

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing.
CoRR, 2015

Internet Outages, the Eyewitness Accounts: Analysis of the Outages Mailing List.
Proceedings of the Passive and Active Measurement - 16th International Conference, 2015

Déjà Image-Captions: A Corpus of Expressive Descriptions in Repetition.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Event Detection and Factuality Assessment with Non-Expert Supervision.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Mise en Place: Unsupervised Interpretation of Instructional Recipes.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Refer-to-as Relations as Semantic Knowledge.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
TREETALK: Composition and Compression of Trees for Image Descriptions.
Trans. Assoc. Comput. Linguistics, 2014

Keystroke Patterns as Prosody in Digital Writings: A Case Study with Deceptive Reviews and Essays.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

ConnotationWordNet: Learning Connotation over the Word+Sense Network.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
BabyTalk: Understanding and Generating Simple Image Descriptions.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

From Large Scale Image Categorization to Entry-Level Categories.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Understanding and Quantifying Creativity in Lexical Composition.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Where Not to Eat? Improving Public Policy by Predicting Hygiene Inspections Using Online Reviews.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Success with Style: Using Writing Style to Predict the Success of Novels.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Generalizing Image Captions for Image-Text Parallel Corpus.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Connotation Lexicon: A Dash of Sentiment Beneath the Surface Meaning.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Detecting Visual Text.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Distributional Footprints of Deceptive Product Reviews.
Proceedings of the Sixth International Conference on Weblogs and Social Media, 2012

Characterizing Stylistic Elements in Syntactic Structure.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Collective Generation of Natural Image Descriptions.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Syntactic Stylometry for Deception Detection.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Domain Independent Authorship Attribution without Domain Adaptation.
Proceedings of the Recent Advances in Natural Language Processing, 2011

Learning General Connotation of Words using Graph-based Algorithms.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Baby talk: Understanding and generating simple image descriptions.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Gender Attribution: Tracing Stylometric Evidence Beyond Topic and Genre.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Composing Simple Image Descriptions using Web-scale N-grams.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Finding Deceptive Opinion Spam by Any Stretch of the Imagination.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Language of Vandalism: Improving Wikipedia Vandalism Detection via Stylometric Analysis.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Fine-Grained Opinion Analysis: Structure-aware Appraoches.
PhD thesis, 2010

Using landing pages for sponsored search ad selection.
Proceedings of the 19th International Conference on World Wide Web, 2010

Automatically Generating Annotator Rationales to Improve Sentiment Classification.
Proceedings of the ACL 2010, 2010

Hierarchical Sequential Learning for Extracting Opinions and Their Attributes.
Proceedings of the ACL 2010, 2010

2009
Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

2008
Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

2007
Cornell System Description for the NTCIR-6 Opinion Task.
Proceedings of the 6th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2007

Structured Local Training and Biased Potential Functions for Conditional Random Fields with Application to Coreference Resolution.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Identifying Expressions of Opinion in Context.
Proceedings of the IJCAI 2007, 2007

2006
Joint Extraction of Entities and Relations for Opinion Recognition.
Proceedings of the EMNLP 2006, 2006

2005
OpinionFinder: A System for Subjectivity Analysis.
Proceedings of the HLT/EMNLP 2005, 2005

Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns.
Proceedings of the HLT/EMNLP 2005, 2005


  Loading...