Tatsunori B. Hashimoto

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers.

[BibT_eX]

[DOI]

Chenglei Si

Diyi Yang

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AutoBencher: Towards Declarative Benchmark Construction.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Locality Alignment Improves Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

s1: Simple test-time scaling.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

Robust Distortion-free Watermarks for Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

A Survey on Data Selection for Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Benchmarking Large Language Models for News Summarization.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2024

The Extractive-Abstractive Spectrum: Uncovering Verifiability Trade-offs in LLM Generations.

[BibT_eX]

[DOI]

Theodora Worledge

Carlos Guestrin

CoRR, 2024

Graph-based Uncertainty Metrics for Long-form Language Model Outputs.

[BibT_eX]

[DOI]

CoRR, 2024

The Future of Open Human Feedback.

[BibT_eX]

[DOI]

CoRR, 2024

AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Observational Scaling Laws and the Predictability of Language Model Performance.

[BibT_eX]

[DOI]

Yangjun Ruan

Chris J. Maddison

CoRR, 2024

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators.

[BibT_eX]

[DOI]

Yann Dubois

Balázs Galambosi

CoRR, 2024

Linguistic Calibration of Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Security and Privacy, 2024

Observational Scaling Laws and the Predictability of Langauge Model Performance.

[BibT_eX]

[DOI]

Yangjun Ruan

Chris J. Maddison

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Graph-based Uncertainty Metrics for Long-form Language Model Generations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Removing RLHF Protections in GPT-4 via Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Trustless Audits without Revealing Data or Models.

[BibT_eX]

[DOI]

Suppakit Waiwitlikhit

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Models with Conformal Factuality Guarantees.

[BibT_eX]

[DOI]

Christopher Mohri

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Understanding Finetuning for Factual Knowledge Extraction.

[BibT_eX]

[DOI]

Gaurav Rohit Ghosal

Aditi Raghunathan

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scaling Laws for the Value of Individual Data Points in Machine Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Linguistic Calibration of Long-Form Generations.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Identifying the Risks of LM Agents with an LM-Emulated Sandbox.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Proving Test Set Contamination in Black-Box Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention.

[BibT_eX]

[DOI]

Arvind V. Mahankali

Tengyu Ma

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Benchmarking and Improving Generator-Validator Consistency of Language Models.

[BibT_eX]

[DOI]

Xiang Lisa Li

Vaishnavi Shrivastava

Siyan Li

Proceedings of the Twelfth International Conference on Learning Representations, 2024

On the Learnability of Watermarks for Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

On the Fairness ROAD: Robust Optimization for Adversarial Debiasing.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Distributionally Robust Losses for Latent Covariate Mixtures.

[BibT_eX]

[DOI]

John C. Duchi

Hongseok Namkoong

Oper. Res., March, 2023

Holistic Evaluation of Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification.

[BibT_eX]

[DOI]

Niladri S. Chatterji

Saminul Haque

Trans. Mach. Learn. Res., 2023

Accelerating Aggregation Queries on Unstructured Streams of Data.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2023

Foundation Models and Fair Use.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2023

Identifying and Mitigating the Security Risks of Generative AI.

[BibT_eX]

[DOI]

Found. Trends Priv. Secur., 2023

Benchmarking Multi-Domain Active Learning on Image Classification.

[BibT_eX]

[DOI]

Jiayi Li

Rohan Taori

CoRR, 2023

Learning to (Learn at Test Time).

[BibT_eX]

[DOI]

CoRR, 2023

Identifying and Mitigating the Security Risks of Generative AI.

[BibT_eX]

[DOI]

CoRR, 2023

Where's the Liability in Harmful AI Speech?

[BibT_eX]

[DOI]

Peter Henderson

Mark A. Lemley

CoRR, 2023

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback.

[BibT_eX]

[DOI]

CoRR, 2023

Navigating the Grey Area: Expressions of Overconfidence and Uncertainty in Language Models.

[BibT_eX]

[DOI]

Kaitlyn Zhou

Dan Jurafsky

CoRR, 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks.

[BibT_eX]

[DOI]

Tobias Gerstenberg

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Likelihood-Based Diffusion Language Models.

[BibT_eX]

[DOI]

Ishaan Gulrajani

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Congestion Control Safety via Comparative Statics.

[BibT_eX]

[DOI]

Pratiksha Thaker

Matei Zaharia

Proceedings of the IEEE INFOCOM 2023, 2023

Coder Reviewer Reranking for Code Generation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Data Feedback Loops: Model-driven Amplification of Dataset Biases.

[BibT_eX]

[DOI]

Rohan Taori

Proceedings of the International Conference on Machine Learning, 2023

Whose Opinions Do Language Models Reflect?

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Out-of-Domain Robustness via Targeted Augmentations.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Evaluating Self-Supervised Learning via Risk Decomposition.

[BibT_eX]

[DOI]

Yann Dubois

Proceedings of the International Conference on Machine Learning, 2023

Is a Caption Worth a Thousand Images? A Study on Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale.

[BibT_eX]

[DOI]

Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models.

[BibT_eX]

[DOI]

Kaitlyn Zhou

Dan Jurafsky

Fatemehsadat Mireshghallah

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

When Do Pre-Training Biases Propagate to Downstream Tasks? A Case Study in Text Summarization.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

TempLM: Distilling Language Models into Template-Based Generators.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Privacy-Preserving Domain Adaptation of Semantic Parsers.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Contrastive Decoding: Open-ended Text Generation as Optimization.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Contrastive Error Attribution for Finetuned Language Models.

[BibT_eX]

[DOI]

Faisal Ladhak

Esin Durmus

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Emergent Abilities of Large Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2022

Tracing and Removing Data Errors in Natural Language Generation Datasets.

[BibT_eX]

[DOI]

Faisal Ladhak

Esin Durmus

CoRR, 2022

ZK-IMG: Attested Images via Zero-Knowledge Proofs to Fight Disinformation.

[BibT_eX]

[DOI]

CoRR, 2022

Scaling up Trustless DNN Inference with Zero-Knowledge Proofs.

[BibT_eX]

[DOI]

CoRR, 2022

A Closer Look at the Calibration of Differentially Private Learners.

[BibT_eX]

[DOI]

CoRR, 2022

Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2022

TempLM: Distilling Language Models into Template-Based Generators.

[BibT_eX]

[DOI]

CoRR, 2022

TASTI: Semantic Indexes for Machine Learning-based Queries over Unstructured Data.

[BibT_eX]

[DOI]

Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits.

[BibT_eX]

[DOI]

Tong Mu

Yash Chandak

Emma Brunskill

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Diffusion-LM Improves Controllable Text Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

When Does Differentially Private Learning Not Suffer in High Dimensions?

[BibT_eX]

[DOI]

Xuechen Li

Daogao Liu

Huseyin A. Inan

Janardhan Kulkarni

Yin Tat Lee

Abhradeep Guha Thakurta

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Self-Supervised Learning by Characterizing Idealized Representations.

[BibT_eX]

[DOI]

Yann Dubois

Stefano Ermon

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Identifiability Conditions for Domain Adaptation.

[BibT_eX]

[DOI]

Ishaan Gulrajani

Niladri Shekhar Chatterji

Proceedings of the International Conference on Machine Learning, 2022

Language modeling via stochastic processes.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Is Importance Weighting Incompatible with Interpolating Classifiers?

[BibT_eX]

[DOI]

Ke Alexander Wang

Saminul Haque

Proceedings of the Tenth International Conference on Learning Representations, 2022

Extending the WILDS Benchmark for Unsupervised Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Distributionally Robust Models with Parametric Likelihood Ratios.

[BibT_eX]

[DOI]

Paul Michel

Graham Neubig

Proceedings of the Tenth International Conference on Learning Representations, 2022

Large Language Models Can Be Strong Differentially Private Learners.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Jury Learning: Integrating Dissenting Voices into Machine Learning Models.

[BibT_eX]

[DOI]

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Spurious Correlations in Reference-Free Evaluation of Text Generation.

[BibT_eX]

[DOI]

Esin Durmus

Faisal Ladhak

Pawan Sasanka Ammanamanchi

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Accelerating Approximate Aggregation Queries with Expensive Predicates.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2021

On the Opportunities and Risks of Foundation Models.

[BibT_eX]

[DOI]

et al.

CoRR, 2021

Proof: Accelerating Approximate Aggregation Queries with Expensive Predicates.

[BibT_eX]

[DOI]

CoRR, 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.

[BibT_eX]

[DOI]

Sebastian Gehrmann

Tosin P. Adewumi

Karmanya Aggarwal

Aremu Anuoluwapo

Antoine Bosselut

Khyathi Raghavi Chandu

Miruna-Adriana Clinciu

Bodhisattwa Prasad Majumder

Pedro Henrique Martins

Angelina McMillan-Major

Rubungo Andre Niyongabo

Salomey Osei

Ankur P. Parikh

Laura Perez-Beltrachini

Marco Antonio Sobrevilla Cabezudo

CoRR, 2021

On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies.

[BibT_eX]

[DOI]

Tianyi Zhang

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

DReCa: A General Task Augmentation Strategy for Few-Shot Natural Language Inference.

[BibT_eX]

[DOI]

Shikhar Murty

Christopher D. Manning

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Model Performance Scaling with Multiple Data Sources.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Modeling the Second Player in Distributionally Robust Optimization.

[BibT_eX]

[DOI]

Paul Michel

Graham Neubig

Proceedings of the 9th International Conference on Learning Representations, 2021

Don't Hate the Player, Hate the Game: Safety and Utility in Multi-Agent Congestion Control.

[BibT_eX]

[DOI]

Pratiksha Thaker

Matei Zaharia

Proceedings of the HotNets '21: The 20th ACM Workshop on Hot Topics in Networks, 2021

The Disagreement Deconvolution: Bringing Machine Learning Performance Metrics In Line With Reality.

[BibT_eX]

[DOI]

Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Approximate Selection with Guarantees using Proxies.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2020

Task-agnostic Indexes for Deep Learning-based Queries over Unstructured Data.

[BibT_eX]

[DOI]

CoRR, 2020

Robustness to Spurious Correlations via Human Annotations.

[BibT_eX]

[DOI]

Megha Srivastava

Proceedings of the 37th International Conference on Machine Learning, 2020

Distributionally Robust Neural Networks.

[BibT_eX]

[DOI]

Shiori Sagawa

Proceedings of the 8th International Conference on Learning Representations, 2020

Improved Natural Language Generation via Loss Truncation.

[BibT_eX]

[DOI]

Daniel Kang

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization.

[BibT_eX]

[DOI]

Shiori Sagawa

CoRR, 2019

Learning Autocomplete Systems as a Communication Game.

[BibT_eX]

[DOI]

Mina Lee

CoRR, 2019

Unifying Human and Statistical Evaluation for Natural Language Generation.

[BibT_eX]

[DOI]

Hugh Zhang

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Distributionally Robust Language Modeling.

[BibT_eX]

[DOI]

Yonatan Oren

Shiori Sagawa

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Inferring Multidimensional Rates of Aging from Cross-Sectional Data.

[BibT_eX]

[DOI]

Emma Pierson

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

Generating Sentences by Editing Prototypes.

[BibT_eX]

[DOI]

Kelvin Guu

Yonatan Oren

Trans. Assoc. Comput. Linguistics, 2018

Inferring Multi-Dimensional Rates of Aging from Cross-Sectional Data.

[BibT_eX]

[DOI]

Emma Pierson

CoRR, 2018

A Retrieve-and-Edit Framework for Predicting Structured Outputs.

[BibT_eX]

[DOI]

Kelvin Guu

Yonatan Oren

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Fairness Without Demographics in Repeated Loss Minimization.

[BibT_eX]

[DOI]

Megha Srivastava

Hongseok Namkoong

Proceedings of the 35th International Conference on Machine Learning, 2018

Derivative Free Optimization Via Repeated Classification.

[BibT_eX]

[DOI]

Steve Yadlowsky

John C. Duchi

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Unsupervised Transformation Learning via Convex Relaxations.

[BibT_eX]

[DOI]

Tatsunori Benjamin Hashimoto

John C. Duchi

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016

Continuous representations and models from random walk diffusion limits.

[BibT_eX]

[DOI]

PhD thesis, 2016

Word Embeddings as Metric Recovery in Semantic Spaces.

[BibT_eX]

[DOI]

David Alvarez-Melis

Trans. Assoc. Comput. Linguistics, 2016

GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding.

[BibT_eX]

[DOI]

Haoyang Zeng

Daniel D. Kang

David K. Gifford

Bioinform., 2016

Learning Population-Level Diffusions with Generative RNNs.

[BibT_eX]

[DOI]

David K. Gifford

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Word, graph and manifold embedding from Markov processes.

[BibT_eX]

[DOI]

David Alvarez-Melis

CoRR, 2015

From random walks to distances on unweighted graphs.

[BibT_eX]

[DOI]

Yi Sun

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Metric recovery from directed unweighted graphs.

[BibT_eX]

[DOI]

Yi Sun

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014

Universal Count Correction for High-Throughput Sequencing.

[BibT_eX]

[DOI]

Matthew D. Edwards

David K. Gifford

PLoS Comput. Biol., 2014

2012

Lineage-based identification of cellular states and expression programs.

[BibT_eX]

[DOI]

Bioinform., 2012

2011

Tree preserving embedding.

[BibT_eX]

[DOI]

Albert Shieh

Edoardo M. Airoldi

Proceedings of the 28th International Conference on Machine Learning, 2011

2009

Superconducting Narrowband Filter for Receiver of Weather Radar.

[BibT_eX]

[DOI]

IEICE Trans. Electron., 2009

BFL: a node and edge betweenness based fast layout algorithm for large scale networks.

[BibT_eX]

[DOI]