Sean Welleck

According to our database1, Sean Welleck authored at least 60 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of three.
  • Erdős number3 of two.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
OptimalThinkingBench: Evaluating Over and Underthinking in LLMs.
CoRR, August, 2025

Agentic-R1: Distilled Dual-Strategy Reasoning.
CoRR, July, 2025

Premise Selection for a Lean Hammer.
CoRR, June, 2025

Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening.
CoRR, June, 2025

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think.
CoRR, May, 2025

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators.
CoRR, March, 2025

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning.
CoRR, March, 2025

Programming with Pixels: Computer-Use Meets Software Engineering.
CoRR, February, 2025

Optimizing Temperature for Language Models with Multi-Sample Inference.
CoRR, February, 2025

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Lean-STaR: Learning to Interleave Thinking and Proving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

miniCTX: Neural Theorem Proving with (Long-)Contexts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ImProver: Agent-Based Automated Proof Optimization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Evaluating Language Models as Synthetic Data Generators.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models.
Trans. Mach. Learn. Res., 2024

Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning.
CoRR, 2024

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement.
CoRR, 2024

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models.
CoRR, 2024

miniCodeProps: a Minimal Benchmark for Proving Code Properties.
CoRR, 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Llemma: An Open Language Model for Mathematics.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
MAUVE Scores for Generative Models: Theory and Practice.
J. Mach. Learn. Res., 2023

LLMSTEP: LLM proofstep suggestions in Lean.
CoRR, 2023

Self-Refine: Iterative Refinement with Self-Feedback.
CoRR, 2023

Self-Refine: Iterative Refinement with Self-Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Faith and Fate: Limits of Transformers on Compositionality.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generating Sequences by Learning to Self-Correct.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

STEER: Unified Style Transfer with Expert Reinforcement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Survey of Deep Learning for Mathematical Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
NaturalProver: Grounded Mathematical Proof Generation with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

QUARK: Controllable Text Generation with Reinforced Unlearning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

LILA: A Unified Benchmark for Mathematical Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Generated Knowledge Prompting for Commonsense Reasoning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Symbolic Brittleness in Sequence Models: On Systematic Generalization in Symbolic Mathematics.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Order and Learning in Sequential Neural Structured Prediction.
PhD thesis, 2021

PROMPT WAYWARDNESS: The Curious Case of Discretized Interpretation of Continuous Prompts.
CoRR, 2021

Divergence Frontiers for Generative Models: Sample Complexity, Quantization Level, and Frontier Integral.
CoRR, 2021

NaturalProofs: Mathematical Theorem Proving in Natural Language.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mode recovery in neural autoregressive sequence modeling.
Proceedings of the 5th Workshop on Structured Prediction for NLP, 2021

MLE-Guided Parameter Search for Task Loss Minimization in Neural Sequence Modeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Neural Text Generation With Unlikelihood Training.
Proceedings of the 8th International Conference on Learning Representations, 2020

Consistency of a Recurrent Language Model With Respect to Incomplete Decoding.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Don't Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Sequential Graph Dependency Parser.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Non-Monotonic Sequential Text Generation.
Proceedings of the 36th International Conference on Machine Learning, 2019

Dialogue Natural Language Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Loss Functions for Multiset Prediction.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Saliency-based Sequential Image Attention with Multiset Prediction.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017


  Loading...