David Brandfonbrener

According to our database¹, David Brandfonbrener authored at least 34 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Interleaved Head Attention.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

GQ-VAE: A gated quantized VAE for learning variable length tokens.

[BibT_eX]

[DOI]

CoRR, December, 2025

Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs.

[BibT_eX]

[DOI]

CoRR, December, 2025

The Art of Scaling Reinforcement Learning Compute for LLMs.

[BibT_eX]

[DOI]

CoRR, October, 2025

Generalized Parallel Scaling with Interdependent Generations.

[BibT_eX]

[DOI]

Karthik Abinav Sankararaman

CoRR, October, 2025

The Role of Sparsity for Length Generalization in Transformers.

[BibT_eX]

[DOI]

CoRR, February, 2025

Loss-to-Loss Prediction: Scaling Laws for All Datasets.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Universal Length Generalization with Turing Programs.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

The Role of Sparsity for Length Generalization in LLMs.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Deconstructing What Makes a Good Optimizer for Autoregressive Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mixture of Parrots: Experts improve memorization more than reasoning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SOAP: Improving and Stabilizing Shampoo using Adam for Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

SOAP: Improving and Stabilizing Shampoo using Adam.

[BibT_eX]

[DOI]

CoRR, 2024

Deconstructing What Makes a Good Optimizer for Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Verified Multi-Step Synthesis using Large Language Models and Monte Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, 2024

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training.

[BibT_eX]

[DOI]

David Brandfonbrener

Hanlin Zhang

Andreas Kirsch

Jonathan Richard Schwarz

Sham M. Kakade

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Repeat After Me: Transformers are Better than State Space Models at Copying.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Bridging the Gap from Supervised Learning to Control

[BibT_eX]

[DOI]

David Brandfonbrener

PhD thesis, 2023

Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation.

[BibT_eX]

[DOI]

David Brandfonbrener

Ofir Nachum

Joan Bruna

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

2022

Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning.

[BibT_eX]

[DOI]

David Brandfonbrener

Remi Tachet des Combes

Romain Laroche

CoRR, 2022

Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

When does return-conditioned supervised learning work for offline reinforcement learning?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Quantile Filtered Imitation Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Offline RL Without Off-Policy Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Offline Contextual Bandits with Overparameterized Models.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Evaluating representations by the complexity of learning low-loss predictors.

[BibT_eX]

[DOI]

CoRR, 2020

Overfitting and Optimization in Offline Policy Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Geometric Insights into the Convergence of Nonlinear TD Learning.

[BibT_eX]

[DOI]

David Brandfonbrener

Joan Bruna

Proceedings of the 8th International Conference on Learning Representations, 2020

Frequentist Regret Bounds for Randomized Least-Squares Value Iteration.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019

Frequentist Regret Bounds for Randomized Least-Squares Value Iteration.

[BibT_eX]

[DOI]

CoRR, 2019

On the Expected Dynamics of Nonlinear TD Learning.

[BibT_eX]

[DOI]

David Brandfonbrener

Joan Bruna

CoRR, 2019

2018

Two-Vertex Generators of Jacobians of Graphs.

[BibT_eX]

[DOI]

Electron. J. Comb., 2018

David Brandfonbrener

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...