David Brandfonbrener

According to our database, David Brandfonbrener authored at least 18 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models.
CoRR, 2024

Verified Multi-Step Synthesis using Large Language Models and Monte Carlo Tree Search.
CoRR, 2024

Repeat After Me: Transformers are Better than State Space Models at Copying.
CoRR, 2024

2023
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

2022
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning.
CoRR, 2022

Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning.
CoRR, 2022

When does return-conditioned supervised learning work for offline reinforcement learning?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Quantile Filtered Imitation Learning.
CoRR, 2021

Offline RL Without Off-Policy Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Offline Contextual Bandits with Overparameterized Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Evaluating representations by the complexity of learning low-loss predictors.
CoRR, 2020

Overfitting and Optimization in Offline Policy Learning.
CoRR, 2020

Geometric Insights into the Convergence of Nonlinear TD Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Frequentist Regret Bounds for Randomized Least-Squares Value Iteration.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Frequentist Regret Bounds for Randomized Least-Squares Value Iteration.
CoRR, 2019

On the Expected Dynamics of Nonlinear TD Learning.
CoRR, 2019

2018
Two-Vertex Generators of Jacobians of Graphs.
Electron. J. Comb., 2018

