Audrey Huang

According to our database1, Audrey Huang authored at least 18 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment.
CoRR, March, 2025

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification.
CoRR, February, 2025

Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol.
CoRR, February, 2025

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Self-Improvement in Language Models: The Sharpening Mechanism.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification (extended abstract).
Proceedings of the Thirty Eighth Annual Conference on Learning Theory, 2025

2024
Non-adaptive Online Finetuning for Offline Reinforcement Learning.
RLJ, 2024

Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Timing as an Action: Learning When to Observe and Act.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Reinforcement Learning in Low-rank MDPs with Density Features.
Proceedings of the International Conference on Machine Learning, 2023

Extended Abstract: Learning in Low-rank MDPs with Density Features.
Proceedings of the 57th Annual Conference on Information Sciences and Systems, 2023

2022
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Supervised Learning with General Risk Functionals.
Proceedings of the International Conference on Machine Learning, 2022

Offline Reinforcement Learning with Realizability and Single-policy Concentrability.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

Off-Policy Risk Assessment for Markov Decision Processes.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk.
CoRR, 2021

Off-Policy Risk Assessment in Contextual Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2019
Graph-Structured Visual Imitation.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019


  Loading...