We stand with Ukraine

We stand with Ukraine

Audrey Huang

According to our database¹, Audrey Huang authored at least 19 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

The Coverage Principle: How Pre-Training Enables Post-Training.

[BibT_eX]

[DOI]

,

,

,

Sadhika Malladi

,

,

,

Akshay Krishnamurthy

,

Dylan J. Foster

CoRR, October, 2025

Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment.

[BibT_eX]

[DOI]

,

,

,

,

Dylan J. Foster

,

Akshay Krishnamurthy

CoRR, March, 2025

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification.

[BibT_eX]

[DOI]

,

,

,

Akshay Krishnamurthy

,

Dylan J. Foster

CoRR, February, 2025

Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol.

[BibT_eX]

[DOI]

,

,

Shivangi Agarwal

,

,

,

Philip Amortila

,

CoRR, February, 2025

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

Akshay Krishnamurthy

,

Dylan J. Foster

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Self-Improvement in Language Models: The Sharpening Mechanism.

[BibT_eX]

[DOI]

,

,

Dylan J. Foster

,

,

,

,

,

Akshay Krishnamurthy

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification (extended abstract).

[BibT_eX]

[DOI]

,

,

,

Akshay Krishnamurthy

,

Dylan J. Foster

Proceedings of the Thirty Eighth Annual Conference on Learning Theory, 2025

2024

Non-adaptive Online Finetuning for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

Mohammad Ghavamzadeh

,

,

RLJ, 2024

Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Timing as an Action: Learning When to Observe and Act.

[BibT_eX]

[DOI]

,

,

Kamyar Azizzadenesheli

,

,

Zachary C. Lipton

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Reinforcement Learning in Low-rank MDPs with Density Features.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Machine Learning, 2023

Extended Abstract: Learning in Low-rank MDPs with Density Features.

[BibT_eX]

[DOI]

,

,

Proceedings of the 57th Annual Conference on Information Sciences and Systems, 2023

2022

Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Supervised Learning with General Risk Functionals.

[BibT_eX]

[DOI]

,

,

Zachary C. Lipton

,

Kamyar Azizzadenesheli

Proceedings of the International Conference on Machine Learning, 2022

Offline Reinforcement Learning with Realizability and Single-policy Concentrability.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

Off-Policy Risk Assessment for Markov Decision Processes.

[BibT_eX]

[DOI]

,

,

Zachary C. Lipton

,

Kamyar Azizzadenesheli

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk.

[BibT_eX]

[DOI]

,

,

Zachary C. Lipton

,

Kamyar Azizzadenesheli

CoRR, 2021

Off-Policy Risk Assessment in Contextual Bandits.

[BibT_eX]

[DOI]

,

,

Zachary C. Lipton

,

Kamyar Azizzadenesheli

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2019

Graph-Structured Visual Imitation.

[BibT_eX]

[DOI]

Maximilian Sieb

,

,

,

,

Katerina Fragkiadaki

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Loading...