Rosie Zhao

According to our database, Rosie Zhao authored at least 19 papers between 2020 and 2025.

Bibliography

2025
Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs.
CoRR, June, 2025

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining.
CoRR, April, 2025

Distributional Scaling Laws for Emergent Capabilities.
CoRR, February, 2025

Deconstructing What Makes a Good Optimizer for Autoregressive Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SOAP: Improving and Stabilizing Shampoo using Adam for Language Modeling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Policy Gradient Methods in the Presence of Symmetries and State Abstractions.
J. Mach. Learn. Res., 2024

Creating a Cooperative AI Policymaking Platform through Open Source Collaboration.
CoRR, 2024

SOAP: Improving and Stabilizing Shampoo using Adam.
CoRR, 2024

Deconstructing What Makes a Good Optimizer for Language Models.
CoRR, 2024

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Feature emergence via margin maximization: case studies in algebraic tasks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
On the peel number and the leaf-height of Galton-Watson trees.
Comb. Probab. Comput., January, 2023

Loss of Plasticity in Continual Deep Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

2022
Lower Bound Methods for Sign-rank and their Limitations.
Electron. Colloquium Comput. Complex., 2022

Boolean functions with small approximate spectral norm.
Electron. Colloquium Comput. Complex., 2022

Leaf multiplicity in a Bienaymé-Galton-Watson tree.
Discret. Math. Theor. Comput. Sci., 2022

Continuous MDP Homomorphisms and Homomorphic Policy Gradient.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management.
CoRR, 2021

2020
A Study of Policy Gradient on a Class of Exactly Solvable Models.
CoRR, 2020

