Atish Agarwala

According to our database1, Atish Agarwala authored at least 9 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Gradient descent induces alignment between weights and the empirical NTK for deep non-linear networks.
CoRR, 2024

Neglected Hessian component explains mysteries in Sharpness regularization.
CoRR, 2024

2023
Temperature check: theory and practice for training models with softmax-cross-entropy losses.
Trans. Mach. Learn. Res., 2023

On the Interplay Between Stepsize Tuning and Progressive Sharpening.
CoRR, 2023

Second-order regression models exhibit progressive sharpening to the edge of stability.
Proceedings of the International Conference on Machine Learning, 2023

SAM operates far from home: eigenvalue regularization as a dynamical phenomenon.
Proceedings of the International Conference on Machine Learning, 2023

2022
Deep equilibrium networks are sensitive to initialization statistics.
Proceedings of the International Conference on Machine Learning, 2022

2021
One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Learning the gravitational force law and other analytic functions.
CoRR, 2020


  Loading...