Dayal Singh Kalra

According to our database, Dayal Singh Kalra authored at least 6 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
When Can You Get Away with Low Memory Adam?
CoRR, March, 2025

(How) Can Transformers Predict Pseudo-Random Numbers?
CoRR, February, 2025

Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Why Warmup the Learning Rate? Underlying Mechanisms and Improvements.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Phase diagram of training dynamics in deep neural networks: effect of learning rate, depth, and width.
CoRR, 2023

Phase diagram of early training dynamics in deep neural networks: effect of the learning rate, depth, and width.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

