Felix Dangel

According to our database, Felix Dangel authored at least 11 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective.
CoRR, 2024

2023
Backpropagation Beyond the Gradient.
PhD thesis, 2023

ViViT: Curvature Access Through The Generalized Gauss-Newton's Low-Rank Structure.
Trans. Mach. Learn. Res., 2023

Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC for Large Neural Nets.
CoRR, 2023

On the Disconnect Between Theory and Practice of Overparametrized Neural Networks.
CoRR, 2023

Convolutions Through the Lens of Tensor Networks.
CoRR, 2023

The Geometry of Neural Nets' Parameter Spaces Under Reparametrization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2021
Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
BackPACK: Packing more into Backprop.
Proceedings of the 8th International Conference on Learning Representations, 2020

Modular Block-diagonal Curvature Approximations for Feedforward Architectures.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
A Modular Approach to Block-diagonal Hessian Approximations for Second-order Optimization Methods.
CoRR, 2019

