Andrey Gromov

According to our database, Andrey Gromov authored at least 9 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
The Unreasonable Ineffectiveness of the Deeper Layers.
CoRR, 2024

Bridging Associative Memory and Probabilistic Modeling.
CoRR, 2024

2023
Deconstructing the generalization gap.
Nat. Mac. Intell., December, 2023

To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets.
CoRR, 2023

Grokking modular arithmetic.
CoRR, 2023

Critical Initialization of Wide and Deep Neural Networks using Partial Jacobians: General Theory and Applications.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
AutoInit: Automatic Initialization via Jacobian Tuning.
CoRR, 2022

2021
Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm.
CoRR, 2021

2014
The Concept of Automation in Conventional Systems Creation Applied to the Preliminary Aircraft Design.
Proceedings of the Soft Computing in Computer and Information Science, 2014

