James B. Simon

According to our database1, James B. Simon authored at least 21 papers between 2021 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
FACT: the Features At Convergence Theorem for neural networks.
CoRR, July, 2025

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks.
CoRR, June, 2025

Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape.
CoRR, May, 2025

Solvable Dynamics of Self-Supervised Word Embeddings and the Emergence of Analogical Reasoning.
CoRR, February, 2025

The Optimization Landscape of SGD Across the Feature Learning Strength.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

The Eigenlearning Framework: A Conservation Law Perspective on Kernel Ridge Regression and Wide Neural Networks.
Trans. Mach. Learn. Res., 2023

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.
CoRR, 2023

A Spectral Condition for Feature Learning.
CoRR, 2023

Les Houches Lectures on Deep Learning at Large & Infinite Width.
CoRR, 2023

Tune As You Scale: Hyperparameter Optimization For Compute Efficient Training.
CoRR, 2023

On the Stepwise Nature of Self-Supervised Learning.
Proceedings of the International Conference on Machine Learning, 2023

2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds.
CoRR, 2022

On Kernel Regression with Data-Dependent Kernels.
CoRR, 2022

Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting.
CoRR, 2022

Benign, Tempered, or Catastrophic: Toward a Refined Taxonomy of Overfitting.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Reverse Engineering the Neural Tangent Kernel.
Proceedings of the International Conference on Machine Learning, 2022

SGD Can Converge to Local Maxima.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Neural Tangent Kernel Eigenvalues Accurately Predict Generalization.
CoRR, 2021


  Loading...