James B. Simon

According to our database¹, James B. Simon authored at least 22 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Predicting kernel regression learning curves from only raw data statistics.

[BibT_eX]

[DOI]

CoRR, October, 2025

FACT: the Features At Convergence Theorem for neural networks.

[BibT_eX]

[DOI]

CoRR, July, 2025

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks.

[BibT_eX]

[DOI]

Daniel Kunin

Giovanni Luca Marchetti

Feng Chen

Dhruva Karkada

James B. Simon

Michael Robert DeWeese

Surya Ganguli

Nina Miolane

CoRR, June, 2025

Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape.

[BibT_eX]

[DOI]

Ioannis Bantzis

James B. Simon

Arthur Jacot

CoRR, May, 2025

Solvable Dynamics of Self-Supervised Word Embeddings and the Emergence of Analogical Reasoning.

[BibT_eX]

[DOI]

Dhruva Karkada

James B. Simon

Yasaman Bahri

Michael Robert DeWeese

CoRR, February, 2025

The Optimization Landscape of SGD Across the Feature Learning Strength.

[BibT_eX]

[DOI]

Alexander B. Atanasov

Alexandru Meterez

James B. Simon

Cengiz Pehlevan

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giambattista Parascandolo

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

The Eigenlearning Framework: A Conservation Law Perspective on Kernel Ridge Regression and Wide Neural Networks.

[BibT_eX]

[DOI]

James B. Simon

Madeline Dickens

Dhruva Karkada

Michael Robert DeWeese

Trans. Mach. Learn. Res., 2023

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.

[BibT_eX]

[DOI]

CoRR, 2023

A Spectral Condition for Feature Learning.

[BibT_eX]

[DOI]

Greg Yang

James B. Simon

Jeremy Bernstein

CoRR, 2023

Les Houches Lectures on Deep Learning at Large & Infinite Width.

[BibT_eX]

[DOI]

CoRR, 2023

Tune As You Scale: Hyperparameter Optimization For Compute Efficient Training.

[BibT_eX]

[DOI]

CoRR, 2023

On the Stepwise Nature of Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds.

[BibT_eX]

[DOI]

CoRR, 2022

On Kernel Regression with Data-Dependent Kernels.

[BibT_eX]

[DOI]

James B. Simon

CoRR, 2022

Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting.

[BibT_eX]

[DOI]

CoRR, 2022

Benign, Tempered, or Catastrophic: Toward a Refined Taxonomy of Overfitting.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Reverse Engineering the Neural Tangent Kernel.

[BibT_eX]

[DOI]

James Benjamin Simon

Sajant Anand

Michael Robert DeWeese

Proceedings of the International Conference on Machine Learning, 2022

SGD Can Converge to Local Maxima.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Neural Tangent Kernel Eigenvalues Accurately Predict Generalization.

[BibT_eX]

[DOI]

James B. Simon

Madeline Dickens

Michael Robert DeWeese

CoRR, 2021

James B. Simon

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...