Nikhil Ghosh

According to our database¹, Nikhil Ghosh authored at least 18 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

There Will Be a Scientific Theory of Deep Learning.

[BibT_eX]

[DOI]

Jamie Simon

Daniel Kunin

Alexander B. Atanasov

CoRR, April, 2026

Does a Global Perspective Help Prune Sparse MoEs Elegantly?

[BibT_eX]

[DOI]

CoRR, April, 2026

2025

Understanding the Mechanisms of Fast Hyperparameter Transfer.

[BibT_eX]

[DOI]

Nikhil Ghosh

Denny Wu

Alberto Bietti

CoRR, December, 2025

GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments.

[BibT_eX]

[DOI]

CoRR, September, 2025

PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models.

[BibT_eX]

[DOI]

Soufiane Hayou

Nikhil Ghosh

Bin Yu

CoRR, June, 2025

The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2025

2024

The Impact of Initialization on LoRA Finetuning Dynamics.

[BibT_eX]

[DOI]

Soufiane Hayou

Nikhil Ghosh

Bin Yu

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LoRA+: Efficient Low Rank Adaptation of Large Models.

[BibT_eX]

[DOI]

Soufiane Hayou

Nikhil Ghosh

Bin Yu

Proceedings of the Forty-first International Conference on Machine Learning, 2024

More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

A Universal Trade-off Between the Model Size, Test Loss, and Training Loss of Linear Predictors.

[BibT_eX]

[DOI]

Nikhil Ghosh

Mikhail Belkin

SIAM J. Math. Data Sci., December, 2023

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.

[BibT_eX]

[DOI]

CoRR, 2023

The Power of External Memory in Increasing Predictive Model Capacity.

[BibT_eX]

[DOI]

CoRR, 2023

Alternating Updates for Efficient Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

Alternating Updates for Efficient Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Deconstructing Distributions: A Pointwise Framework of Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

On the Benefits of Learning to Route in Mixture-of-Experts Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

The Three Stages of Learning Dynamics in High-dimensional Kernel Methods.

[BibT_eX]

[DOI]

Nikhil Ghosh

Song Mei

Bin Yu

Proceedings of the Tenth International Conference on Learning Representations, 2022

2019

Landmark Ordinal Embedding.

[BibT_eX]

[DOI]

Nikhil Ghosh

Yuxin Chen

Yisong Yue

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Nikhil Ghosh

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...