Nikhil Ghosh

According to our database, Nikhil Ghosh authored at least 18 papers between 2019 and 2026.

Bibliography

2026
There Will Be a Scientific Theory of Deep Learning.
CoRR, April, 2026

Does a Global Perspective Help Prune Sparse MoEs Elegantly?
CoRR, April, 2026

2025
Understanding the Mechanisms of Fast Hyperparameter Transfer.
CoRR, December, 2025

GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments.
CoRR, September, 2025

PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models.
CoRR, June, 2025

The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning.
J. Mach. Learn. Res., 2025

2024
The Impact of Initialization on LoRA Finetuning Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LoRA+: Efficient Low Rank Adaptation of Large Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
A Universal Trade-off Between the Model Size, Test Loss, and Training Loss of Linear Predictors.
SIAM J. Math. Data Sci., December, 2023

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory.
CoRR, 2023

The Power of External Memory in Increasing Predictive Model Capacity.
CoRR, 2023

Alternating Updates for Efficient Transformers.
CoRR, 2023

Alternating Updates for Efficient Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Deconstructing Distributions: A Pointwise Framework of Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

On the Benefits of Learning to Route in Mixture-of-Experts Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
The Three Stages of Learning Dynamics in High-dimensional Kernel Methods.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2019
Landmark Ordinal Embedding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

