Srinadh Bhojanapalli

Orcid: 0000-0002-4147-2106

According to our database¹, Srinadh Bhojanapalli authored at least 53 papers between 2014 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Autoregressive Ranking: Bridging the Gap Between Dual and Cross Encoders.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

Scalable In-context Ranking with Generative Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Spark Transformer: Reactivating Sparsity in FFN and Attention.

[BibT_eX]

[DOI]

CoRR, June, 2025

Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Mimetic Initialization Helps State Space Models Learn to Recall.

[BibT_eX]

[DOI]

CoRR, 2024

Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Language Model Architectures for Differentially Private Federated Learning.

[BibT_eX]

[DOI]

Ananda Theertha Suresh

CoRR, 2024

HiRE: High Recall Approximate Top-k Estimation for Efficient LLM Inference.

[BibT_eX]

[DOI]

CoRR, 2024

Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Functional Interpolation for Relative Positions improves Long Context Transformers.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Dual-Encoders for Extreme Multi-label Classification.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Efficacy of Dual-Encoders for Extreme Multi-Label Classification.

[BibT_eX]

[DOI]

CoRR, 2023

Depth Dependence of μP Learning Rates in ReLU MLPs.

[BibT_eX]

[DOI]

CoRR, 2023

On student-teacher deviations in distillation: does it pay to disobey?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Treeformer: Dense Gradient Trees for Efficient Attention Computation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Teacher's pet: understanding and mitigating biases in distillation.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2022

Large Models are Parsimonious Learners: Activation Sparsity in Trained Transformers.

[BibT_eX]

[DOI]

CoRR, 2022

On the Adversarial Robustness of Mixture of Experts.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Robust Training of Neural Networks Using Scale Invariant Architectures.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

Leveraging redundancy in attention with Reuse Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Eigen Analysis of Self-Attention and its Reconstruction from Partial Computation.

[BibT_eX]

[DOI]

CoRR, 2021

Demystifying the Better Performance of Position Encoding Variants for Transformer.

[BibT_eX]

[DOI]

CoRR, 2021

On the Reproducibility of Neural Network Predictions.

[BibT_eX]

[DOI]

CoRR, 2021

Coping with Label Shift via Distributionally Robust Optimisation.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Understanding Robustness of Transformers for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple and Effective Positional Encoding for Transformers.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Modifying Memories in Transformer Models.

[BibT_eX]

[DOI]

CoRR, 2020

O(n) Connections are Expressive Enough: Universal Approximability of Sparse Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

An efficient nonconvex reformulation of stagewise convex optimization problems.

[BibT_eX]

[DOI]

Rudy Bunel

Oliver Hinder

Srinadh Bhojanapalli

Krishnamurthy Dvijotham

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Does label smoothing mitigate label noise?

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Low-Rank Bottleneck in Multi-head Attention Models.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Are Transformers universal approximators of sequence-to-sequence functions?

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Large Batch Optimization for Deep Learning: Training BERT in 76 minutes.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Semantic Label Smoothing for Sequence to Sequence Problems.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

The role of over-parametrization in generalization of neural networks.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2018

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks.

[BibT_eX]

[DOI]

Behnam Neyshabur

Srinadh Bhojanapalli

Nathan Srebro

Proceedings of the 6th International Conference on Learning Representations, 2018

Smoothed analysis for low-rank solutions to semidefinite programs in quadratic penalty form.

[BibT_eX]

[DOI]

Proceedings of the Conference On Learning Theory, 2018

2017

Provable quantum state tomography via non-convex methods.

[BibT_eX]

[DOI]

Anastasios Kyrillidis

Amir Kalev

Dohyung Park

Srinadh Bhojanapalli

Constantine Caramanis

Sujay Sanghavi

CoRR, 2017

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Stabilizing GAN Training with Multiple Random Projections.

[BibT_eX]

[DOI]

Behnam Neyshabur

Srinadh Bhojanapalli

Ayan Chakrabarti

CoRR, 2017

Exploring Generalization in Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Implicit Regularization in Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016

Provable non-convex projected gradient descent for a class of constrained matrix optimization problems.

[BibT_eX]

[DOI]

Dohyung Park

Anastasios Kyrillidis

Srinadh Bhojanapalli

Constantine Caramanis

Sujay Sanghavi

CoRR, 2016

Single Pass PCA of Matrix Products.

[BibT_eX]

[DOI]

Shanshan Wu

Srinadh Bhojanapalli

Sujay Sanghavi

Alexandros G. Dimakis

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Global Optimality of Local Search for Low Rank Matrix Recovery.

[BibT_eX]

[DOI]

Srinadh Bhojanapalli

Behnam Neyshabur

Nati Srebro

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Dropping Convexity for Faster Semi-definite Optimization.

[BibT_eX]

[DOI]

Srinadh Bhojanapalli

Anastasios Kyrillidis

Sujay Sanghavi

Proceedings of the 29th Conference on Learning Theory, 2016

2015

Completing any low-rank matrix, provably.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2015

A New Sampling Technique for Tensors.

[BibT_eX]

[DOI]

Srinadh Bhojanapalli

Sujay Sanghavi

CoRR, 2015

Tighter Low-rank Approximation via Sampling the Leveraged Element.

[BibT_eX]

[DOI]

Srinadh Bhojanapalli

Prateek Jain

Sujay Sanghavi

Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, 2015

2014

Coherent Matrix Completion.

[BibT_eX]

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

Universal Matrix Completion.

[BibT_eX]

[DOI]

Srinadh Bhojanapalli

Prateek Jain

Proceedings of the 31th International Conference on Machine Learning, 2014

Srinadh Bhojanapalli

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...