Behnam Neyshabur

According to our database¹, Behnam Neyshabur authored at least 59 papers between 2013 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2024

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.

Trans. Mach. Learn. Res., 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

Aarohi Srivastava, Abhinav Rastogi, […], Abu Awal Md Shoeb, […], Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, […], Alexander W. Kocurek, […], Anantharaman S. Iyer, Anders Andreassen, […], Andrea Santilli, Andreas Stuhlmüller, […], Andrew K. Lampinen, […], Antonio Norelli, […], Arash Gholamidavoodi, […], Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, […], B. Ryan Roberts, […], Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, […], Bill Yuchen Lin, […], Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, […], Charles Rathkopf, […], Chris Callison-Burch, […], Christian Voigt, Christopher D. Manning, Christopher Potts, […], Clara E. Rivera, […], Courtney Ashcraft, Cristina Garbacea, […], Daniel Khashabi, […], Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, […], Daphne Ippolito, […], Debajyoti Datta, […], Dimitri Coelho Mollo, […], Ekaterina Shutova, Ekin Dogus Cubuk, […], Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, […], Emanuele Rodolà, […], Ethan J. Jerzak, […], Eunice Engefu Manyasi, Evgenii Zheltonozhskii, […], Fernando Martínez-Plumed, Francesca Happé, François Chollet, […], Genta Indra Winata, […], Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, […], Gonzalo Jaimovitch-López, […], Hana Galijasevic, […], Hannaneh Hajishirzi, […], Hinrich Schütze, […], Jack Geissinger, Jackson Kernion, […], Jaime Fernández Fisac, […], Janelle Wingfield, […], Jascha Sohl-Dickstein, […], Jekaterina Novikova, […], Jonathan Batchelder, Jonathan Berant, […], José Hernández-Orallo, Joseph Boudeman, […], Joshua B. Tenenbaum, […], Karthik Gopalakrishnan, Katerina Ignatyeva, […], Kaustubh D. Dhole, […], Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, […], Kyle Richardson, […], Lidia Contreras Ochando, Louis-Philippe Morency, […], Luis Oliveros Colón, […], Lütfi Kerem Senel, […], Maartje ter Hoeve, […], María José Ramírez-Quintana, […], Mario Giulianelli, […], Martin Potthast, Matthew L. Leavitt, […], Mátyás Schubert, Medina Baitemirova, […], Melvin McElrath, […], Michael I. Ivanitskiy, Michael Starritt, […], Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, […], Moin Aminnaseri, […], Mukund Varma T., […], Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, […], Nicole Martinez, […], Niklas Muennighoff, Nitish Shirish Keskar, […], Omar Elbaghdadi, […], Pablo Antonio Moreno Casares, […], Pegah Alipoormolabashi, […], Peter Eckersley, […], Piotr Milkowski, […], Pouya Pezeshkpour, […], Rachel Etta Rudolph, […], Raphaël Millière, […], Robbe Raymaekers, […], Ruslan Salakhutdinov, […], Saif M. Mohammad, […], Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, […], Sarik Ghazarian, […], Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, […], Shashank Srivastava, […], Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, […], Shyamolima (Shammie) Debnath, […], Simon Thormeyer, […], Sneha Priscilla Makini, […], Sriharsha Hatwar, Stanislas Dehaene, […], Stella Biderman, […], Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, […], Tatsu Hashimoto, […], Théo Desbordes, Theodore Rothschild, […], Tiberius Nkinyili, […], Tobias Gerstenberg, […], Trishala Neeraj, […], Victoria Nyamai, […], Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, […], William Saunders, […], Yadollah Yaghoobzadeh, […], Yonatan Belinkov, […]

Trans. Mach. Learn. Res., 2023

Long Range Language Modeling via Gated State Spaces.

[…], Behnam Neyshabur

Proceedings of the Eleventh International Conference on Learning Representations, 2023

REPAIR: REnormalizing Permuted Activations for Interpolation Repair.

[…], Behnam Neyshabur

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning.

Anders Johan Andreassen, […], Behnam Neyshabur, Rebecca Roelofs

Trans. Mach. Learn. Res., 2022

Convexifying Transformers: Improving optimization and understanding of transformer networks.

[…], Behnam Neyshabur, […]

CoRR, 2022

Layer-Stack Temperature Scaling.

[…], Michael C. Mozer, […], Behnam Neyshabur, Ibrahim Alabdulmohsin

CoRR, 2022

Teaching Algorithmic Reasoning via In-context Learning.

[…], Hugo Larochelle, Aaron C. Courville, Behnam Neyshabur, […]

CoRR, 2022

Understanding the effect of sparsity on neural networks robustness.

[…], Behnam Neyshabur, […]

CoRR, 2022

Data Scaling Laws in NMT: The Effect of Noise and Architecture.

[…], Behrooz Ghorbani, […], Behnam Neyshabur, […]

CoRR, 2022

Solving Quantitative Reasoning Problems with Language Models.

Aitor Lewkowycz, Anders Andreassen, […], Henryk Michalewski, Vinay V. Ramasesh, […], Theo Gutman-Solo, […], Behnam Neyshabur, […]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Block-Recurrent Transformers.

DeLesley Hutchins, […], Behnam Neyshabur

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring Length Generalization in Large Language Models.

[…], Anders Andreassen, Aitor Lewkowycz, […], Vinay V. Ramasesh, […], Behnam Neyshabur

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Revisiting Neural Scaling Laws in Language and Vision.

Ibrahim M. Alabdulmohsin, Behnam Neyshabur, […]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Data Scaling Laws in NMT: The Effect of Noise and Architecture.

[…], Behrooz Ghorbani, […], Behnam Neyshabur, […]

Proceedings of the International Conference on Machine Learning, 2022

A Loss Curvature Perspective on Training Instabilities of Deep Learning Models.

[…], Behrooz Ghorbani, […], Sneha Kudugunta, Behnam Neyshabur, […], George Edward Dahl, […]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Leveraging unlabeled data to predict out-of-distribution performance.

[…], Sivaraman Balakrishnan, Zachary Chase Lipton, Behnam Neyshabur, […]

Proceedings of the Tenth International Conference on Learning Representations, 2022

The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks.

[…], Behnam Neyshabur

Proceedings of the Tenth International Conference on Learning Representations, 2022

Exploring the Limits of Large Scale Pre-training.

[…], Mostafa Dehghani, Behnam Neyshabur, […]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

A Loss Curvature Perspective on Training Instability in Deep Learning.

[…], Behrooz Ghorbani, […], Sneha Kudugunta, Behnam Neyshabur, […]

CoRR, 2021

Deep Learning Through the Lens of Example Difficulty.

Robert J. N. Baldock, Hartmut Maennel, Behnam Neyshabur

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

When Do Curricula Work?

[…], Behnam Neyshabur

Proceedings of the 9th International Conference on Learning Representations, 2021

The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers.

Preetum Nakkiran, Behnam Neyshabur, […]

Proceedings of the 9th International Conference on Learning Representations, 2021

Understanding the failure modes of out-of-distribution generalization.

Vaishnavh Nagarajan, Anders Andreassen, Behnam Neyshabur

Proceedings of the 9th International Conference on Learning Representations, 2021

Extreme Memorization via Scale of Initialization.

[…], Behnam Neyshabur

Proceedings of the 9th International Conference on Learning Representations, 2021

Are wider nets better given the same number of parameters?

[…], Behnam Neyshabur

Proceedings of the 9th International Conference on Learning Representations, 2021

Sharpness-aware Minimization for Efficiently Improving Generalization.

[…], Behnam Neyshabur

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

NeurIPS 2020 Competition: Predicting Generalization in Deep Learning.

[…], Gintare Karolina Dziugaite, […], Suriya Gunasekar, […], Behnam Neyshabur

CoRR, 2020

The Deep Bootstrap: Good Online Learners are Good Offline Generalizers.

Preetum Nakkiran, Behnam Neyshabur, […]

CoRR, 2020

What is being transferred in transfer learning?

Behnam Neyshabur, […]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Learning Convolutions from Scratch.

Behnam Neyshabur

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Methods and Analysis of The First Competition in Predicting Generalization of Deep Learning.

[…], Sumukh K. Aithal, […], Natarajan Subramanyam, Carlos Lassance, […], Gintare Karolina Dziugaite, Suriya Gunasekar, […], Behnam Neyshabur, […]

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Observational Overfitting in Reinforcement Learning.

[…], Behnam Neyshabur

Proceedings of the 8th International Conference on Learning Representations, 2020

Fantastic Generalization Measures and Where to Find Them.

[…], Behnam Neyshabur, […]

Proceedings of the 8th International Conference on Learning Representations, 2020

The intriguing role of module criticality in the generalization of deep networks.

Niladri S. Chatterji, Behnam Neyshabur, […]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

The role of over-parametrization in generalization of neural networks.

Behnam Neyshabur, […], Srinadh Bhojanapalli, […]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks.

Behnam Neyshabur, […], Srinadh Bhojanapalli, […]

CoRR, 2018

Predicting protein-protein interactions through sequence-based deep learning.

Somaye Hashemifar, Behnam Neyshabur, […]

Bioinform., 2018

Stronger Generalization Bounds for Deep Nets via a Compression Approach.

[…], Behnam Neyshabur, […]

Proceedings of the 35th International Conference on Machine Learning, 2018

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks.

Behnam Neyshabur, Srinadh Bhojanapalli, […]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Implicit Regularization in Deep Learning.

Behnam Neyshabur

CoRR, 2017

Geometry of Optimization and Implicit Regularization in Deep Learning.

Behnam Neyshabur, […], Ruslan Salakhutdinov, […]

CoRR, 2017

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks.

Behnam Neyshabur, Srinadh Bhojanapalli, David McAllester, […]

CoRR, 2017

Stabilizing GAN Training with Multiple Random Projections.

Behnam Neyshabur, Srinadh Bhojanapalli, Ayan Chakrabarti

CoRR, 2017

Exploring Generalization in Deep Learning.

Behnam Neyshabur, Srinadh Bhojanapalli, David McAllester, […]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Implicit Regularization in Matrix Factorization.

Suriya Gunasekar, Blake E. Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, […]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Corralling a Band of Bandit Algorithms.

[…], Behnam Neyshabur, Robert E. Schapire

Proceedings of the 30th Conference on Learning Theory, 2017

2016

Data-Dependent Path Normalization in Neural Networks.

Behnam Neyshabur, […], Ruslan Salakhutdinov, […]

Proceedings of the 4th International Conference on Learning Representations, 2016

Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations.

Behnam Neyshabur, […], Ruslan Salakhutdinov, […]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Global Optimality of Local Search for Low Rank Matrix Recovery.

Srinadh Bhojanapalli, Behnam Neyshabur, […]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning.

Behnam Neyshabur, […]

Proceedings of the 3rd International Conference on Learning Representations, 2015

Path-SGD: Path-Normalized Optimization in Deep Neural Networks.

Behnam Neyshabur, Ruslan Salakhutdinov, […]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

On Symmetric and Asymmetric LSHs for Inner Product Search.

Behnam Neyshabur, […]

Proceedings of the 32nd International Conference on Machine Learning, 2015

Norm-Based Capacity Control in Neural Networks.

Behnam Neyshabur, […]

Proceedings of The 28th Conference on Learning Theory, 2015

Joint inference of tissue-specific networks with a scale free topology.

Somaye Hashemifar, Behnam Neyshabur, […]

Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

2014

Clustering, Hamming Embedding, Generalized LSH and the Max Norm.

Behnam Neyshabur, Yury Makarychev, […]

Proceedings of the Algorithmic Learning Theory - 25th International Conference, 2014

2013

Sparse Matrix Factorization.

Behnam Neyshabur, […]

CoRR, 2013

NETAL: a new graph-based method for global alignment of protein-protein interaction networks.

Behnam Neyshabur, Ahmadreza Khadem, Somaye Hashemifar, Seyed Shahriar Arab

Bioinform., 2013

The Power of Asymmetry in Binary Hashing.

Behnam Neyshabur, […], Ruslan Salakhutdinov, Yury Makarychev, Payman Yadollahpour

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
