We stand with Ukraine

We stand with Ukraine

Samuel L. Smith

According to our database¹, Samuel L. Smith authored at least 28 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

[DOI]

Aleksandar Botev

,

,

Samuel L. Smith

,

Anushan Fernando

,

George-Cristian Muraru

,

,

Leonard Berrada

,

,

Pier Giuseppe Sessa

,

,

Léonard Hussenot

,

,

,

,

,

Kathleen Kenealy

,

,

,

Surya Bhupatiraju

,

,

,

Morgane Rivière

,

Mihir Sanjay Kale

,

,

,

,

,

,

,

Srivatsan Srinivasan

,

Guillaume Desjardins

,

,

,

,

,

,

Sebastian Borgeaud

,

,

,

Antonia Paterson

,

,

,

,

Nesh Devanathan

,

,

,

,

Luiz Gustavo Martins

,

,

David Huntsperger

,

,

,

,

,

,

Zoubin Ghahramani

,

Clément Farabet

,

Koray Kavukcuoglu

,

,

,

,

Nando de Frietas

CoRR, 2024

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models.

[DOI]

,

Samuel L. Smith

,

Anushan Fernando

,

Aleksandar Botev

,

George-Cristian Muraru

,

,

,

Leonard Berrada

,

,

Srivatsan Srinivasan

,

Guillaume Desjardins

,

,

,

,

,

Nando de Freitas

,

Caglar Gulcehre

CoRR, 2024

Universality of Linear Recurrences Followed by Non-linear Projections: Finite-Width Guarantees and Benefits of Complex Eigenvalues.

[DOI]

Antonio Orvieto

,

,

Caglar Gulcehre

,

,

Samuel L. Smith

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

ConvNets Match Vision Transformers at Scale.

[DOI]

Samuel L. Smith

,

,

Leonard Berrada

,

CoRR, 2023

Unlocking Accuracy and Fairness in Differentially Private Image Classification.

[DOI]

Leonard Berrada

,

,

Judy Hanwen Shen

,

,

Robert Stanforth

,

,

,

Samuel L. Smith

,

CoRR, 2023

On the Universality of Linear Recurrences Followed by Nonlinear Projections.

[DOI]

Antonio Orvieto

,

,

Çaglar Gülçehre

,

,

Samuel L. Smith

CoRR, 2023

Differentially Private Diffusion Models Generate Useful Synthetic Images.

[DOI]

Sahra Ghalebikesabi

,

Leonard Berrada

,

,

,

Robert Stanforth

,

,

,

Samuel L. Smith

,

,

CoRR, 2023

Resurrecting Recurrent Neural Networks for Long Sequences.

[DOI]

Antonio Orvieto

,

Samuel L. Smith

,

,

Anushan Fernando

,

Çaglar Gülçehre

,

,

Proceedings of the International Conference on Machine Learning, 2023

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation.

[DOI]

,

,

,

Aleksandar Botev

,

,

Samuel L. Smith

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Unlocking High-Accuracy Differentially Private Image Classification through Scale.

[DOI]

,

Leonard Berrada

,

,

Samuel L. Smith

,

CoRR, 2022

2021

A study on the plasticity of neural networks.

[DOI]

,

Wojciech Czarnecki

,

,

Jörg Bornschein

,

Samuel L. Smith

,

,

Claudia Clopath

CoRR, 2021

Drawing Multiple Augmentation Samples Per Image During Training Efficiently Decreases Test Error.

[DOI]

,

,

,

,

Samuel L. Smith

CoRR, 2021

High-Performance Large-Scale Image Recognition Without Normalization.

[DOI]

,

,

Samuel L. Smith

,

Proceedings of the 38th International Conference on Machine Learning, 2021

On the Origin of Implicit Regularization in Stochastic Gradient Descent.

[DOI]

Samuel L. Smith

,

,

David G. T. Barrett

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Characterizing signal propagation to close the performance gap in unnormalized ResNets.

[DOI]

,

,

Samuel L. Smith

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

BYOL works even without batch statistics.

[DOI]

Pierre H. Richemond

,

Jean-Bastien Grill

,

Florent Altché

,

Corentin Tallec

,

,

,

Samuel L. Smith

,

,

,

,

CoRR, 2020

Cold Posteriors and Aleatoric Uncertainty.

[DOI]

,

,

Samuel L. Smith

CoRR, 2020

Batch Normalization Biases Deep Residual Networks Towards Shallow Paths.

[DOI]

,

Samuel L. Smith

CoRR, 2020

Batch Normalization Biases Residual Blocks Towards the Identity Function in Deep Networks.

[DOI]

,

Samuel L. Smith

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Generalization Benefit of Noise in Stochastic Gradient Descent.

[DOI]

Samuel L. Smith

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study.

[DOI]

,

Jascha Sohl-Dickstein

,

,

Samuel L. Smith

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Stochastic natural gradient descent draws posterior samples in function space.

[DOI]

Samuel L. Smith

,

Daniel Duckworth

,

,

Jascha Sohl-Dickstein

CoRR, 2018

Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks.

[DOI]

Vitalii Zhelezniak

,

,

,

Samuel L. Smith

,

Nils Y. Hammerla

Proceedings of the 6th International Conference on Learning Representations, 2018

A Bayesian Perspective on Generalization and Stochastic Gradient Descent.

[DOI]

Samuel L. Smith

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Don't Decay the Learning Rate, Increase the Batch Size.

[DOI]

Samuel L. Smith

,

Pieter-Jan Kindermans

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Don't Decay the Learning Rate, Increase the Batch Size.

[DOI]

Samuel L. Smith

,

Pieter-Jan Kindermans

,

CoRR, 2017

Offline bilingual word vectors, orthogonal transformations and the inverted softmax.

[DOI]

Samuel L. Smith

,

David H. P. Turban

,

,

Nils Y. Hammerla

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

Monte Carlo Sort for unreliable human comparisons.

[DOI]

Samuel L. Smith

CoRR, 2016

Loading...