Justin Gilmer

ORCID: 0009-0003-4813-7874

According to our database, Justin Gilmer authored at least 40 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number of three.
Erdős number of two.

Bibliography

2024

Pre-trained Gaussian Processes for Bayesian Optimization.

J. Mach. Learn. Res., 2024

Small-scale proxies for large-scale Transformer training instabilities.

Jascha Sohl-Dickstein

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Replacing softmax with ReLU in Vision Transformers.

CoRR, 2023

Benchmarking Neural Network Training Algorithms.

CoRR, 2023

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Training Stability for Multitask Ranking Models in Recommender Systems.

Jiaxi Tang

Yoel Drori

Daryl Chang

Maheswaran Sathiamoorthy

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Scaling Vision Transformers to 22 Billion Parameters.

Proceedings of the International Conference on Machine Learning, 2023

2022

Adaptive Gradient Methods at the Edge of Stability.

CoRR, 2022

Pre-training helps Bayesian optimization too.

CoRR, 2022

AI system for fetal ultrasound in low-resource settings.

CoRR, 2022

Do Current Multi-Task Optimization Methods in Deep Learning Even Help?

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Loss Curvature Perspective on Training Instabilities of Deep Learning Models.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Predicting the utility of search spaces for black-box optimization: a simple, budget-aware approach.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

A Loss Curvature Perspective on Training Instability in Deep Learning.

CoRR, 2021

Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers.

CoRR, 2021

A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes.

Zachary Nado

Justin Gilmer

Christopher J. Shallue

Rohan Anil

George E. Dahl

CoRR, 2021

The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty.

Balaji Lakshminarayanan

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Improving Robustness Without Sacrificing Accuracy with Patch Gaussian Augmentation.

Raphael Gontijo Lopes

CoRR, 2019

MNIST-C: A Robustness Benchmark for Computer Vision.

Norman Mu

Justin Gilmer

CoRR, 2019

A Fourier Perspective on Model Robustness in Computer Vision.

Dong Yin

Raphael Gontijo Lopes

Jonathon Shlens

Ekin Dogus Cubuk

Justin Gilmer

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Adversarial Examples Are a Natural Consequence of Test Error in Noise.

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Motivating the Rules of the Game for Adversarial Example Research.

CoRR, 2018

Relational inductive biases, deep learning, and graph networks.

CoRR, 2018

Sanity Checks for Saliency Maps.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV).

Proceedings of the 35th International Conference on Machine Learning, 2018

Adversarial Spheres.

Proceedings of the 6th International Conference on Learning Representations, 2018

Local Explanation Methods for Deep Neural Networks Lack Sensitivity to Parameter Values.

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

A Communication Game Related to the Sensitivity Conjecture.

Justin Gilmer

Michal Koucký

Michael E. Saks

Theory Comput., 2017

Adversarial Patch.

CoRR, 2017

SVCCA: Singular Vector Canonical Correlation Analysis for Deep Understanding and Improvement.

Maithra Raghu

Justin Gilmer

Jason Yosinski

Jascha Sohl-Dickstein

CoRR, 2017

SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability.

Maithra Raghu

Justin Gilmer

Jason Yosinski

Jascha Sohl-Dickstein

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Neural Message Passing for Quantum Chemistry.

Proceedings of the 34th International Conference on Machine Learning, 2017

Input Switched Affine Networks: An RNN Architecture Designed for Interpretability.

Jakob N. Foerster

Justin Gilmer

Jascha Sohl-Dickstein

Jan Chorowski

David Sussillo

Proceedings of the 34th International Conference on Machine Learning, 2017

Deep Information Propagation.

Samuel S. Schoenholz

Justin Gilmer

Surya Ganguli

Jascha Sohl-Dickstein

Proceedings of the 5th International Conference on Learning Representations, 2017

Explaining the Learning Dynamics of Direct Feedback Alignment.

Jascha Sohl-Dickstein

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

A local central limit theorem for triangles in a random graph.

Justin Gilmer

Swastik Kopparty

Random Struct. Algorithms, 2016

Intelligible Language Modeling with Input Switched Affine Networks.

Jakob N. Foerster

Justin Gilmer

Jan Chorowski

Jascha Sohl-Dickstein

David Sussillo

CoRR, 2016

Composition limits and separating examples for some boolean function complexity measures.

Justin Gilmer

Michael E. Saks

Srikanth Srinivasan

Comb., 2016

2015

A New Approach to the Sensitivity Conjecture.

Justin Gilmer

Michal Koucký

Michael E. Saks

Proceedings of the 2015 Conference on Innovations in Theoretical Computer Science, 2015
