George E. Dahl

According to our database1, George E. Dahl authored at least 37 papers between 2010 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Benchmarking Neural Network Training Algorithms.
CoRR, 2023

2022
Adaptive Gradient Methods at the Edge of Stability.
CoRR, 2022

Pre-training helps Bayesian optimization too.
CoRR, 2022

AI system for fetal ultrasound in low-resource settings.
CoRR, 2022

A Loss Curvature Perspective on Training Instabilities of Deep Learning Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Predicting the utility of search spaces for black-box optimization: a simple, budget-aware approach.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
A Loss Curvature Perspective on Training Instability in Deep Learning.
CoRR, 2021

Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers.
CoRR, 2021

A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes.
CoRR, 2021

What Will it Take to Fix Benchmarking in Natural Language Understanding?
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

2019
Measuring the Effects of Data Parallelism on Neural Network Training.
J. Mach. Learn. Res., 2019

On Empirical Comparisons of Optimizers for Deep Learning.
CoRR, 2019

Faster Neural Network Training with Data Echoing.
CoRR, 2019

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Motivating the Rules of the Game for Adversarial Example Research.
CoRR, 2018

Relational inductive biases, deep learning, and graph networks.
CoRR, 2018

Parallel Architecture and Hyperparameter Search via Successive Halving and Classification.
CoRR, 2018

Embedding Text in Hyperbolic Spaces.
Proceedings of the Twelfth Workshop on Graph-Based Methods for Natural Language Processing, 2018

Large scale distributed neural network training through online distillation.
Proceedings of the 6th International Conference on Learning Representations, 2018

The Importance of Generation Order in Language Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Detecting Cancer Metastases on Gigapixel Pathology Images.
CoRR, 2017

Neural Message Passing for Quantum Chemistry.
Proceedings of the 34th International Conference on Machine Learning, 2017

2015
Deep Learning Approaches to Problems in Speech Recognition, Computational Chemistry, and Natural Language Text Processing.
PhD thesis, 2015

Deep Convolutional Neural Networks for Large-scale Speech Tasks.
Neural Networks, 2015

Deep Neural Nets as a Method for Quantitative Structure-Activity Relationships.
J. Chem. Inf. Model., 2015

2014
Multi-task Neural Networks for QSAR Predictions.
CoRR, 2014

2013
On the importance of initialization and momentum in deep learning.
Proceedings of the 30th International Conference on Machine Learning, 2013

Improving deep neural networks for LVCSR using rectified linear units and dropout.
Proceedings of the IEEE International Conference on Acoustics, 2013

Large-scale malware classification using random projections and neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improvements to Deep Convolutional Neural Networks for LVCSR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Acoustic Modeling Using Deep Belief Networks.
IEEE Trans. Speech Audio Process., 2012

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition.
IEEE Trans. Speech Audio Process., 2012

Training Restricted Boltzmann Machines on Word Observations.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Deep Belief Networks using discriminative features for phone recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Large vocabulary continuous speech recognition with context-dependent DBN-HMMS.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Incorporating Side Information in Probabilistic Matrix Factorization with Gaussian Processes.
Proceedings of the UAI 2010, 2010

Phone Recognition with the Mean-Covariance Restricted Boltzmann Machine.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010


  Loading...