Yann N. Dauphin

Affiliations:
  • Google AI, Accra, Ghana
  • Facebook AI Research, Menlo Park, CA, USA
  • University of Montréal, Department of Computer Science and Operations Research, Canada


According to our database1, Yann N. Dauphin authored at least 51 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Neglected Hessian component explains mysteries in Sharpness regularization.
CoRR, 2024

2023
Temperature check: theory and practice for training models with softmax-cross-entropy losses.
Trans. Mach. Learn. Res., 2023

A density estimation perspective on learning from pairwise human preferences.
CoRR, 2023

Has the Machine Learning Review Process Become More Arbitrary as the Field Has Grown? The NeurIPS 2021 Consistency Experiment.
CoRR, 2023

JaxPruner: A concise library for sparsity research.
CoRR, 2023

Robustmix: Improving Robustness by Regularizing the Frequency Bias of Deep Nets.
CoRR, 2023

Tied-Augment: Controlling Representation Similarity Improves Data Augmentation.
Proceedings of the International Conference on Machine Learning, 2023

SAM operates far from home: eigenvalue regularization as a dynamical phenomenon.
Proceedings of the International Conference on Machine Learning, 2023

2022
How do Authors' Perceptions of their Papers Compare with Co-authors' Perceptions and Peer-review Decisions?
CoRR, 2022

No One Representation to Rule Them All: Overlapping Features of Training Methods.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Gradient Flow in Sparse Neural Networks and How Lottery Tickets Win.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Continental-Scale Building Detection from High Resolution Satellite Imagery.
CoRR, 2021

Auxiliary Task Update Decomposition: the Good, the Bad and the neutral.
Proceedings of the 9th International Conference on Learning Representations, 2021

Deconstructing the Regularization of BatchNorm.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Robust and On-the-fly Dataset Denoising for Image Classification.
CoRR, 2020

Robust and On-the-Fly Dataset Denoising for Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Selective Brain Damage: Measuring the Disparate Impact of Model Pruning.
CoRR, 2019

Simple and Effective Noisy Channel Modeling for Neural Machine Translation.
CoRR, 2019

MetaInit: Initializing learning by learning to initialize.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Fixup Initialization: Residual Learning Without Normalization.
Proceedings of the 7th International Conference on Learning Representations, 2019

Pay Less Attention with Lightweight and Dynamic Convolutions.
Proceedings of the 7th International Conference on Learning Representations, 2019

Simple and Effective Noisy Channel Modeling for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

On the Pitfalls of Measuring Emergent Communication.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Strategies for Structuring Story Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
mixup: Beyond Empirical Risk Minimization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Empirical Analysis of the Hessian of Over-Parametrized Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical Neural Story Generation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Tackling Over-pruning in Variational Autoencoders.
CoRR, 2017

Convolutional Sequence to Sequence Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Language Modeling with Gated Convolutional Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

Parseval Networks: Improving Robustness to Adversarial Examples.
Proceedings of the 34th International Conference on Machine Learning, 2017

Deal or No Deal? End-to-End Learning of Negotiation Dialogues.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Convolutional Encoder Model for Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
EmoNets: Multimodal deep learning approaches for emotion recognition in video.
J. Multimodal User Interfaces, 2016

Predicting distributions with Linearizing Belief Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Theano: A Python framework for fast computation of mathematical expressions.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2016

2015
Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

RMSProp and equilibrated adaptive learning rates for non-convex optimization.
CoRR, 2015

Equilibrated adaptive learning rates for non-convex optimization.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014
On the saddle point problem for non-convex optimization.
CoRR, 2014

Zero-Shot Learning and Clustering for Semantic Utterance Classification.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
Big Neural Networks Waste Capacity
Proceedings of the 1st International Conference on Learning Representations, 2013

Stochastic Ratio Matching of RBMs for Sparse High-Dimensional Inputs.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Better Mixing via Deep Representations.
Proceedings of the 30th International Conference on Machine Learning, 2013


2012
Unsupervised and Transfer Learning Challenge: a Deep Learning Approach.
Proceedings of the Unsupervised and Transfer Learning, 2012

A Generative Process for Contractive Auto-Encoders.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Higher Order Contractive Auto-Encoder.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

The Manifold Tangent Classifier.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Large-Scale Learning of Embeddings with Reconstruction Sampling.
Proceedings of the 28th International Conference on Machine Learning, 2011


  Loading...