Yann N. Dauphin

Affiliations:

Google AI, Accra, Ghana
Facebook AI Research, Menlo Park, CA, USA
University of Montréal, Department of Computer Science and Operations Research, Canada

According to our database¹, Yann N. Dauphin authored at least 55 papers between 2011 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2025

Capturing Individual Human Preferences with Reward Features.

[BibT_eX]

[DOI]

CoRR, March, 2025

Capturing Individual Human Preferences with Reward Features.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Avoiding spurious sharpness minimization broadens applicability of SAM.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

A density estimation perspective on learning from pairwise human preferences.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Towards Optimal Adapter Placement for Efficient Transfer Learning.

[BibT_eX]

[DOI]

Aleksandra Irena Nowak

CoRR, 2024

Neglected Hessian component explains mysteries in sharpness regularization.

[BibT_eX]

[DOI]

Yann N. Dauphin

Atish Agarwala

Hossein Mobahi

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Jaxpruner: A Concise Library for Sparsity Research.

[BibT_eX]

[DOI]

Proceedings of the Conference on Parsimony and Learning, 2024

2023

Temperature check: theory and practice for training models with softmax-cross-entropy losses.

[BibT_eX]

[DOI]

Atish Agarwala

Samuel Stern Schoenholz

Jeffrey Pennington

Yann N. Dauphin

Trans. Mach. Learn. Res., 2023

Has the Machine Learning Review Process Become More Arbitrary as the Field Has Grown? The NeurIPS 2021 Consistency Experiment.

[BibT_eX]

[DOI]

Alina Beygelzimer

Yann N. Dauphin

Percy Liang

Jennifer Wortman Vaughan

CoRR, 2023

Robustmix: Improving Robustness by Regularizing the Frequency Bias of Deep Nets.

[BibT_eX]

[DOI]

Jonas Ngnawé

Marianne Abemgnigni Njifon

Jonathan Heek

Yann N. Dauphin

CoRR, 2023

Tied-Augment: Controlling Representation Similarity Improves Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

SAM operates far from home: eigenvalue regularization as a dynamical phenomenon.

[BibT_eX]

[DOI]

Atish Agarwala

Yann N. Dauphin

Proceedings of the International Conference on Machine Learning, 2023

2022

How do Authors' Perceptions of their Papers Compare with Co-authors' Perceptions and Peer-review Decisions?

[BibT_eX]

[DOI]

Jennifer Wortman Vaughan

CoRR, 2022

No One Representation to Rule Them All: Overlapping Features of Training Methods.

[BibT_eX]

[DOI]

Raphael Gontijo Lopes

Yann N. Dauphin

Ekin Dogus Cubuk

Proceedings of the Tenth International Conference on Learning Representations, 2022

Gradient Flow in Sparse Neural Networks and How Lottery Tickets Win.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Continental-Scale Building Detection from High Resolution Satellite Imagery.

[BibT_eX]

[DOI]

Yasser Salah Eddine Bouchareb

CoRR, 2021

Auxiliary Task Update Decomposition: the Good, the Bad and the neutral.

[BibT_eX]

[DOI]

Lucio M. Dery

Yann N. Dauphin

David Grangier

Proceedings of the 9th International Conference on Learning Representations, 2021

Deconstructing the Regularization of BatchNorm.

[BibT_eX]

[DOI]

Yann N. Dauphin

Ekin Dogus Cubuk

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Robust and On-the-fly Dataset Denoising for Image Classification.

[BibT_eX]

[DOI]

CoRR, 2020

Robust and On-the-Fly Dataset Denoising for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Selective Brain Damage: Measuring the Disparate Impact of Model Pruning.

[BibT_eX]

[DOI]

CoRR, 2019

Simple and Effective Noisy Channel Modeling for Neural Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2019

MetaInit: Initializing learning by learning to initialize.

[BibT_eX]

[DOI]

Yann N. Dauphin

Samuel S. Schoenholz

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Fixup Initialization: Residual Learning Without Normalization.

[BibT_eX]

[DOI]

Hongyi Zhang

Yann N. Dauphin

Tengyu Ma

Proceedings of the 7th International Conference on Learning Representations, 2019

Pay Less Attention with Lightweight and Dynamic Convolutions.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Simple and Effective Noisy Channel Modeling for Neural Machine Translation.

[BibT_eX]

[DOI]

Kyra Yee

Yann N. Dauphin

Michael Auli

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

On the Pitfalls of Measuring Emergent Communication.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Strategies for Structuring Story Generation.

[BibT_eX]

[DOI]

Angela Fan

Mike Lewis

Yann N. Dauphin

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

mixup: Beyond Empirical Risk Minimization.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Empirical Analysis of the Hessian of Over-Parametrized Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical Neural Story Generation.

[BibT_eX]

[DOI]

Angela Fan

Mike Lewis

Yann N. Dauphin

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Tackling Over-pruning in Variational Autoencoders.

[BibT_eX]

[DOI]

CoRR, 2017

Convolutional Sequence to Sequence Learning.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Language Modeling with Gated Convolutional Networks.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Parseval Networks: Improving Robustness to Adversarial Examples.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Deal or No Deal? End-to-End Learning of Negotiation Dialogues.

[BibT_eX]

[DOI]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Convolutional Encoder Model for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

EmoNets: Multimodal deep learning approaches for emotion recognition in video.

[BibT_eX]

[DOI]

Samira Ebrahimi Kahou

Nicolas Boulanger-Lewandowski

Raul Chandias Ferrari

Christopher Joseph Pal

Yoshua Bengio

J. Multimodal User Interfaces, 2016

Predicting distributions with Linearizing Belief Networks.

[BibT_eX]

[DOI]

Yann N. Dauphin

David Grangier

Proceedings of the 4th International Conference on Learning Representations, 2016

Theano: A Python framework for fast computation of mathematical expressions.

[BibT_eX]

[DOI]

Nicolas Boulanger-Lewandowski

Xavier Bouthillier

Alexandre de Brébisson

Samira Ebrahimi Kahou

Pierre-Antoine Manzagol

Christopher Joseph Pal

S. Ramana Subramanyam

CoRR, 2016

2015

Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

RMSProp and equilibrated adaptive learning rates for non-convex optimization.

[BibT_eX]

[DOI]

CoRR, 2015

Equilibrated adaptive learning rates for non-convex optimization.

[BibT_eX]

[DOI]

Yann N. Dauphin

Harm de Vries

Yoshua Bengio

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014

On the saddle point problem for non-convex optimization.

[BibT_eX]

[DOI]

CoRR, 2014

Zero-Shot Learning and Clustering for Semantic Utterance Classification.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Learning Representations, 2014

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013

Big Neural Networks Waste Capacity

[BibT_eX]

[DOI]

Yann N. Dauphin

Yoshua Bengio

Proceedings of the 1st International Conference on Learning Representations, 2013

Stochastic Ratio Matching of RBMs for Sparse High-Dimensional Inputs.

[BibT_eX]

[DOI]

Yann N. Dauphin

Yoshua Bengio

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Better Mixing via Deep Representations.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

Combining modality specific deep neural networks for emotion recognition in video.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

2012

Unsupervised and Transfer Learning Challenge: a Deep Learning Approach.

[BibT_eX]

[DOI]

Proceedings of the Unsupervised and Transfer Learning, 2012

A Generative Process for Contractive Auto-Encoders.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Higher Order Contractive Auto-Encoder.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

The Manifold Tangent Classifier.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Large-Scale Learning of Embeddings with Reconstruction Sampling.

[BibT_eX]

[DOI]

Yann N. Dauphin

Xavier Glorot

Yoshua Bengio

Proceedings of the 28th International Conference on Machine Learning, 2011

Yann N. Dauphin

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...