Roger B. Grosse

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Understanding and Mitigating Exploding Inverses in Invertible Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Learning Branching Heuristics for Propositional Model Counting.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

The Scattering Compositional Learner: Discovering Objects, Attributes, Relationships in Analogical Reasoning.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Branching Heuristics for Propositional Model Counting.

[BibT_eX]

[DOI]

CoRR, 2020

Regularized linear autoencoders recover the principal components, eventually.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians.

[BibT_eX]

[DOI]

Juhan Bae

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Evaluating Lossy Compression Rates of Deep Generative Models.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Picking Winning Tickets Before Training by Preserving Gradient Flow.

[BibT_eX]

[DOI]

Chaoqi Wang

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model.

[BibT_eX]

[DOI]

Christopher J. Shallue

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Sorting Out Lipschitz Function Approximation.

[BibT_eX]

[DOI]

Cem Anil

James Lucas

Proceedings of the 36th International Conference on Machine Learning, 2019

Three Mechanisms of Weight Decay Regularization.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Functional variational Bayesian Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Understanding Posterior Collapse in Generative Latent Variable Models.

[BibT_eX]

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Aggregated Momentum: Stability Through Passive Damping.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Eigenvalue Corrected Noisy Natural Gradient.

[BibT_eX]

[DOI]

Juhan Bae

CoRR, 2018

A Coordinate-Free Construction of Scalable Natural Gradient.

[BibT_eX]

[DOI]

Kevin Luk

CoRR, 2018

Aggregated Momentum: Stability Through Passive Damping.

[BibT_eX]

[DOI]

James Lucas

Richard S. Zemel

CoRR, 2018

Reversible Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Isolating Sources of Disentanglement in Variational Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Noisy Natural Gradient as Variational Inference.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Adversarial Distillation of Bayesian Neural Network Posteriors.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Differentiable Compositional Kernel Learning for Gaussian Processes.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Understanding Short-Horizon Bias in Stochastic Meta-Optimization.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Flipout: Efficient Pseudo-Independent Weight Perturbations on Mini-Batches.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Stochastic Gradient Langevin dynamics that Exploit Neural Network Structure.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Reversible Residual Network: Backpropagation Without Storing Activations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

On the Quantitative Analysis of Decoder-Based Generative Models.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Distributed Second-Order Optimization using Kronecker-Factored Approximations.

[BibT_eX]

[DOI]

Jimmy Ba

Proceedings of the 5th International Conference on Learning Representations, 2017

Discovering and Exploiting Additive Structure for Bayesian Optimization.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016

Importance Weighted Autoencoders.

[BibT_eX]

[DOI]

Yuri Burda

Proceedings of the 4th International Conference on Learning Representations, 2016

Measuring the reliability of MCMC inference with bidirectional Monte Carlo.

[BibT_eX]

[DOI]

Siddharth Ancha

Daniel M. Roy

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A Kronecker-factored approximate Fisher matrix for convolution layers.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Sandwiching the marginal likelihood using bidirectional Monte Carlo.

[BibT_eX]

[DOI]

Zoubin Ghahramani

Ryan P. Adams

CoRR, 2015

Statistical Inference, Learning and Models in Big Data.

[BibT_eX]

[DOI]

Alessandro Selvitella

CoRR, 2015

Learning Wake-Sleep Recurrent Attention Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Optimizing Neural Networks with Kronecker-factored Approximate Curvature.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

Scaling up Natural Gradient by Sparsely Factorizing the Inverse Fisher Matrix.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

Accurate and conservative estimates of MRF log-likelihood using reverse annealing.

[BibT_eX]

[DOI]

Yuri Burda

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014

Model selection in compositional spaces.

[BibT_eX]

[DOI]

Roger Baker Grosse

PhD thesis, 2014

Testing MCMC code.

[BibT_eX]

[DOI]

David Kristjanson Duvenaud

CoRR, 2014

Automatic Construction and Natural-Language Description of Nonparametric Regression Models.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Annealing between distributions by averaging moments.

[BibT_eX]

[DOI]

Chris J. Maddison