James Martens

According to our database1, James Martens authored at least 34 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Disentangling the Causes of Plasticity Loss in Neural Networks.
CoRR, 2024

2023
Pre-training via Denoising for Molecular Property Prediction.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping.
CoRR, 2021

On the validity of kernel approximations for orthogonally-initialized neural networks.
CoRR, 2021

2020
New Insights and Perspectives on the Natural Gradient Method.
J. Mach. Learn. Res., 2020

Blockchain-based Verifiable Credential Sharing with Selective Disclosure.
Proceedings of the 19th IEEE International Conference on Trust, 2020

2019
Differentiable Game Mechanics.
J. Mach. Learn. Res., 2019

Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks.
CoRR, 2019

On the Variance of Unbiased Online Recurrent Optimization.
CoRR, 2019

Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Adversarial Robustness through Local Linearization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
The Mechanics of n-Player Differentiable Games.
Proceedings of the 35th International Conference on Machine Learning, 2018

Stochastic Gradient Langevin dynamics that Exploit Neural Network Structure.
Proceedings of the 6th International Conference on Learning Representations, 2018

Kronecker-factored Curvature Approximations for Recurrent Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Distributed Second-Order Optimization using Kronecker-Factored Approximations.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Second-order Optimization for Neural Networks.
PhD thesis, 2016

A Kronecker-factored approximate Fisher matrix for convolution layers.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Adding Gradient Noise Improves Learning for Very Deep Networks.
CoRR, 2015

Optimizing Neural Networks with Kronecker-factored Approximate Curvature.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
On the Expressive Efficiency of Sum Product Networks.
CoRR, 2014

New perspectives on the natural gradient method.
CoRR, 2014

2013
On the Expressive Power of Restricted Boltzmann Machines.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

On the importance of initialization and momentum in deep learning.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Training Deep and Recurrent Networks with Hessian-Free Optimization.
Proceedings of the Neural Networks: Tricks of the Trade - Second Edition, 2012

Estimating the Hessian by Back-propagating Curvature.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Normalization for probabilistic inference with neurons.
Biol. Cybern., 2011

Generating Text with Recurrent Neural Networks.
Proceedings of the 28th International Conference on Machine Learning, 2011

Learning Recurrent Neural Networks with Hessian-Free Optimization.
Proceedings of the 28th International Conference on Machine Learning, 2011

2010
Parallelizable Sampling of Markov Random Fields.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Learning the Linear Dynamical System with ASOS.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Deep learning via Hessian-free optimization.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


  Loading...