Stanislaw Jastrzebski

According to our database, Stanislaw Jastrzebski authored at least 26 papers between 2013 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Large Scale Structure of Neural Network Loss Landscapes.
CoRR, 2019

Split Batch Normalization: Improving Semi-Supervised Learning under Domain Shift.
CoRR, 2019

Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening.
CoRR, 2019

Non-linear ICA based on Cramer-Wold metric.
CoRR, 2019

Parameter-Efficient Transfer Learning for NLP.
CoRR, 2019

Parameter-Efficient Transfer Learning for NLP.
Proceedings of the 36th International Conference on Machine Learning, 2019

On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length.
Proceedings of the 7th International Conference on Learning Representations, 2019

Dynamical Isometry is Achieved in Residual Networks in a Universal Way for any Activation Function.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Neural Architecture Search Over a Graph Search Space.
CoRR, 2018

Dynamical Isometry is Achieved in Residual Networks in a Universal Way for any Activation Function.
CoRR, 2018

DNN's Sharpest Directions Along the SGD Trajectory.
CoRR, 2018

Cramer-Wold AutoEncoder.
CoRR, 2018

Commonsense mining as knowledge base completion? A study on the impact of novelty.
CoRR, 2018

Finding Flatter Minima with SGD.
Proceedings of the 6th International Conference on Learning Representations, 2018

Residual Connections Encourage Iterative Inference.
Proceedings of the 6th International Conference on Learning Representations, 2018

Width of Minima Reached by Stochastic Gradient Descent is Influenced by Learning Rate to Batch Size Ratio.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

2017
Three Factors Influencing Minima in SGD.
CoRR, 2017

Residual Connections Encourage Iterative Inference.
CoRR, 2017

How to evaluate word embeddings? On importance of data efficiency and simple supervised tasks.
CoRR, 2017

Learning to Compute Word Embeddings On the Fly.
CoRR, 2017

A Closer Look at Memorization in Deep Networks.
CoRR, 2017

A Closer Look at Memorization in Deep Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

Deep Nets Don't Learn via Memorization.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Osprey: Hyperparameter Optimization for Machine Learning.
J. Open Source Software, 2016

Learning to SMILE(S).
CoRR, 2016

2013
Density Invariant Detection of Osteoporosis Using Growing Neural Gas.
Proceedings of the 8th International Conference on Computer Recognition Systems CORES 2013, 2013

