Jack W. Rae

According to our database, Jack W. Rae authored at least 24 papers between 2016 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Training Compute-Optimal Large Language Models.
CoRR, 2022

Unified Scaling Laws for Routed Language Models.
CoRR, 2022

An empirical analysis of compute-optimal large language model training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022



2021
Towards lifelong reasoning with sparse and compressive memory systems.
PhD thesis, 2021

Scaling Language Models: Methods, Analysis & Insights from Training Gopher.
CoRR, 2021

2020
Top-KAST: Top-K Always Sparse Training.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stabilizing Transformers for Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control.
Proceedings of the 8th International Conference on Learning Representations, 2020

Compressive Transformers for Long-Range Sequence Modelling.
Proceedings of the 8th International Conference on Learning Representations, 2020

Multiplicative Interactions and Where to Find Them.
Proceedings of the 8th International Conference on Learning Representations, 2020

Meta-Learning Deep Energy-Based Memory Models.
Proceedings of the 8th International Conference on Learning Representations, 2020

Do Transformers Need Deep Long-Range Memory?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Compressive Transformers for Long-Range Sequence Modelling.
CoRR, 2019

Training Language GANs from Scratch.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Meta-Learning Neural Bloom Filters.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Unsupervised Predictive Memory in a Goal-Directed Agent.
CoRR, 2018

Neural Arithmetic Logic Units.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Relational recurrent neural networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Fast Parametric Learning with Activation Memorization.
Proceedings of the 35th International Conference on Machine Learning, 2018

Memory-based Parameter Adaptation.
Proceedings of the 6th International Conference on Learning Representations, 2018

2016
Model-Free Episodic Control.
CoRR, 2016

Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

