Ethan Dyer

According to our database1, Ethan Dyer authored at least 19 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.
CoRR, 2024

2023
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
CoRR, 2023

PaLM 2 Technical Report.
CoRR, 2023

2022
WhichTF is functionally important in your open chromatin data?
PLoS Comput. Biol., 2022

Solving Quantitative Reasoning Problems with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Block-Recurrent Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring Length Generalization in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Effect of scale on catastrophic forgetting in neural networks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Explaining Neural Scaling Laws.
CoRR, 2021

Whitening and Second Order Optimization Both Make Information in the Dataset Unusable During Training, and Can Reduce or Prevent Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

When Do Curricula Work?
Proceedings of the 9th International Conference on Learning Representations, 2021

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics.
Proceedings of the 9th International Conference on Learning Representations, 2021

Tradeoffs in Data Augmentation: An Empirical Study.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Asymptotics of Wide Convolutional Neural Networks.
CoRR, 2020

Whitening and second order optimization both destroy information about the dataset, and can make generalization impossible.
CoRR, 2020

The large learning rate phase of deep learning: the catapult mechanism.
CoRR, 2020

Affinity and Diversity: Quantifying Mechanisms of Data Augmentation.
CoRR, 2020

Asymptotics of Wide Networks from Feynman Diagrams.
Proceedings of the 8th International Conference on Learning Representations, 2020

2018
Gradient Descent Happens in a Tiny Subspace.
CoRR, 2018


  Loading...