Ethan Dyer

According to our database, Ethan Dyer authored at least 21 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.

Trans. Mach. Learn. Res., 2024

Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries.

CoRR, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.

Jean-Baptiste Alayrac

et al.

CoRR, 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giambattista Parascandolo

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

PaLM 2 Technical Report.

Kathy Meier-Hellstern

Gustavo Hernández Ábrego

Christopher A. Choquette-Choo

et al.

CoRR, 2023

2022

WhichTF is functionally important in your open chromatin data?

Yosuke Tanigawa

Ethan Dyer

Gill Bejerano

PLoS Comput. Biol., 2022

Solving Quantitative Reasoning Problems with Language Models.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Block-Recurrent Transformers.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring Length Generalization in Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Effect of scale on catastrophic forgetting in neural networks.

Vinay Venkatesh Ramasesh

Aitor Lewkowycz

Ethan Dyer

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Explaining Neural Scaling Laws.

CoRR, 2021

Whitening and Second Order Optimization Both Make Information in the Dataset Unusable During Training, and Can Reduce or Prevent Generalization.

Jascha Sohl-Dickstein

Proceedings of the 38th International Conference on Machine Learning, 2021

When Do Curricula Work?

Xiaoxia Wu

Ethan Dyer

Behnam Neyshabur

Proceedings of the 9th International Conference on Learning Representations, 2021

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics.

Vinay Venkatesh Ramasesh

Ethan Dyer

Maithra Raghu

Proceedings of the 9th International Conference on Learning Representations, 2021

Tradeoffs in Data Augmentation: An Empirical Study.

Raphael Gontijo Lopes

Sylvia J. Smullin

Ekin Dogus Cubuk

Ethan Dyer

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Asymptotics of Wide Convolutional Neural Networks.

Anders Andreassen

Ethan Dyer

CoRR, 2020

Whitening and second order optimization both destroy information about the dataset, and can make generalization impossible.

Jascha Sohl-Dickstein

CoRR, 2020

The large learning rate phase of deep learning: the catapult mechanism.

Aitor Lewkowycz

Yasaman Bahri

Ethan Dyer

Jascha Sohl-Dickstein

Guy Gur-Ari

CoRR, 2020

Affinity and Diversity: Quantifying Mechanisms of Data Augmentation.

Raphael Gontijo Lopes

Sylvia J. Smullin

Ekin D. Cubuk

Ethan Dyer

CoRR, 2020

Asymptotics of Wide Networks from Feynman Diagrams.

Ethan Dyer

Guy Gur-Ari

Proceedings of the 8th International Conference on Learning Representations, 2020

2018

Gradient Descent Happens in a Tiny Subspace.

Guy Gur-Ari

Daniel A. Roberts

Ethan Dyer

CoRR, 2018
