Etai Littwin

Orcid: 0009-0001-7396-4658

According to our database¹, Etai Littwin authored at least 36 papers between 2015 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Annotations Mitigate Post-Training Mode Collapse.

[BibT_eX]

[DOI]

Jacob Mitchell Springer

CoRR, May, 2026

Text-Conditional JEPA for Learning Semantically Rich Visual Representations.

[BibT_eX]

[DOI]

CoRR, May, 2026

2025

To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers.

[BibT_eX]

[DOI]

CoRR, September, 2025

UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization, 2025

Distillation Scaling Laws.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

The Slingshot Effect: A Late-Stage Optimization Anomaly in Adaptive Gradient Methods.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning.

[BibT_eX]

[DOI]

Etai Littwin

Vimal Thilak

Anand Gopalakrishnan

CoRR, 2024

How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

What Algorithms can Transformers Learn? A Study in Length Generalization.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Vanishing Gradients in Reinforcement Finetuning of Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

When can transformers reason with abstract symbols?

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Tight conditions for when the NTK approximation is valid.

[BibT_eX]

[DOI]

Enric Boix-Adserà

Etai Littwin

Trans. Mach. Learn. Res., 2023

Adaptivity and Modularity for Efficient Generalization Over Task Complexity.

[BibT_eX]

[DOI]

Miguel Ángel Bautista

CoRR, 2023

Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit.

[BibT_eX]

[DOI]

Greg Yang

Etai Littwin

CoRR, 2023

The NTK approximation is valid for longer than you think.

[BibT_eX]

[DOI]

Enric Boix-Adserà

Etai Littwin

CoRR, 2023

Transformers learn through gradual rank increase.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stabilizing Transformer Training by Preventing Attention Entropy Collapse.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Adaptive Optimization in the ∞-Width Limit.

[BibT_eX]

[DOI]

Etai Littwin

Greg Yang

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon.

[BibT_eX]

[DOI]

CoRR, 2022

Learning Representation from Neural Fisher Kernel with Low-rank Approximation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Implicit Greedy Rank Learning in Autoencoders via Overparameterized Linear Networks.

[BibT_eX]

[DOI]

CoRR, 2021

Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks.

[BibT_eX]

[DOI]

CoRR, 2021

On random kernels of residual architectures.

[BibT_eX]

[DOI]

Etai Littwin

Tomer Galanti

Lior Wolf

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Tensor Programs IIb: Architectural Universality Of Neural Tangent Kernel Training Dynamics.

[BibT_eX]

[DOI]

Greg Yang

Etai Littwin

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

On the Optimization Dynamics of Wide Hypernetworks.

[BibT_eX]

[DOI]

Etai Littwin

Tomer Galanti

Lior Wolf

CoRR, 2020

Residual Tangent Kernels.

[BibT_eX]

[DOI]

Etai Littwin

Lior Wolf

CoRR, 2020

On the Convex Behavior of Deep Neural Networks in Relation to the Layers' Width.

[BibT_eX]

[DOI]

Etai Littwin

Lior Wolf

CoRR, 2020

Collegial Ensembles.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On Infinite-Width Hypernetworks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2018

Regularizing by the Variance of the Activations' Sample-Variances.

[BibT_eX]

[DOI]

Etai Littwin

Lior Wolf

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2016

The Loss Surface of Residual Networks: Ensembles and the Role of Batch Normalization.

[BibT_eX]

[DOI]

Etai Littwin

Lior Wolf

CoRR, 2016

Complexity of multiverse networks and their multilayer generalization.

[BibT_eX]

[DOI]

Etai Littwin

Lior Wolf

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

The Multiverse Loss for Robust Transfer Learning.

[BibT_eX]

[DOI]

Etai Littwin

Lior Wolf

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Spherical embedding of inlier silhouette dissimilarities.

[BibT_eX]

[DOI]

Etai Littwin

Hadar Averbuch-Elor

Daniel Cohen-Or

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Etai Littwin

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...