Samy Jelassi

According to our database, Samy Jelassi authored at least 20 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models.
CoRR, 2024

Repeat After Me: Transformers are Better than State Space Models at Copying.
CoRR, 2024

2023
Length Generalization in Arithmetic Transformers.
CoRR, 2023

Depth Dependence of μP Learning Rates in ReLU MLPs.
CoRR, 2023

2022
Depth separation beyond radial functions.
J. Mach. Learn. Res., 2022

A Momentumized, Adaptive, Dual Averaged Gradient Method.
J. Mach. Learn. Res., 2022

Dissecting adaptive methods in GANs.
CoRR, 2022

Vision Transformers provably learn spatial structure.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards understanding how momentum improves generalization in deep learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Adaptivity without Compromise: A Momentumized, Adaptive, Dual Averaged Gradient Method for Stochastic Optimization.
CoRR, 2021

Auction Learning as a Two-Player Game.
Proceedings of the 9th International Conference on Learning Representations, 2021

A Permutation-Equivariant Neural Network Architecture For Auction Design.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Dual Averaging is Surprisingly Effective for Deep Learning Optimization.
CoRR, 2020

A mean-field analysis of two-player zero-sum games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Extra-gradient with player sampling for faster convergence in n-player games.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Extra-gradient with player sampling for provable fast convergence in n-player games.
CoRR, 2019

Global convergence of neuron birth-death dynamics.
CoRR, 2019

Towards closing the gap between the theory and practice of SVRG.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Neuron birth-death dynamics accelerates gradient descent and converges asymptotically.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Smoothed analysis of the low-rank approach for smooth semidefinite programs.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
