Alexandru Meterez

According to our database1, Alexandru Meterez authored at least 6 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
A Simplified Analysis of SGD for Linear Regression with Weight Averaging.
CoRR, June, 2025

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining.
CoRR, April, 2025

The Optimization Landscape of SGD Across the Feature Learning Strength.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning.
CoRR, 2024

Super Consistency of Neural Network Landscapes and Learning Rate Transfer.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024


  Loading...