Lorenzo Noci

According to our database, Lorenzo Noci authored at least 14 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning.
CoRR, 2024

How Good is a Single Basin?
CoRR, 2024

2023
Disentangling Linear Mode-Connectivity.
CoRR, 2023

Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit.
CoRR, 2023

The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit.
CoRR, 2023

The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Curious Case of Benign Memorization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

How Tempering Fixes Data Augmentation in Bayesian Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

2021
Disentangling the Roles of Curation, Data-Augmentation and the Prior in the Cold Posterior Effect.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Precise characterization of the prior predictive distribution of deep ReLU networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Adversarial Learning for Debiasing Knowledge Graph Embeddings.
CoRR, 2020

