Paolo Glorioso

According to our database1, Paolo Glorioso authored at least 7 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
The Unreasonable Ineffectiveness of the Deeper Layers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
The Zamba2 Suite: Technical Report.
CoRR, 2024

Zyda-2: a 5 Trillion Token High-Quality Dataset.
CoRR, 2024

Zyda: A 1.3T Dataset for Open Language Modeling.
CoRR, 2024

Zamba: A Compact 7B SSM Hybrid Model.
CoRR, 2024

BlackMamba: Mixture of Experts for State-Space Models.
CoRR, 2024

2022
Flatter, faster: scaling momentum for optimal speedup of SGD.
CoRR, 2022


  Loading...