Tiberiu Musat

According to our database1, Tiberiu Musat authored at least 4 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
The Geometry of Grokking: Norm Minimization on the Zero-Loss Manifold.
CoRR, November, 2025

On the Emergence of Induction Heads for In-Context Learning.
CoRR, November, 2025

Mechanism and Emergence of Stacked Attention Heads in Multi-Layer Transformers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Clustering and Alignment: Understanding the Training Dynamics in Modular Addition.
CoRR, 2024


  Loading...