Maciej Stefaniak

According to our database, Maciej Stefaniak authored at least 4 papers in 2025.

Bibliography

2025
μ-Parametrization for Mixture of Experts.
CoRR, August, 2025

Decoupled Relative Learning Rate Schedules.
CoRR, July, 2025

Projected Compression: Trainable Projection for Efficient Transformer Compression.
CoRR, June, 2025

Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

