Jan Ludziejewski

According to our database, Jan Ludziejewski authored at least 10 papers between 2020 and 2025.

Bibliography

2025
μ-Parametrization for Mixture of Experts.
CoRR, August, 2025

Decoupled Relative Learning Rate Schedules.
CoRR, July, 2025

Projected Compression: Trainable Projection for Efficient Transformer Compression.
CoRR, June, 2025

Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts.
CoRR, 2024

Mixture of Tokens: Continuous MoE through Cross-Example Aggregation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Scaling Laws for Fine-Grained Mixture of Experts.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
BrightBox - A rough set based technology for diagnosing mistakes of machine learning models.
Appl. Soft Comput., July, 2023

Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation.
CoRR, 2023

2020
Integrated Human Tracking Based on Video and Smartphone Signal Processing within the Arahub System.
Proceedings of the 2020 Federated Conference on Computer Science and Information Systems, 2020

