Benjamin Thérien

According to our database1, Benjamin Thérien authored at least 20 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
PyLO: Towards Accessible Learned Optimizers in PyTorch.
CoRR, June, 2025

MuLoCo: Muon is a practical inner optimizer for DiLoCo.
CoRR, May, 2025

Dense Backpropagation Improves Training for Sparse Mixture-of-Experts.
CoRR, April, 2025

Continual Pre-training of MoEs: How robust is your router?
CoRR, March, 2025

Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training.
CoRR, March, 2025

Meta-learning Optimizers for Communication-Efficient Learning.
Trans. Mach. Learn. Res., 2025

2024
Simple and Scalable Strategies to Continually Pre-train Large Language Models.
Trans. Mach. Learn. Res., 2024

μLO: Compute-Efficient Meta-Generalization of Learned Optimizers.
CoRR, 2024

Object Re-Identification from Point Clouds.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

StructMoE: Structured Mixture of Experts Using Low Rank Experts.
Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts.
Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

2023
Can We Learn Communication-Efficient Optimizers?
CoRR, 2023

Continual Pre-Training of Large Language Models: How to (re)warm your model?
CoRR, 2023

Towards Object Re-Identification from Point Clouds for 3D MOT.
CoRR, 2023

2022
A Closer Look at Robustness to L-infinity and Spatial Perturbations and their Composition.
CoRR, 2022

Interpretable Deep Tracking.
CoRR, 2022

Out-of-Distribution Detection for LiDAR-based 3D Object Detection.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

Parametric Scattering Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Parametric Scattering Networks.
CoRR, 2021

CLaC-BP at SemEval-2021 Task 8: SciBERT Plus Rules for MeasEval.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021


  Loading...