Pierre Clavier

According to our database1, Pierre Clavier authored at least 12 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ShiQ: Bringing back Bellman to LLMs.
CoRR, May, 2025

Command A: An Enterprise-Ready Large Language Model.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, April, 2025

2024
Robust Reinforcement Learning: Theory and Practice. (Apprentissage par renforcement robuste: théorie et pratique).
PhD thesis, 2024

RRLS : Robust Reinforcement Learning Suite.
CoRR, 2024

Bootstrapping Expectiles in Reinforcement Learning.
CoRR, 2024

Towards Minimax Optimality of Model-based Robust Reinforcement Learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2024

Time-Constrained Robust MDPs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VITS : Variational Inference Thompson Sampling for contextual bandits.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
VITS : Variational Inference Thomson Sampling for contextual bandits.
CoRR, 2023

2022
Robust Reinforcement Learning with Distributional Risk-averse formulation.
CoRR, 2022

2020
Gaussian Sum-Product Networks Learning in the Presence of Interval Censored Data.
Proceedings of the International Conference on Probabilistic Graphical Models, 2020


  Loading...