Pierre Clavier

According to our database¹, Pierre Clavier authored at least 12 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

ShiQ: Bringing back Bellman to LLMs.

[BibT_eX]

[DOI]

Omar Darwiche Domingues

CoRR, May, 2025

Command A: An Enterprise-Ready Large Language Model.

[BibT_eX]

[DOI]

Arkady Arkhangorodsky

Walter Beller-Morales

Giannis Chatziveroglou

Omar Darwiche Domingues

Mohammad Gheshlaghi Azar

Ellen Gilsenan-McMahon

Seraphina Goldfarb-Tarrant

Tomas Goldsack

Aidan N. Gomez

Victor Machado Gonzaga

CoRR, April, 2025

2024

Robust Reinforcement Learning: Theory and Practice. (Apprentissage par renforcement robuste: théorie et pratique).

[BibT_eX]

[DOI]

Pierre Clavier

PhD thesis, 2024

RRLS : Robust Reinforcement Learning Suite.

[BibT_eX]

[DOI]

CoRR, 2024

Bootstrapping Expectiles in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Minimax Optimality of Model-based Robust Reinforcement Learning.

[BibT_eX]

[DOI]

Pierre Clavier

Erwan Le Pennec

Matthieu Geist

Proceedings of the Uncertainty in Artificial Intelligence, 2024

Time-Constrained Robust MDPs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VITS : Variational Inference Thompson Sampling for contextual bandits.

[BibT_eX]

[DOI]

Pierre Clavier

Tom Huix

Alain Oliviero Durmus

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

VITS : Variational Inference Thomson Sampling for contextual bandits.

[BibT_eX]

[DOI]

Pierre Clavier

Tom Huix

Alain Durmus

CoRR, 2023

2022

Robust Reinforcement Learning with Distributional Risk-averse formulation.

[BibT_eX]

[DOI]

Pierre Clavier

Stéphanie Allassonnière

Erwan Le Pennec

CoRR, 2022

2020

Gaussian Sum-Product Networks Learning in the Presence of Interval Censored Data.

[BibT_eX]

[DOI]

Pierre Clavier

Olivier Bouaziz

Grégory Nuel

Proceedings of the International Conference on Probabilistic Graphical Models, 2020

Pierre Clavier

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...