We stand with Ukraine

We stand with Ukraine

Yannis Flet-Berliac

According to our database¹, Yannis Flet-Berliac authored at least 22 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on ynns.io
on scholar.google.com

On csauthors.net:

Bibliography

2025

ShiQ: Bringing back Bellman to LLMs.

[DOI]

,

Nathan Grinsztajn

,

Raphaël Avalos

,

Yannis Flet-Berliac

,

,

Omar Darwiche Domingues

,

Eugene Tarassov

,

Olivier Pietquin

,

Pierre H. Richemond

,

,

CoRR, May, 2025

ShiQ: Bringing back Bellman to LLMs.

[DOI]

,

Nathan Grinsztajn

,

Raphaël Avalos

,

Yannis Flet-Berliac

,

,

Omar Darwiche Domingues

,

Olivier Pietquin

,

Pierre H. Richemond

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier.

[DOI]

CoRR, 2024

Averaging log-likelihoods in direct alignment.

[DOI]

Nathan Grinsztajn

,

Yannis Flet-Berliac

,

Mohammad Gheshlaghi Azar

,

,

,

,

,

,

,

Olivier Pietquin

,

CoRR, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.

[DOI]

Yannis Flet-Berliac

,

Nathan Grinsztajn

,

,

,

,

,

,

Mohammad Gheshlaghi Azar

,

Olivier Pietquin

,

CoRR, 2024

PASTA: Pretrained Action-State Transformer Agents.

[DOI]

,

Yannis Flet-Berliac

,

Lars C. P. M. Quaedvlieg

,

Arthur Flajolet

,

Guillaume Richard

,

RLJ, 2024

OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators.

[DOI]

,

,

Christina J. Yuan

,

Anirudhan Badrinath

,

Yannis Flet-Berliac

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.

[DOI]

Yannis Flet-Berliac

,

Nathan Grinsztajn

,

,

,

,

,

,

,

Mohammad Gheshlaghi Azar

,

Olivier Pietquin

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

PASTA: Pretrained Action-State Transformer Agents.

[DOI]

,

Yannis Flet-Berliac

,

Arthur Flajolet

,

Guillaume Richard

,

CoRR, 2023

Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets.

[DOI]

Anirudhan Badrinath

,

Yannis Flet-Berliac

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Model-Based Offline Reinforcement Learning with Local Misspecification.

[DOI]

,

Yannis Flet-Berliac

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics.

[DOI]

Yannis Flet-Berliac

,

CoRR, 2022

Offline policy optimization with eligible actions.

[DOI]

,

Yannis Flet-Berliac

,

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data.

[DOI]

,

Yannis Flet-Berliac

,

,

William Steenbergen

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Sample-Efficient Deep Reinforcement Learning for Control, Exploration and Safety. (Apprentissage par renforcement profond éfficace pour le contrôle, l'exploration et la sûreté).

[DOI]

Yannis Flet-Berliac

PhD thesis, 2021

Learning Value Functions in Deep Policy Gradients using Residual Variance.

[DOI]

Yannis Flet-Berliac

,

,

Odalric-Ambrym Maillard

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Adversarially Guided Actor-Critic.

[DOI]

Yannis Flet-Berliac

,

,

Olivier Pietquin

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients.

[DOI]

Yannis Flet-Berliac

,

,

Odalric-Ambrym Maillard

,

CoRR, 2020

Only Relevant Information Matters: Filtering Out Noisy Samples To Boost RL.

[DOI]

Yannis Flet-Berliac

,

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019

High-Dimensional Control Using Generalized Auxiliary Tasks.

[DOI]

Yannis Flet-Berliac

,

CoRR, 2019

Samples are not all useful: Denoising policy gradient updates using variance.

[DOI]

Yannis Flet-Berliac

,

CoRR, 2019

2017

Hearables in Hearing Care: Discovering Usage Patterns Through IoT Devices.

[DOI]

Benjamin Johansen

,

Yannis Paul Raymond Flet-Berliac

,

Maciej Jan Korzepa

,

,

Niels Henrik Pontoppidan

,

Michael Kai Petersen

,

Jakob Eg Larsen

Proceedings of the Universal Access in Human-Computer Interaction. Human and Technological Environments, 2017

Loading...