Pierre H. Richemond

According to our database¹, Pierre H. Richemond authored at least 21 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

ShiQ: Bringing back Bellman to LLMs.

[BibT_eX]

[DOI]

Omar Darwiche Domingues

CoRR, May, 2025

ShiQ: Bringing back Bellman to LLMs.

[BibT_eX]

[DOI]

Omar Darwiche Domingues

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024

Offline Regularised Reinforcement Learning for Large Language Models Alignment.

[BibT_eX]

[DOI]

Pierre Harvey Richemond

Yunhao Tang

Daniel Guo

Daniele Calandriello

Mohammad Gheshlaghi Azar

CoRR, 2024

Scaling Instructable Agents Across Many Simulated Worlds.

[BibT_eX]

[DOI]

Arne Olav Hallingstad

Kathryn Martin Cussons

Loic Matthey

Siobhan Mcloughlin

Piermaria Mendolicchio

Yanko Gitahy Oliveira

Pierre Harvey Richemond

CoRR, 2024

Generalized Preference Optimization: A Unified Approach to Offline Alignment.

[BibT_eX]

[DOI]

Pierre Harvey Richemond

Michal Valko

Bernardo Ávila Pires

Bilal Piot

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Human Alignment of Large Language Models through Online Preference Optimisation.

[BibT_eX]

[DOI]

Pierre Harvey Richemond

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Understanding Self-Predictive Learning for Reinforcement Learning.

[BibT_eX]

[DOI]

Yunhao Tang

Zhaohan Daniel Guo

Pierre Harvey Richemond

Mohammad Gheshlaghi Azar

Proceedings of the International Conference on Machine Learning, 2023

The Edge of Orthogonality: A Simple View of What Makes BYOL Tick.

[BibT_eX]

[DOI]

Pierre Harvey Richemond

Proceedings of the International Conference on Machine Learning, 2023

SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations.

[BibT_eX]

[DOI]

Matko Bosnjak

Pierre Harvey Richemond

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Continuous diffusion for categorical data.

[BibT_eX]

[DOI]

CoRR, 2022

Categorical SDEs with Simplex Diffusion.

[BibT_eX]

[DOI]

Pierre H. Richemond

Sander Dieleman

Arnaud Doucet

CoRR, 2022

Data Distributional Properties Drive Emergent In-Context Learning in Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Zipfian Environments for Reinforcement Learning.

[BibT_eX]

[DOI]

Stephanie C. Y. Chan

Andrew Kyle Lampinen

Pierre Harvey Richemond

Felix Hill

Proceedings of the Conference on Lifelong Learning Agents, 2022

2020

BYOL works even without batch statistics.

[BibT_eX]

[DOI]

CoRR, 2020

Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning.

[BibT_eX]

[DOI]

Mohammad Gheshlaghi Azar

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Biologically inspired architectures for sample-efficient deep reinforcement learning.

[BibT_eX]

[DOI]

Pierre H. Richemond

Arinbjörn Kolbeinsson

Yike Guo

CoRR, 2019

Static Activation Function Normalization.

[BibT_eX]

[DOI]

Pierre H. Richemond

Yike Guo

CoRR, 2019

Combining learning rate decay and weight decay with complexity gradient descent - Part I.

[BibT_eX]

[DOI]

Pierre H. Richemond

Yike Guo

CoRR, 2019

2017

A short variational proof of equivalence between policy gradients and soft Q learning.

[BibT_eX]

[DOI]

Pierre H. Richemond

Brendan Maginnis

CoRR, 2017

On Wasserstein Reinforcement Learning and the Fokker-Planck equation.

[BibT_eX]

[DOI]

Pierre H. Richemond

Brendan Maginnis

CoRR, 2017

Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit.

[BibT_eX]

[DOI]

Brendan Maginnis

Pierre H. Richemond

CoRR, 2017

Pierre H. Richemond

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...