Pierre H. Richemond

According to our database1, Pierre H. Richemond authored at least 17 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Human Alignment of Large Language Models through Online Preference Optimisation.
CoRR, 2024

Generalized Preference Optimization: A Unified Approach to Offline Alignment.
CoRR, 2024

2023
Understanding Self-Predictive Learning for Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

The Edge of Orthogonality: A Simple View of What Makes BYOL Tick.
Proceedings of the International Conference on Machine Learning, 2023

SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Continuous diffusion for categorical data.
CoRR, 2022

Categorical SDEs with Simplex Diffusion.
CoRR, 2022

Data Distributional Properties Drive Emergent In-Context Learning in Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Zipfian Environments for Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2022

2020
BYOL works even without batch statistics.
CoRR, 2020

Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Biologically inspired architectures for sample-efficient deep reinforcement learning.
CoRR, 2019

Static Activation Function Normalization.
CoRR, 2019

Combining learning rate decay and weight decay with complexity gradient descent - Part I.
CoRR, 2019

2017
A short variational proof of equivalence between policy gradients and soft Q learning.
CoRR, 2017

On Wasserstein Reinforcement Learning and the Fokker-Planck equation.
CoRR, 2017

Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit.
CoRR, 2017


  Loading...