Thomas Mesnard

According to our database1, Thomas Mesnard authored at least 23 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning.
Trans. Mach. Learn. Res., 2024

Gemma 2: Improving Open Language Models at a Practical Size.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.
CoRR, 2024

Gemma: Open Models Based on Gemini Research and Technology.
CoRR, 2024

Direct Language Model Alignment from Online AI Feedback.
CoRR, 2024


RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Credit Assignment in Deep Reinforcement Learning. (Attribution de crédit pour l'apprentissage par renforcement dans des réseaux profonds).
PhD thesis, 2023

Nash Learning from Human Feedback.
CoRR, 2023

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.
CoRR, 2023

Quantile Credit Assignment.
Proceedings of the International Conference on Machine Learning, 2023

Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments.
Proceedings of the International Conference on Machine Learning, 2023

2022
Curiosity in hindsight.
CoRR, 2022

2021
Geometric Entropic Exploration.
CoRR, 2021

Counterfactual Credit Assignment in Model-Free Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Counterfactual Credit Assignment in Model-Free Reinforcement Learning.
CoRR, 2020

2019
Ghost Units Yield Biologically Plausible Backprop in Deep Neural Networks.
CoRR, 2019

Hindsight Credit Assignment.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Generalization of Equilibrium Propagation to Vector Field Dynamics.
CoRR, 2018

Extending the Framework of Equilibrium Propagation to General Dynamics.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
STDP-Compatible Approximation of Backpropagation in an Energy-Based Model.
Neural Comput., 2017

2016
Towards deep learning with spiking neurons in energy based models with contrastive Hebbian plasticity.
CoRR, 2016

2015
An objective function for STDP.
CoRR, 2015


  Loading...