Thomas Mesnard

According to our database¹, Thomas Mesnard authored at least 24 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

VaultGemma: A Differentially Private Gemma Model.

[BibT_eX]

[DOI]

CoRR, October, 2025

EmbeddingGemma: Powerful and Lightweight Text Representations.

[BibT_eX]

[DOI]

Henrique Schechter Vera

Sindhu Raghuram Panyam

Divyashree Sreepathihalli

Gustavo Hernández Ábrego

Sai Meher Karthik Duddu

Mojtaba Seyedhosseini

CoRR, September, 2025

2024

A Survey of Temporal Credit Assignment in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

PaliGemma 2: A Family of Versatile VLMs for Transfer.

[BibT_eX]

[DOI]

Ibrahim Alabdulmohsin

Lucas Beyer

Xiaohua Zhai

CoRR, 2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

[BibT_eX]

[DOI]

George-Cristian Muraru

CoRR, 2024

Direct Language Model Alignment from Online AI Feedback.

[BibT_eX]

[DOI]

CoRR, 2024

Nash Learning from Human Feedback.

[BibT_eX]

[DOI]

Rémi Munos

Michal Valko

Daniele Calandriello

Mohammad Gheshlaghi Azar

Proceedings of the Forty-first International Conference on Machine Learning, 2024

RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Credit Assignment in Deep Reinforcement Learning. (Attribution de crédit pour l'apprentissage par renforcement dans des réseaux profonds).

[BibT_eX]

[DOI]

Thomas Mesnard

PhD thesis, 2023

Nash Learning from Human Feedback.

[BibT_eX]

[DOI]

Rémi Munos

Michal Valko

Daniele Calandriello

Mohammad Gheshlaghi Azar

CoRR, 2023

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.

[BibT_eX]

[DOI]

CoRR, 2023

Quantile Credit Assignment.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

Curiosity in hindsight.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Geometric Entropic Exploration.

[BibT_eX]

[DOI]

Zhaohan Daniel Guo

Mohammad Gheshlaghi Azar

CoRR, 2021

Counterfactual Credit Assignment in Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Counterfactual Credit Assignment in Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Ghost Units Yield Biologically Plausible Backprop in Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Hindsight Credit Assignment.

[BibT_eX]

[DOI]

Anna Harutyunyan

Will Dabney

Thomas Mesnard

Mohammad Gheshlaghi Azar

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

Generalization of Equilibrium Propagation to Vector Field Dynamics.

[BibT_eX]

[DOI]

CoRR, 2018

Extending the Framework of Equilibrium Propagation to General Dynamics.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

STDP-Compatible Approximation of Backpropagation in an Energy-Based Model.

[BibT_eX]

[DOI]

Neural Comput., 2017

2016

Towards deep learning with spiking neurons in energy based models with contrastive Hebbian plasticity.

[BibT_eX]

[DOI]

Thomas Mesnard

Wulfram Gerstner

Johanni Brea

CoRR, 2016

2015

An objective function for STDP.

[BibT_eX]

[DOI]

CoRR, 2015

Thomas Mesnard

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...