Romain Laroche

According to our database¹, Romain Laroche authored at least 83 papers between 2009 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2025

Learning Fused State Representations for Control from Multi-View Observations.

[DOI]

,

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Rejecting Hallucinated State Targets during Planning.

[DOI]

,

Tristan Sylvain

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Think Before You Act: Decision Transformers with Working Memory.

[DOI]

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning.

[DOI]

,

,

Harm van Seijen

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Massive multi-player multi-armed bandits for IoT networks: An application on LoRa networks.

[DOI]

,

Raphaël Féraud

,

Nadège Varsier

,

Patrick Maillé

,

Ad Hoc Networks, December, 2023

Using Representation Expressiveness and Learnability to Evaluate Self-Supervised Learning Methods.

[DOI]

,

,

Aristide Baratin

,

,

Aaron C. Courville

,

Alessandro Sordoni

Trans. Mach. Learn. Res., 2023

Combining Spatial and Temporal Abstraction in Planning for Better Generalization.

[DOI]

,

,

Harm van Seijen

,

,

,

CoRR, 2023

Think Before You Act: Decision Transformers with Internal Working Memory.

[DOI]

,

,

,

,

,

CoRR, 2023

Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning.

[DOI]

,

,

,

,

,

,

Remi Tachet des Combes

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets.

[DOI]

,

,

,

Abhishek Bhandwaldar

,

Akash Srivastava

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Convergence of SARSA with Linear Function Approximation.

[DOI]

Shangtong Zhang

,

Remi Tachet des Combes

,

Proceedings of the International Conference on Machine Learning, 2023

On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs.

[DOI]

,

Remi Tachet des Combes

Proceedings of the International Conference on Machine Learning, 2023

Behavior Prior Representation learning for Offline Reinforcement Learning.

[DOI]

,

,

,

,

,

Remi Tachet des Combes

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting.

[DOI]

,

,

Remi Tachet des Combes

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

One-Shot Learning from a Demonstration with Hierarchical Latent Language.

[DOI]

,

,

Marc-Alexandre Côté

,

Matthew J. Hausknecht

,

,

,

Harm van Seijen

,

Benjamin Van Durme

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch.

[DOI]

Shangtong Zhang

,

Remi Tachet des Combes

,

J. Mach. Learn. Res., 2022

Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning.

[DOI]

,

,

,

,

Kenji Kawaguchi

,

,

,

,

Remi Tachet des Combes

CoRR, 2022

Emergence of Shared Sensory-motor Graphical Language from Visual Input.

[DOI]

,

,

,

Clément Moulin-Frier

,

Pierre-Yves Oudeyer

CoRR, 2022

Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning.

[DOI]

,

,

Aristide Baratin

,

,

Aaron C. Courville

,

Alessandro Sordoni

CoRR, 2022

Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning.

[DOI]

David Brandfonbrener

,

Remi Tachet des Combes

,

CoRR, 2022

Non-Markovian policies occupancy measures.

[DOI]

,

Remi Tachet des Combes

,

CoRR, 2022

On the Chattering of SARSA with Linear Function Approximation.

[DOI]

Shangtong Zhang

,

Remi Tachet des Combes

,

CoRR, 2022

Discrete Compositional Representations as an Abstraction for Goal Conditioned Reinforcement Learning.

[DOI]

,

,

,

,

Kenji Kawaguchi

,

,

,

,

Remi Tachet des Combes

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

When does return-conditioned supervised learning work for offline reinforcement learning?

[DOI]

David Brandfonbrener

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms.

[DOI]

Shangtong Zhang

,

,

Harm van Seijen

,

Shimon Whiteson

,

Remi Tachet des Combes

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms.

[DOI]

,

Remi Tachet des Combes

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Batched Bandits with Crowd Externalities.

[DOI]

,

Othmane Safsafi

,

Raphaël Féraud

,

Nicolas Broutin

CoRR, 2021

Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates.

[DOI]

,

Remi Tachet des Combes

CoRR, 2021

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs.

[DOI]

,

Philip S. Thomas

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dr Jekyll & Mr Hyde: the strange case of off-policy policy updates.

[DOI]

,

Remi Tachet des Combes

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The Emergence of the Shape Bias Results from Communicative Efficiency.

[DOI]

,

Michael C. Frank

,

,

Alessandro Sordoni

,

Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

2020

Learning Dynamic Knowledge Graphs to Generalize on Text-Based Games.

[DOI]

Ashutosh Adhikari

,

,

Marc-Alexandre Côté

,

Mikulas Zelinka

,

Marc-Antoine Rondeau

,

,

,

,

,

William L. Hamilton

CoRR, 2020

Learning Dynamic Belief Graphs to Generalize on Text-Based Games.

[DOI]

Ashutosh Adhikari

,

,

Marc-Alexandre Côté

,

Mikulas Zelinka

,

Marc-Antoine Rondeau

,

,

,

,

,

William L. Hamilton

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning Framework for Deep Brain Stimulation Study.

[DOI]

,

Remi Tachet des Combes

,

,

Michael Rosenblum

,

Dmitry V. Dylov

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Safe Policy Improvement with an Estimated Baseline Policy.

[DOI]

Thiago D. Simão

,

,

Rémi Tachet des Combes

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Building Dynamic Knowledge Graphs from Text-based Games.

[DOI]

Mikulas Zelinka

,

,

Marc-Alexandre Côté

,

,

CoRR, 2019

Scaling up budgeted reinforcement learning.

[DOI]

Nicolas Carrara

,

Edouard Leurent

,

,

,

Odalric-Ambrym Maillard

,

Olivier Pietquin

CoRR, 2019

Safe Policy Improvement with Soft Baseline Bootstrapping.

[DOI]

,

,

Rémi Tachet des Combes

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Budgeted Reinforcement Learning in Continuous State Space.

[DOI]

Nicolas Carrara

,

Edouard Leurent

,

,

,

Odalric-Ambrym Maillard

,

Olivier Pietquin

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Safe Policy Improvement with Baseline Bootstrapping.

[DOI]

,

Paul Trichelair

,

Remi Tachet des Combes

Proceedings of the 36th International Conference on Machine Learning, 2019

Decentralized Exploration in Multi-Armed Bandits.

[DOI]

Raphaël Féraud

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

A methodology for turn-taking capabilities enhancement in Spoken Dialogue Systems using Reinforcement Learning.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Comput. Speech Lang., 2018

Counting to Explore and Generalize in Text-based Games.

[DOI]

,

Marc-Alexandre Côté

,

Alessandro Sordoni

,

,

Remi Tachet des Combes

,

Matthew J. Hausknecht

,

CoRR, 2018

In reinforcement learning, all objective functions are not equal.

[DOI]

,

Harm van Seijen

Proceedings of the 6th International Conference on Learning Representations, 2018

Reinforcement Learning Algorithm Selection.

[DOI]

,

Raphaël Féraud

Proceedings of the 6th International Conference on Learning Representations, 2018

Training Dialogue Systems With Human Advice.

[DOI]

,

,

Olivier Pietquin

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

On Value Function Representation of Long Horizon Problems.

[DOI]

,

,

Harm van Seijen

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Safe Policy Improvement with Baseline Bootstrapping.

[DOI]

,

Paul Trichelair

CoRR, 2017

Multi-Advisor Reinforcement Learning.

[DOI]

,

,

,

Harm van Seijen

CoRR, 2017

Algorithm selection of off-policy reinforcement learning algorithm.

[DOI]

,

Raphaël Féraud

CoRR, 2017

The Complex Negotiation Dialogue Game.

[DOI]

CoRR, 2017

Hybrid Reward Architecture for Reinforcement Learning.

[DOI]

Harm van Seijen

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Transfer Reinforcement Learning with Shared Dynamics.

[DOI]

,

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Improving Scalability of Reinforcement Learning by Separation of Concerns.

[DOI]

Harm van Seijen

,

,

,

CoRR, 2016

Towards a virtual personal assistant based on a user-defined portfolio of multi-domain vocal applications.

[DOI]

Tatiana Ekeinhor-Komi

,

Jean Léon Bouraoui

,

,

Fabrice Lefèvre

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Learning dialogue dynamics with the method of moments.

[DOI]

,

,

Olivier Pietquin

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The Negotiation Dialogue Game.

[DOI]

,

Proceedings of the Dialogues with Social Robots, 2016

Incremental Human-Machine Dialogue Simulation.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the Dialogues with Social Robots, 2016

Compact and Interpretable Dialogue State Representation with Genetic Sparse Distributed Memory.

[DOI]

,

,

Olivier Pietquin

Proceedings of the Dialogues with Social Robots, 2016

A Stochastic Model for Computer-Aided Human-Human Dialogue.

[DOI]

,

,

Olivier Pietquin

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Reinforcement Learning for Turn-Taking Management in Incremental Spoken Dialogue Systems.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Transfer Learning for User Adaptation in Spoken Dialogue Systems.

[DOI]

,

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Score-based Inverse Reinforcement Learning.

[DOI]

,

,

,

,

Olivier Pietquin

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015

Optimising Turn-Taking Strategies With Reinforcement Learning.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the SIGDIAL 2015 Conference, 2015

Human-Machine Dialogue as a Stochastic Game.

[DOI]

,

Julien Pérolat

,

,

Olivier Pietquin

Proceedings of the SIGDIAL 2015 Conference, 2015

Content finder AssistanT.

[DOI]

Proceedings of the 18th International Conference on Intelligence in Next Generation Networks, 2015

Dialogue Efficiency Evaluation of Turn-Taking Phenomena in a Multi-layer Incremental Simulated Environment.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

Turn-taking phenomena in incremental dialogue systems.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014

CFAsT: Content-Finder AssistanT [in French].

[DOI]

Proceedings of the Traitement Automatique des Langues Naturelles, 2014

DictaNum: a dialogue system for numbers dictation (DictaNum : système de dialogue incrémental pour la dictée de numéros.) [in French].

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the Traitement Automatique des Langues Naturelles, 2014

A simple approach to make dialogue systems incremental (Vers une approche simplifiée pour introduire le caractère incrémental dans les systèmes de dialogue) [in French].

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the Traitement Automatique des Langues Naturelles, 2014

Enia : A customizable multi-domain assistant (Un assistant vocal personnalisable) [in French].

[DOI]

Tatiana Ekeinhor-Komi

,

,

Christine Chardenon

,

,

Fabrice Lefèvre

Proceedings of the Traitement Automatique des Langues Naturelles, 2014

An easy method to make dialogue systems incremental.

[DOI]

Hatim Khouzaimi

,

,

Fabrice Lefèvre

Proceedings of the SIGDIAL 2014 Conference, 2014

DINASTI: Dialogues with a Negotiating Appointment Setting Interface.

[DOI]

,

,

Olivier Pietquin

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

NASTIA: Negotiating Appointment Setting Interface.

[DOI]

,

Rémi Lemonnier

,

,

Olivier Pietquin

,

Hatim Khouzaimi

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Contextual Bandit for Active Learning: Active Thompson Sampling.

[DOI]

Djallel Bouneffouf

,

,

,

Raphaël Féraud

,

Robin Allesiardo

Proceedings of the Neural Information Processing - 21st International Conference, 2014

Ordinal regression for interaction quality prediction.

[DOI]

,

Hatim Khouzaimi

,

,

Olivier Pietquin

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Reward Shaping for Statistical Optimisation of Dialogue Management.

[DOI]

,

,

Olivier Pietquin

Proceedings of the Statistical Language and Speech Processing, 2013

Will my Spoken Dialogue System be a Slow Learner ?

[DOI]

,

Proceedings of the SIGDIAL 2013 Conference, 2013

2012

Reward Function Learning for Dialogue Management.

[DOI]

,

,

Olivier Pietquin

Proceedings of the STAIRS 2012, 2012

2010

Enhanced Monitoring Tools and Online Dialogue Optimisation Merged into a New Spoken Dialogue System Design Experience.

[DOI]

Ghislain Putois

,

,

Philippe Bretier

Proceedings of the SIGDIAL 2010 Conference, 2010

Optimising a handcrafted dialogue system design.

[DOI]

,

Ghislain Putois

,

Philippe Bretier

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Hybridisation of expertise and reinforcement learning in dialogue systems.

[DOI]

,

Ghislain Putois

,

Philippe Bretier

,

Bernadette Bouchon-Meunier

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
