Romain Laroche

According to our database1, Romain Laroche authored at least 78 papers between 2009 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Massive multi-player multi-armed bandits for IoT networks: An application on LoRa networks.
Ad Hoc Networks, December, 2023

Combining Spatial and Temporal Abstraction in Planning for Better Generalization.
CoRR, 2023

Think Before You Act: Decision Transformers with Internal Working Memory.
CoRR, 2023

Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Convergence of SARSA with Linear Function Approximation.
Proceedings of the International Conference on Machine Learning, 2023

On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs.
Proceedings of the International Conference on Machine Learning, 2023

Behavior Prior Representation learning for Offline Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

One-Shot Learning from a Demonstration with Hierarchical Latent Language.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch.
J. Mach. Learn. Res., 2022

Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning.
CoRR, 2022

Emergence of Shared Sensory-motor Graphical Language from Visual Input.
CoRR, 2022

Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning.
CoRR, 2022

Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning.
CoRR, 2022

Non-Markovian policies occupancy measures.
CoRR, 2022

On the Chattering of SARSA with Linear Function Approximation.
CoRR, 2022

Discrete Compositional Representations as an Abstraction for Goal Conditioned Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

When does return-conditioned supervised learning work for offline reinforcement learning?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Batched Bandits with Crowd Externalities.
CoRR, 2021

Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates.
CoRR, 2021

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dr Jekyll & Mr Hyde: the strange case of off-policy policy updates.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The Emergence of the Shape Bias Results from Communicative Efficiency.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

2020
Learning Dynamic Knowledge Graphs to Generalize on Text-Based Games.
CoRR, 2020

Learning Dynamic Belief Graphs to Generalize on Text-Based Games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning Framework for Deep Brain Stimulation Study.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Safe Policy Improvement with an Estimated Baseline Policy.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Building Dynamic Knowledge Graphs from Text-based Games.
CoRR, 2019

Scaling up budgeted reinforcement learning.
CoRR, 2019

Safe Policy Improvement with Soft Baseline Bootstrapping.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Budgeted Reinforcement Learning in Continuous State Space.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Safe Policy Improvement with Baseline Bootstrapping.
Proceedings of the 36th International Conference on Machine Learning, 2019

Decentralized Exploration in Multi-Armed Bandits.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
A methodology for turn-taking capabilities enhancement in Spoken Dialogue Systems using Reinforcement Learning.
Comput. Speech Lang., 2018

Counting to Explore and Generalize in Text-based Games.
CoRR, 2018

In reinforcement learning, all objective functions are not equal.
Proceedings of the 6th International Conference on Learning Representations, 2018

Reinforcement Learning Algorithm Selection.
Proceedings of the 6th International Conference on Learning Representations, 2018

Training Dialogue Systems With Human Advice.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

On Value Function Representation of Long Horizon Problems.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Safe Policy Improvement with Baseline Bootstrapping.
CoRR, 2017

Multi-Advisor Reinforcement Learning.
CoRR, 2017

Algorithm selection of off-policy reinforcement learning algorithm.
CoRR, 2017

The Complex Negotiation Dialogue Game.
CoRR, 2017

Hybrid Reward Architecture for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Transfer Reinforcement Learning with Shared Dynamics.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Improving Scalability of Reinforcement Learning by Separation of Concerns.
CoRR, 2016

Towards a virtual personal assistant based on a user-defined portfolio of multi-domain vocal applications.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Learning dialogue dynamics with the method of moments.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The Negotiation Dialogue Game.
Proceedings of the Dialogues with Social Robots, 2016

Incremental Human-Machine Dialogue Simulation.
Proceedings of the Dialogues with Social Robots, 2016

Compact and Interpretable Dialogue State Representation with Genetic Sparse Distributed Memory.
Proceedings of the Dialogues with Social Robots, 2016

A Stochastic Model for Computer-Aided Human-Human Dialogue.
Proceedings of the Interspeech 2016, 2016

Reinforcement Learning for Turn-Taking Management in Incremental Spoken Dialogue Systems.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Transfer Learning for User Adaptation in Spoken Dialogue Systems.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Score-based Inverse Reinforcement Learning.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
Optimising Turn-Taking Strategies With Reinforcement Learning.
Proceedings of the SIGDIAL 2015 Conference, 2015

Human-Machine Dialogue as a Stochastic Game.
Proceedings of the SIGDIAL 2015 Conference, 2015

Content finder AssistanT.
Proceedings of the 18th International Conference on Intelligence in Next Generation Networks, 2015

Dialogue Efficiency Evaluation of Turn-Taking Phenomena in a Multi-layer Incremental Simulated Environment.
Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

Turn-taking phenomena in incremental dialogue systems.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
CFAsT: Content-Finder AssistanT [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

DictaNum: a dialogue system for numbers dictation (DictaNum : système de dialogue incrémental pour la dictée de numéros.) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

A simple approach to make dialogue systems incremental (Vers une approche simplifiée pour introduire le caractère incrémental dans les systèmes de dialogue) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

Enia : A customizable multi-domain assistant (Un assistant vocal personnalisable) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

An easy method to make dialogue systems incremental.
Proceedings of the SIGDIAL 2014 Conference, 2014

DINASTI: Dialogues with a Negotiating Appointment Setting Interface.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

NASTIA: Negotiating Appointment Setting Interface.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Contextual Bandit for Active Learning: Active Thompson Sampling.
Proceedings of the Neural Information Processing - 21st International Conference, 2014

Ordinal regression for interaction quality prediction.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Reward Shaping for Statistical Optimisation of Dialogue Management.
Proceedings of the Statistical Language and Speech Processing, 2013

Will my Spoken Dialogue System be a Slow Learner ?
Proceedings of the SIGDIAL 2013 Conference, 2013

2012
Reward Function Learning for Dialogue Management.
Proceedings of the STAIRS 2012, 2012

2010
Optimising a handcrafted dialogue system design.
Proceedings of the INTERSPEECH 2010, 2010

Enhanced monitoring tools and online dialogue optimisation merged into a new spoken dialogue system design experience.
Proceedings of the INTERSPEECH 2010, 2010

2009
Hybridisation of expertise and reinforcement learning in dialogue systems.
Proceedings of the INTERSPEECH 2009, 2009


  Loading...