Olivier Sigaud

Orcid: 0000-0002-8544-0229

According to our database, Olivier Sigaud authored at least 128 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Single-Reset Divide & Conquer Imitation Learning.
CoRR, 2024

2023
Combining Evolution and Deep Reinforcement Learning for Policy Search: A Survey.
ACM Trans. Evol. Learn. Optim., September, 2023

Toward Teachable Autotelic Agents.
IEEE Trans. Cogn. Dev. Syst., September, 2023

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents.
CoRR, 2023

A Simple Open-Loop Baseline for Reinforcement Learning Locomotion Tasks.
CoRR, 2023

Utility-based Adaptive Teaching Strategies using Bayesian Theory of Mind.
CoRR, 2023

Enhancing Agent Communication and Learning through Action and Language.
CoRR, 2023

Human-Machine Co-Learning: Case Study on Motor Skill Acquisition.
Proceedings of the 34th Conference on l'Interaction Humain-Machine, 2023

Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

2022
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: A Short Survey.
J. Artif. Intell. Res., 2022

An extensive appraisal of weight-sharing on the NAS-Bench-101 benchmark.
Neurocomputing, 2022

Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration.
CoRR, 2022

Overcoming Referential Ambiguity in Language-Guided Goal-Conditioned Reinforcement Learning.
CoRR, 2022

Making Reinforcement Learning Work on Swimmer.
CoRR, 2022

Stein Variational Goal Generation For Reinforcement Learning in Hard Exploration Problems.
CoRR, 2022

Pedagogical Demonstrations and Pragmatic Learning in Artificial Tutor-Learner Interactions.
CoRR, 2022

Help Me Explore: Minimal Social Interventions for Graph-Based Autotelic Agents.
CoRR, 2022

Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Divide & Conquer Imitation Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Neural Architecture Search for Fracture Classification.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Diversity policy gradient for sample efficient quality-diversity optimization.
Proceedings of the GECCO '22: Genetic and Evolutionary Computation Conference, Boston, Massachusetts, USA, July 9, 2022

Learning Object-Centered Autotelic Behaviors with Graph Neural Networks.
Proceedings of the Conference on Lifelong Learning Agents, 2022

2021
CLIC: Curriculum Learning and Imitation for Object Control in Nonrewarding Environments.
IEEE Trans. Cogn. Dev. Syst., 2021

Towards Teachable Autonomous Agents.
CoRR, 2021

Grounding Language to Autonomously-Acquired Skills via Goal Generation.
Proceedings of the 9th International Conference on Learning Representations, 2021

First-Order and Second-Order Variants of the Gradient Descent in a Unified Framework.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search Algorithms.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

Teaching a Robot with Unlabeled Instructions: The TICS Architecture.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
Task Feasibility Maximization Using Model-Free Policy Search and Model-Based Whole-Body Control.
Frontiers Robotics AI, 2020

Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey.
CoRR, 2020

Offline Reinforcement Learning Hands-On.
CoRR, 2020

Learning Compositional Neural Programs for Continuous Control.
CoRR, 2020

QD-RL: Efficient Mixing of Quality and Diversity in Reinforcement Learning.
CoRR, 2020

DECSTR: Learning Goal-Directed Abstract Behaviors using Pre-Verbal Spatial Predicates in Intrinsically Motivated Agents.
CoRR, 2020

Language-Conditioned Goal Generation: a New Approach to Language Grounding for RL.
CoRR, 2020

DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics.
CoRR, 2020

To Share or Not To Share: A Comprehensive Appraisal of Weight-Sharing.
CoRR, 2020

Interactively shaping robot behaviour with unlabeled human instructions.
Auton. Agents Multi Agent Syst., 2020

TIRL: Enriching Actor-Critic RL with non-expert human teachers and a Trust Model.
Proceedings of the 29th IEEE International Conference on Robot and Human Interactive Communication, 2020

Understanding Failures of Deterministic Actor-Critic with Continuous Action Spaces and Sparse Rewards.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

PBCS: Efficient Exploration and Exploitation Using a Synergy Between Reinforcement Learning and Motion Planning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

2019
Policy search in continuous action domains: An overview.
Neural Networks, 2019

The problem with DDPG: understanding failures in deterministic environments with sparse rewards.
CoRR, 2019

Investigating Generalisation in Continuous Deep Reinforcement Learning.
CoRR, 2019

CLIC: Curriculum Learning and Imitation for feature Control in non-rewarding environments.
CoRR, 2019

Learning Compositional Neural Programs with Recursive Tree Search and Planning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

CEM-RL: Combining evolutionary and gradient-based methods for policy search.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms.
Proceedings of the Reproducibility in Machine Learning, 2019

2018
The CoDyCo Project Achievements and Beyond: Toward Human Aware Whole-Body Controllers for Physical Human Robot Interaction.
IEEE Robotics Autom. Lett., 2018

Identification of Invariant Sensorimotor Structures as a Prerequisite for the Discovery of Objects.
Frontiers Robotics AI, 2018

Open-Ended Learning: A Conceptual Framework Based on Representational Redescription.
Frontiers Neurorobotics, 2018

CURIOUS: Intrinsically Motivated Multi-Task, Multi-Goal Reinforcement Learning.
CoRR, 2018

Importance mixing: Improving sample reuse in evolutionary policy search methods.
CoRR, 2018

Accuracy-based Curriculum Learning in Deep Reinforcement Learning.
CoRR, 2018

How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments.
CoRR, 2018

GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms.
Proceedings of the Journées Francophone Planification, 2018

Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Tensor Based Knowledge Transfer Across Skill Categories for Robot Control.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Towards Deep Developmental Learning.
IEEE Trans. Cogn. Dev. Syst., 2016

Actor-critic versus direct policy search: a comparison based on sample complexity.
CoRR, 2016

Training a robot with evaluative feedback and unlabeled guidance signals.
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016

Efficient reinforcement learning for humanoid whole-body control.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

2015
Deep unsupervised network for multimodal perception, representation and classification.
Robotics Auton. Syst., 2015

Many regression algorithms, one unified model: A review.
Neural Networks, 2015

Gated networks: an inventory.
CoRR, 2015

Social-Task Learning for HRI.
Proceedings of the Social Robotics - 7th International Conference, 2015

Variance modulated task prioritization in Whole-Body Control.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Socially Guided XCS: Using Teaching Signals to Boost Learning.
Proceedings of the Genetic and Evolutionary Computation Conference, 2015

2014
Object Learning Through Active Exploration.
IEEE Trans. Auton. Ment. Dev., 2014

Modelling Individual Differences in the Form of Pavlovian Conditioned Approach Responses: A Dual Learning Systems Approach with Factored Representations.
PLoS Comput. Biol., 2014

Robot initiative in a team learning task increases the rhythm of interaction but not the perceived engagement.
Frontiers Neurorobotics, 2014

Learning a repertoire of actions with deep neural networks.
Proceedings of the 4th International Conference on Development and Learning and on Epigenetic Robotics, 2014

Multiple task optimization using dynamical movement primitives for whole-body reactive control.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

2013
Adaptation de la matrice de covariance pour l'apprentissage par renforcement direct.
Rev. d'Intelligence Artif., 2013

Apprentissage et optimisation de politiques pour un bras articulé actionné par des muscles.
Rev. d'Intelligence Artif., 2013

Robot Skill Learning: From Reinforcement Learning to Evolution Strategies.
Paladyn J. Behav. Robotics, 2013

Gated Autoencoders with Tied Input Weights.
Proceedings of the 30th International Conference on Machine Learning, 2013

Learning to recognize objects through curiosity-driven manipulation with the iCub humanoid robot.
Proceedings of the 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics, 2013

2012
From humans to humanoids: The optimal control framework.
Paladyn J. Behav. Robotics, 2012

Function approximation with LWPR and XCSF: a comparative study.
Evol. Intell., 2012

XCSF with local deletion: preventing detrimental forgetting.
Evol. Intell., 2012

Which Temporal Difference Learning Algorithm Best Reproduces Dopamine Activity in a Multi-choice Task?
Proceedings of the From Animals to Animats 12, 2012

Autonomous online learning of velocity kinematics on the iCub: A comparative study.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Path Integral Policy Improvement with Covariance Matrix Adaptation.
Proceedings of the 29th International Conference on Machine Learning, 2012

Perception and human interaction for developmental learning of objects and affordances.
Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012

Multimodal People Engagement with iCub.
Proceedings of the Biologically Inspired Cognitive Architectures 2012 - Proceedings of the Third Annual Meeting of the BICA Society, Palermo, Sicily, Italy, October 31, 2012

2011
On-line regression algorithms for learning mechanical models of robots: A survey.
Robotics Auton. Syst., 2011

Learning the velocity kinematics of ICUB for model-based control: XCSF versus LWPR.
Proceedings of the 11th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2011), 2011

Learning cost-efficient control policies with XCSF: generalization capabilities and further improvement.
Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

2010
From Motor Learning to Interaction Learning in Robots.
Proceedings of the From Motor Learning to Interaction Learning in Robots, 2010

Learning Forward Models for the Operational Space Control of Redundant Robots.
Proceedings of the From Motor Learning to Interaction Learning in Robots, 2010

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs.
Proceedings of the From Animals to Animats 11, 2010

A comparative study: function approximation with LWPR and XCSF.
Proceedings of the Genetic and Evolutionary Computation Conference, 2010

2009
Apprentissage par renforcement factorisé pour le comportement de personnages non joueurs.
Rev. d'Intelligence Artif., 2009

Considering Unseen States as Impossible in Factored Reinforcement Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Control of redundant robots using learned models: An operational space control approach.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Transfer of knowledge for a climbing Virtual Human: A reinforcement learning approach.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

2008
A comparison between ATNoSFERES and Learning Classifier Systems on non-Markov problems.
Inf. Sci., 2008

Compacting a Rule Base into an and/or Diagram for Game AI.
Proceedings of the GAMEON'2008, 2008

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Anticipatory Learning Classifier Systems and Factored Reinforcement Learning.
Proceedings of the Anticipatory Behavior in Adaptive Learning Systems, 2008

A Two-Level Model of Anticipation-Based Motor Learning for Whole Body Motion.
Proceedings of the Anticipatory Behavior in Adaptive Learning Systems, 2008

From Sensorimotor to Higher-Level Cognitive Processes: An Introduction to Anticipatory Behavior Systems.
Proceedings of the Anticipatory Behavior in Adaptive Learning Systems, 2008

2007
Learning classifier systems: a survey.
Soft Comput., 2007

Les systèmes de classeurs.
Rev. d'Intelligence Artif., 2007

2006
GACS : une approche ascendante pour la coordination spatiale.
Rev. d'Intelligence Artif., 2006

Chi-square Tests Driven Method for Learning the Structure of Factored MDPs.
Proceedings of the UAI '06, 2006

Anticipations, Brains, Individual and Social Behavior: An Introduction to Anticipatory Systems.
Proceedings of the Anticipatory Behavior in Adaptive Learning Systems, 2006

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems.
Proceedings of the Machine Learning, 2006

2005
Combining latent learning with dynamic programming in the modular anticipatory classifier system.
Eur. J. Oper. Res., 2005

An Experimental Comparison Between ATNoSFERES and ACS.
Proceedings of the Learning Classifier Systems, International Workshops, 2005

ATNoSFERES revisited.
Proceedings of the Genetic and Evolutionary Computation Conference, 2005

GACS, an evolutionary approach to the spatial coordination of agents.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

2004
Rapid response of head direction cells to reorienting visual cues: a computational model.
Neurocomputing, 2004

Improving MACS Thanks to a Comparison with 2TBNs.
Proceedings of the Genetic and Evolutionary Computation, 2004

2003
Coordination spatiale émergente par champs de potentiel.
Tech. Sci. Informatiques, 2003

Designing Efficient Exploration with MACS: Modules and Function Approximation.
Proceedings of the Genetic and Evolutionary Computation, 2003

Internal Models and Anticipations in Adaptive Learning Systems.
Proceedings of the Anticipatory Behavior in Adaptive Learning Systems, 2003

Anticipatory Behavior: Exploiting Knowledge About the Future to Improve Current Behavior.
Proceedings of the Anticipatory Behavior in Adaptive Learning Systems, 2003

2002
YACS: a new learning classifier system using anticipation.
Soft Comput., 2002

Further Comparison between ATNoSFERES and XCSM.
Proceedings of the Learning Classifier Systems, 5th International Workshop, 2002

A Comparison Between ATNoSFERES And XCSM.
Proceedings of the GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, 2002

2000
Using Classifier Systems as Adaptive Expert Systems for Control.
Proceedings of the Advances in Learning Classifier Systems, Third International Workshop, 2000

YACS: Combining Dynamic Programming with Generalization in Classifier Systems.
Proceedings of the Advances in Learning Classifier Systems, Third International Workshop, 2000

Being Reactive by Exchanging Roles: An Empirical Study.
Proceedings of the Balancing Reactivity and Social Deliberation in Multi-Agent Systems, 2000

