Jan Peters

Orcid: 0000-0002-5266-8091

Affiliations:
  • TU Darmstadt, Department of Computer Science, Germany
  • Max Planck Institute for Intelligent Systems, Tübingen, Germany
  • Max Planck Institute for Biological Cybernetics, Tübingen, Germany
  • University of Southern California Los Angeles, Computational Learning and Motion Control Lab, CA, USA


According to our database1, Jan Peters authored at least 511 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Variational Hierarchical Mixtures for Probabilistic Learning of Inverse Dynamics.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Fast Kinodynamic Planning on the Constraint Manifold With Deep Neural Networks.
IEEE Trans. Robotics, 2024

What Matters for Active Texture Recognition With Vision-Based Tactile Sensors.
CoRR, 2024

Iterated Q-Network: Beyond the One-Step Bellman Operator.
CoRR, 2024

Information-Theoretic Safe Bayesian Optimization.
CoRR, 2024

Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks.
CoRR, 2024

Transition State Clustering for Interaction Segmentation and Learning.
Proceedings of the Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

Kinematically Constrained Human-like Bimanual Robot-to-Human Handovers.
Proceedings of the Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

Parameterized Projected Bellman Operator.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

An Adaptive Human Driver Model for Realistic Race Car Simulations.
IEEE Trans. Syst. Man Cybern. Syst., November, 2023

Composable energy policies for reactive motion generation and reinforcement learning.
Int. J. Robotics Res., September, 2023

Digital Twin of a Driver-in-the-Loop Race Car Simulation With Contextual Reinforcement Learning.
IEEE Robotics Autom. Lett., July, 2023

A human-centered safe robot reinforcement learning framework with interactive behaviors.
Frontiers Neurorobotics, June, 2023

Continuous-Time Fitted Value Iteration for Robust Policies.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

A Deterministic Approximation to Neural SDEs.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Learning-Based Design and Control for Quadrupedal Robots With Parallel-Elastic Actuators.
IEEE Robotics Autom. Lett., March, 2023

Combining physics and deep learning to learn continuous-time dynamics models.
Int. J. Robotics Res., March, 2023

Cheap and Deterministic Inference for Deep State-Space Models of Interacting Dynamical Systems.
Trans. Mach. Learn. Res., 2023

Learning to Control Highly Accelerated Ballistic Movements on Muscular Robots.
Robotics Auton. Syst., 2023

Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization.
CoRR, 2023

Evetac: An Event-based Optical Tactile Sensor for Robotic Manipulation.
CoRR, 2023

Learning Multimodal Latent Dynamics for Human-Robot Interaction.
CoRR, 2023

Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts.
CoRR, 2023

Towards Transferring Tactile-based Continuous Force Control Policies from Simulation to Robot.
CoRR, 2023

Time-Efficient Reinforcement Learning with Stochastic Stateful Policies.
CoRR, 2023

LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion.
CoRR, 2023

Domain Randomization via Entropy Maximization.
CoRR, 2023

Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula.
CoRR, 2023

Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning.
CoRR, 2023

On the Benefit of Optimal Transport for Curriculum Reinforcement Learning.
CoRR, 2023

Sampling-Free Probabilistic Deep State-Space Models.
CoRR, 2023

Value-Distributional Model-Based Reinforcement Learning.
CoRR, 2023

Function-Space Regularization for Deep Bayesian Classification.
CoRR, 2023

Model Predictive Control with Gaussian-Process-Supported Dynamical Constraints for Autonomous Vehicles.
CoRR, 2023

A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors.
CoRR, 2023

Pseudo-Likelihood Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Accelerating Motion Planning via Optimal Transport.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hierarchical Policy Blending As Optimal Transport.
Proceedings of the Learning for Dynamics and Control Conference, 2023

Placing by Touching: An Empirical Study on the Importance of Tactile Sensing for Precise Object Placing.
IROS, 2023

Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models.
IROS, 2023

Start State Selection for Control Policy Learning from Optimal Trajectories.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

SE(3)-DiffusionFields: Learning smooth cost functions for joint grasp and motion optimization through diffusion.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Safe Reinforcement Learning of Dynamic High-Dimensional Robotic Tasks: Navigation, Manipulation, Interaction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Hierarchical Policy Blending as Inference for Reactive Robot Control.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Clustering of Motion Trajectories by a Distance Measure Based on Semantic Features.
Proceedings of the 22nd IEEE-RAS International Conference on Humanoid Robots, 2023

Disentangling Interaction Using Maximum Entropy Reinforcement Learning in Multi-Agent Systems.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Visual Tactile Sensor Based Force Estimation for Position-Force Teleoperation.
Proceedings of the IEEE International Conference on Cyborg and Bionic Systems, 2023

Model-Based Uncertainty in Value Functions.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Autonomous Learning of Page Flipping Movements via Tactile Feedback.
IEEE Trans. Robotics, 2022

Learning to Play Table Tennis From Scratch Using Muscular Robots.
IEEE Trans. Robotics, 2022

Learning Stable Vector Fields on Lie Groups.
IEEE Robotics Autom. Lett., 2022

Robot Learning of Mobile Manipulation With Reachability Behavior Priors.
IEEE Robotics Autom. Lett., 2022

Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation.
IEEE Robotics Autom. Lett., 2022

Monte-Carlo Robot Path Planning.
IEEE Robotics Autom. Lett., 2022

Batch Reinforcement Learning With a Nonparametric Off-Policy Policy Gradient.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Robust Reinforcement Learning: A Review of Foundations and Recent Advances.
Mach. Learn. Knowl. Extr., 2022

HEBO: An Empirical Study of Assumptions in Bayesian Optimisation.
J. Artif. Intell. Res., 2022

Human-Robot Handshaking: A Review.
Int. J. Soc. Robotics, 2022

Integrating contrastive learning with dynamic models for reinforcement learning from images.
Neurocomputing, 2022

Robot Learning From Randomized Simulations: A Review.
Frontiers Robotics AI, 2022

PAC-Bayesian lifelong learning for multi-armed bandits.
Data Min. Knowl. Discov., 2022

How Crucial is Transformer in Decision Transformer?
CoRR, 2022

Variational Hierarchical Mixtures for Learning Probabilistic Inverse Dynamics.
CoRR, 2022

Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning.
CoRR, 2022

Structured Q-learning For Antibody Design.
CoRR, 2022

Active Inference for Robotic Manipulation.
CoRR, 2022

Revisiting Model-based Value Expansion.
CoRR, 2022

Accelerating Integrated Task and Motion Planning with Neural Feasibility Checking.
CoRR, 2022

A Hierarchical Approach to Active Pose Estimation.
CoRR, 2022

Residual Robot Learning for Object-Centric Probabilistic Movement Primitives.
CoRR, 2022

An Analysis of Measure-Valued Derivatives for Policy Gradients.
CoRR, 2022

A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search.
CoRR, 2022

Learning Geometric Constraints in Task and Motion Planning.
CoRR, 2022

Long-Term Visitation Value for Deep Exploration in Sparse-Reward Reinforcement Learning.
Algorithms, 2022

A Semantic Enhancement of Unified Geometric Representations for Improving Indoor Visual SLAM.
Proceedings of the 19th International Conference on Ubiquitous Robots, 2022

Information-Theoretic Safe Exploration with Gaussian Processes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Implicit Priors for Motion Optimization.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Active Exploration for Robotic Manipulation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Controlling the Cascade: Kinematic Planning for N-ball Toss Juggling.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Regularized Deep Signed Distance Fields for Reactive Motion Generation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Curriculum Reinforcement Learning via Constrained Optimal Transport.
Proceedings of the International Conference on Machine Learning, 2022

Boosted Curriculum Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Integrated Bi-Manual Motion Generation and Control shaped for Probabilistic Movement Primitives.
Proceedings of the 21st IEEE-RAS International Conference on Humanoid Robots, 2022

MILD: Multimodal Interactive Latent Dynamics for Learning Human-Robot Interaction.
Proceedings of the 21st IEEE-RAS International Conference on Humanoid Robots, 2022

Improving Sample Efficiency of Example-Guided Deep Reinforcement Learning for Bipedal Walking.
Proceedings of the 21st IEEE-RAS International Conference on Humanoid Robots, 2022

Adapting Object-Centric Probabilistic Movement Primitives with Residual Reinforcement Learning.
Proceedings of the 21st IEEE-RAS International Conference on Humanoid Robots, 2022

Inferring Smooth Control: Monte Carlo Posterior Policy Iteration with Gaussian Processes.
Proceedings of the Conference on Robot Learning, 2022

Dimensionality Reduction and Prioritized Exploration for Policy Search.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
ArmSym: A Virtual Human-Robot Interaction Laboratory for Assistive Robotics.
IEEE Trans. Hum. Mach. Syst., 2021

Sim2Real in Robotics and Automation: Applications and Challenges.
IEEE Trans Autom. Sci. Eng., 2021

SKID RAW: Skill Discovery From Raw Trajectories.
IEEE Robotics Autom. Lett., 2021

Data-Efficient Domain Randomization With Bayesian Optimization.
IEEE Robotics Autom. Lett., 2021

Assessing Transferability From Simulation to Reality for Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Convex optimization with an interpolation-based projection and its application to deep learning.
Mach. Learn., 2021

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning.
J. Mach. Learn. Res., 2021

MushroomRL: Simplifying Reinforcement Learning Research.
J. Mach. Learn. Res., 2021

Gaussian Approximation for Bias Reduction in Q-Learning.
J. Mach. Learn. Res., 2021

ExGenNet: Learning to Generate Robotic Facial Expression Using Facial Expression Recognition.
Frontiers Robotics AI, 2021

Distilled Domain Randomization.
CoRR, 2021

Model-Based Reinforcement Learning for Stochastic Hybrid Systems.
CoRR, 2021

A Differentiable Newton-Euler Algorithm for Real-World Robotics.
CoRR, 2021

A Robot Cluster for Reproducible Research in Dexterous Manipulation.
CoRR, 2021

High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning.
CoRR, 2021

Stochastic Control through Approximate Bayesian Input Inference.
CoRR, 2021

Reinforcement Learning using Guided Observability.
CoRR, 2021

Distributionally Robust Trajectory Optimization Under Uncertain Dynamics via Relative-Entropy Trust Regions.
CoRR, 2021

Extended Task and Motion Planning of Long-horizon Robot Manipulation.
CoRR, 2021

Robust Value Iteration for Continuous Control Tasks.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021


Efficient and Reactive Planning for High Speed Robot Air Hockey.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients.
Proceedings of the International Joint Conference on Neural Networks, 2021

Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning Human-like Hand Reaching for Human-Robot Handshaking.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Differentiable Physics Models for Real-world Offline Model-based Reinforcement Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Directed Acyclic Graph Neural Network for Human Motion Prediction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

A Variational Infinite Mixture for Probabilistic Inverse Dynamics Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Value Iteration in Continuous Actions, States and Time.
Proceedings of the 38th International Conference on Machine Learning, 2021

Convex Regularization in Monte-Carlo Tree Search.
Proceedings of the 38th International Conference on Machine Learning, 2021

Neural Posterior Domain Randomization.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Robot Reinforcement Learning on the Constraint Manifold.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Learn2Assemble with Structured Representations and Search for Robotic Architectural Construction.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Building Skill Learning Systems for Robotics.
Proceedings of the 17th IEEE International Conference on Automation Science and Engineering, 2021

Advancing Trajectory Optimization with Approximate Inference: Exploration, Covariance Control and Adaptive Risk.
Proceedings of the 2021 American Control Conference, 2021

Latent Derivative Bayesian Last Layer Networks.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Trajectory Optimization of Energy Consumption and Expected Service Life of a Robotic System.
Proceedings of the IEEE/ASME International Conference on Advanced Intelligent Mechatronics, 2021

2020
Adaptation and Robust Learning of Probabilistic Movement Primitives.
IEEE Trans. Robotics, 2020

Grip Stabilization through Independent Finger Tactile Feedback Control.
Sensors, 2020

Learning Sequential Force Interaction Skills.
Robotics, 2020

Probabilistic Approach to Physical Object Disentangling.
IEEE Robotics Autom. Lett., 2020

Plucking Motions for Tea Harvesting Robots Using Probabilistic Movement Primitives.
IEEE Robotics Autom. Lett., 2020

A Probabilistic Framework for Imitating Human Race Driver Behavior.
IEEE Robotics Autom. Lett., 2020

Multi-Sensor Next-Best-View Planning as Matroid-Constrained Submodular Maximization.
IEEE Robotics Autom. Lett., 2020

Real Time Trajectory Prediction Using Deep Conditional Generative Models.
IEEE Robotics Autom. Lett., 2020

Numerical Quadrature for Probabilistic Policy Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Evolutionary training and abstraction yields algorithmic generalization of neural computers.
Nat. Mach. Intell., 2020

Learning attribute grammars for movement primitive sequencing.
Int. J. Robotics Res., 2020

Incremental Learning of an Open-Ended Collaborative Skill Library.
Int. J. Humanoid Robotics, 2020

Hierarchical Tactile-Based Control Decomposition of Dexterous In-Hand Manipulation Tasks.
Frontiers Robotics AI, 2020

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.
Frontiers Robotics AI, 2020

Structured Policy Representation: Imposing Stability in arbitrarily conditioned dynamic systems.
CoRR, 2020

Perspectives on Sim2Real Transfer for Robotics: A Summary of the R: SS 2020 Workshop.
CoRR, 2020

A Differentiable Newton Euler Algorithm for Multi-body Model Learning.
CoRR, 2020

Differentiable Implicit Layers.
CoRR, 2020

Deterministic Inference of Neural Stochastic Differential Equations.
CoRR, 2020

Reinforcement Learning from a Mixture of Interpretable Experts.
CoRR, 2020

Orientation Attentive Robot Grasp Synthesis.
CoRR, 2020

Deep Reinforcement Learning with Weighted Q-Learning.
CoRR, 2020

Learning to Fly via Deep Model-Based Reinforcement Learning.
CoRR, 2020

Metric-Based Imitation Learning Between Two Dissimilar Anthropomorphic Robotic Arms.
CoRR, 2020

Dimensionality Reduction of Movement Primitives in Parameter Space.
CoRR, 2020

Bayesian Domain Randomization for Sim-to-Real Transfer.
CoRR, 2020

An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions.
CoRR, 2020

A Nonparametric Offpolicy Policy Gradient.
CoRR, 2020

Assisted teleoperation in changing environments with a mixture of virtual guides.
Adv. Robotics, 2020

Multi-agent active information gathering in discrete and continuous-state decentralized POMDPs by policy graph improvement.
Auton. Agents Multi Agent Syst., 2020

Bayesian Online Prediction of Change Points.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Advances in Human-Robot Handshaking.
Proceedings of the Social Robotics - 12th International Conference, 2020

Self-Paced Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Hierarchical Decomposition of Nonlinear Dynamics and Control for System Identification and Policy Distillation.
Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

Active Inference or Control as Inference? A Unifying View.
Proceedings of the Active Inference - First International Workshop, 2020

Decentralized Data-Driven Tuning of Droop Frequency Controllers.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Europe, 2020

Redundancy resolution under hard joint constraints: a generalized approach to rank updates.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

ImitationFlow: Learning Deep Stable Stochastic Dynamic Systems by Normalizing Flows.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Learning Hierarchical Acquisition Functions for Bayesian Optimization.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Deep Adversarial Reinforcement Learning for Object Disentangling.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Model-Based Quality-Diversity Search for Efficient Robot Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Generalized Mean Estimation in Monte-Carlo Tree Search.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning Control Policies from Optimal Trajectories.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Underactuated Waypoint Trajectory Optimization for Light Painting Photography.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Sharing Knowledge in Multi-Task Deep Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Evaluation of the Handshake Turing Test for anthropomorphic Robots.
Proceedings of the Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 2020

High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards.
Proceedings of the 4th Conference on Robot Learning, 2020

A Nonparametric Off-Policy Policy Gradient.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Optimizing the Execution of Dynamic Robot Movements With Learning Control.
IEEE Trans. Robotics, 2019

Reliable Real-Time Ball Tracking for Robot Table Tennis.
Robotics, 2019

Learning Intention Aware Online Adaptation of Movement Primitives.
IEEE Robotics Autom. Lett., 2019

Learning to Serve: An Experimental Study for a New Learning From Demonstrations Framework.
IEEE Robotics Autom. Lett., 2019

Learning Replanning Policies With Direct Policy Search.
IEEE Robotics Autom. Lett., 2019

Intrinsic motivation and mental replay enable efficient online adaptation in stochastic recurrent networks.
Neural Networks, 2019

TD-regularized actor-critic methods.
Mach. Learn., 2019

Compatible natural gradient policy search.
Mach. Learn., 2019

Reinforcement learning of motor skills using Policy Search and human corrective advice.
Int. J. Robotics Res., 2019

Learning Trajectory Distributions for Assisted Teleoperation and Path Planning.
Frontiers Robotics AI, 2019

How Cognitive Models of Human Body Experience Might Push Robotics.
Frontiers Neurorobotics, 2019

Entropic Regularization of Markov Decision Processes.
Entropy, 2019

Learning Algorithmic Solutions to Symbolic Planning Tasks with a Neural Computer.
CoRR, 2019

Model-based Lookahead Reinforcement Learning.
CoRR, 2019

Local Online Motor Babbling: Learning Motor Abundance of A Musculoskeletal Robot Arm.
CoRR, 2019

Learning walk and trot from the same objective using different types of exploration.
CoRR, 2019

Was ist eine Professur fuer Kuenstliche Intelligenz?
CoRR, 2019

CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity.
CoRR, 2019

Bayesian Online Detection and Prediction of Change Points.
CoRR, 2019

Neural Signatures of Motor Skill in the Resting Brain.
Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019

Generalized Multiple Correlation Coefficient as a Similarity Measurement between Trajectories.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Multimodal Uncertainty Reduction for Intention Recognition in Human-Robot Interaction.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Experience Reuse with Probabilistic Movement Primitives.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Entropic Risk Measure in Policy Search.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Local Online Motor Babbling: Learning Motor Abundance of a Musculoskeletal Robot Arm<sup>*</sup>.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Reinforcement Learning of Trajectory Distributions: Applications in Assisted Teleoperation and Motion Planning.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Chance-Constrained Trajectory Optimization for Non-linear Systems with Unknown Stochastic Dynamics.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Exploration Driven by an Optimistic Bellman Equation.
Proceedings of the International Joint Conference on Neural Networks, 2019

Switching Linear Dynamics for Variational Bayes Filtering.
Proceedings of the 36th International Conference on Machine Learning, 2019

Projections for Approximate Policy Iteration Algorithms.
Proceedings of the 36th International Conference on Machine Learning, 2019

Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Building a Library of Tactile Skills Based on FingerVision.
Proceedings of the 19th IEEE-RAS International Conference on Humanoid Robots, 2019

Stochastic Optimal Control as Approximate Input Inference.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Receding Horizon Curiosity.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

HJB Optimal Feedback Control with Deep Differential Value Functions and Action Constraints.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Self-Paced Contextual Reinforcement Learning.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Information Gathering in Decentralized POMDPs by Policy Graph Improvement.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Grip Stabilization of Novel Objects Using Slip Prediction.
IEEE Trans. Haptics, 2018

Online optimal trajectory generation for robot table tennis.
Robotics Auton. Syst., 2018

Approximate Value Iteration Based on Numerical Quadrature.
IEEE Robotics Autom. Lett., 2018

Mixture of Attractors: A Novel Movement Primitive Representation for Learning Motor Skills From Demonstrations.
IEEE Robotics Autom. Lett., 2018

Control of Musculoskeletal Systems Using Learned Dynamics Models.
IEEE Robotics Autom. Lett., 2018

Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling.
J. Mach. Learn. Res., 2018

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement.
J. Mach. Learn. Res., 2018

An Algorithmic Perspective on Imitation Learning.
Found. Trends Robotics, 2018

Assisting Movement Training and Execution With Visual and Haptic Feedback.
Frontiers Neurorobotics, 2018

Optimizing Execution of Dynamic Goal-Directed Robot Movements with Learning Control.
CoRR, 2018

In-Hand Object Stabilization by Independent Finger Control.
CoRR, 2018

f-Divergence constrained policy improvement.
CoRR, 2018

Using probabilistic movement primitives in robotics.
Auton. Robots, 2018

A kernel-based approach to learning contact distributions for robot manipulation tasks.
Auton. Robots, 2018

Probabilistic movement primitives under unknown system dynamics.
Adv. Robotics, 2018

Hierarchical reinforcement learning of multiple grasping strategies with human instructions.
Adv. Robotics, 2018

Regularizing Reinforcement Learning with State Abstraction.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Inducing Probabilistic Context-Free Grammars for the Sequencing of Movement Primitives.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Learning Coupled Forward-Inverse Models with Combined Prediction Errors.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos.
Proceedings of the 35th International Conference on Machine Learning, 2018

Online Learning of an Open-Ended Skill Library for Collaborative Tasks.
Proceedings of the 18th IEEE-RAS International Conference on Humanoid Robots, 2018

Utilizing Human Feedback in POMDP Execution and Specification.
Proceedings of the 18th IEEE-RAS International Conference on Humanoid Robots, 2018

Domain Randomization for Simulation-Based Policy Optimization with Transferability Assessment.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017
Robot Learning.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Policy Gradient Methods.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Whole-body multi-contact motion in humans and humanoids: Advances of the CoDyCo European project.
Robotics Auton. Syst., 2017

Probabilistic Prioritization of Movement Primitives.
IEEE Robotics Autom. Lett., 2017

Guiding Trajectory Optimization by Demonstrated Distributions.
IEEE Robotics Autom. Lett., 2017

A Comparison of Autoregressive Hidden Markov Models for Multimodal Manipulations With Variable Masses.
IEEE Robotics Autom. Lett., 2017

Generalized exploration in policy search.
Mach. Learn., 2017

Stability of Controllers for Gaussian Process Dynamics.
J. Mach. Learn. Res., 2017

Non-parametric Policy Search with Limited Information Loss.
J. Mach. Learn. Res., 2017

Towards Engagement Models that Consider Individual Factors in HRI: On the Relation of Extroversion and Negative Attitude Towards Robots to Gaze and Speech During a Human-Robot Assembly Task - Experiments with the iCub humanoid.
Int. J. Soc. Robotics, 2017

Phase estimation for fast action recognition and trajectory generation in human-robot collaboration.
Int. J. Robotics Res., 2017

Learning movement primitive libraries through probabilistic segmentation.
Int. J. Robotics Res., 2017

Manifold-based multi-objective policy search with sample reuse.
Neurocomputing, 2017

Prediction of Intention during Interaction with iCub with Probabilistic Movement Primitives.
Frontiers Robotics AI, 2017

Probabilistic movement primitives for coordination of multiple human-robot collaborative tasks.
Auton. Robots, 2017

Anticipatory action selection for human-robot table tennis.
Artif. Intell., 2017

Model-based contextual policy search for data-efficient generalization of robot skills.
Artif. Intell., 2017

Goal-driven dimensionality reduction for reinforcement learning.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Hybrid control trajectory optimization under uncertainty.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Context-driven movement primitive adaptation.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Empowered skills.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Layered direct policy search for learning hierarchical skills.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

A learning-based shared control architecture for interactive task execution.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Local Bayesian Optimization of Motor Skills.
Proceedings of the 34th International Conference on Machine Learning, 2017

Efficient online adaptation with stochastic recurrent neural networks.
Proceedings of the 17th IEEE-RAS International Conference on Humanoid Robotics, 2017

A comparison of distance measures for learning nonparametric motor skill libraries.
Proceedings of the 17th IEEE-RAS International Conference on Humanoid Robotics, 2017

Learning inverse dynamics models in O(n) time with LSTM networks.
Proceedings of the 17th IEEE-RAS International Conference on Humanoid Robotics, 2017

Investigating Music imagery as a Cognitive Paradigm for low-Cost brain-Computer Interfaces.
Proceedings of the From Vision to Reality, 2017

Bayesian Regression for Artifact correction in Electroencephalography.
Proceedings of the From Vision to Reality, 2017

Online Learning with Stochastic Recurrent Neural Networks using Intrinsic Motivation Signals.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Active Incremental Learning of Robot Movement Primitives.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

State-Regularized Policy Search for Linearized Dynamical Systems.
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, 2017

Policy Search with High-Dimensional Context Variables.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Robot Learning.
Proceedings of the Springer Handbook of Robotics, 2016

Acquiring and Generalizing the Embodiment Mapping From Human Observations to Robot Skills.
IEEE Robotics Autom. Lett., 2016

Probabilistic inference for determining options in reinforcement learning.
Mach. Learn., 2016

Hierarchical Relative Entropy Policy Search.
J. Mach. Learn. Res., 2016

Contextual Policy Search for Linear and Nonlinear Generalization of a Humanoid Walking Controller.
J. Intell. Robotic Syst., 2016

Can Modular Finger Control for In-Hand Object Stabilization be accomplished by Independent Tactile Feedback Control Laws?
CoRR, 2016

Bayesian optimization for learning gaits under uncertainty - An experimental comparison on a dynamic bipedal walker.
Ann. Math. Artif. Intell., 2016

Local-utopia policy selection for multi-objective reinforcement learning.
Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence, 2016

Multi-task logistic regression in brain-computer interfaces.
Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics, 2016

A low-cost sensor glove with vibrotactile feedback and multiple finger joint and hand motion sensing for human-robot interaction.
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016

Catching heuristics are optimal control policies.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Experiments with Hierarchical Reinforcement Learning of Multiple Grasping Policies.
Proceedings of the International Symposium on Experimental Robotics, 2016

Active tactile object exploration with Gaussian processes.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Probabilistic decomposition of sequential force interaction tasks into Movement Primitives.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

A new trajectory generation framework in robotic table tennis.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Stable reinforcement learning with autoencoders for tactile and visual data.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Manifold Gaussian Processes for regression.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Learning soft task priorities for control of redundant robots.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Movement primitives with multiple phase parameters.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

A lightweight robotic arm with pneumatic muscles for robot learning.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Stability of Controllers for Gaussian Process Forward Models.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep spiking networks for model-based planning in humanoids.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Demonstration based trajectory optimization for generalizable robot motions.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Jointly learning trajectory generation and hitting point prediction in robot table tennis.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Using probabilistic movement primitives for striking movements.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Incremental imitation learning of context-dependent motor skills.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Model-Based Relative Entropy Stochastic Search.
Proceedings of the Genetic and Evolutionary Computation Conference, 2016

Anticipative Interaction Primitives for Human-Robot Collaboration.
Proceedings of the 2016 AAAI Fall Symposia, Arlington, Virginia, USA, November 17-19, 2016, 2016

2015
Learning movement primitive attractor goals and sequential skills from kinesthetic demonstrations.
Robotics Auton. Syst., 2015

Multimodal Manipulation Under Uncertainty (Dagstuhl Seminar 15411).
Dagstuhl Reports, 2015

Low-cost Sensor Glove with Force Feedback for Learning from Demonstrations using Probabilistic Trajectory Representations.
CoRR, 2015

Towards engagement models that consider individual factors in HRI: on the relation of extroversion and negative attitude towards robots to gaze and speech during a human-robot assembly task.
CoRR, 2015

Active reward learning with a novel acquisition function.
Auton. Robots, 2015

A Probabilistic Framework for Semi-autonomous Robots Based on Interaction Primitives with Phase Estimation.
Proceedings of the Robotics Research, 2015

Combined pose-wrench and state machine representation for modeling Robotic Assembly Skills.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Stabilizing novel objects by learning to predict tactile slip.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Reinforcement learning vs human programming in tetherball robot games.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Model-free Probabilistic Movement Primitives for physical interaction.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Probabilistic progress prediction and sequencing of concurrent movement primitives.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Learning optimal striking points for a ping-pong playing robot.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Learning motor skills from partially observed movements executed at different speeds.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Extracting low-dimensional control variables for movement primitives.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Towards learning hierarchical skills for multi-phase manipulation tasks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Learning multiple collaborative tasks with a mixture of Interaction Primitives.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Learning inverse dynamics models with contacts.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Contextual Policy Search for Generalizing a Parameterized Biped Walking Controller.
Proceedings of the 2015 IEEE International Conference on Autonomous Robot Systems and Competitions, 2015

Probabilistic segmentation applied to an assembly task.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

A comparison of contact distribution representations for learning to predict object interactions.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Optimizing robot striking movement primitives with Iterative Learning Control.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Learning robot in-hand manipulation with tactile features.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Evaluation of tactile feature extraction for interactive object recognition.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

First-person tele-operation of a humanoid robot.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Learning torque control in presence of contacts using tactile sensing from robot skin.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Learning of Non-Parametric Control Policies with High-Dimensional State Features.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Policy Evaluation with Temporal Differences: A Survey and Comparison (Extended Abstract).
Proceedings of the Twenty-Fifth International Conference on Automated Planning and Scheduling, 2015

2014
Learning Motor Skills - From Algorithms to Robot Experiments
Springer Tracts in Advanced Robotics 97, Springer, ISBN: 978-3-319-03193-4, 2014

Probabilistic Segmentation and Targeted Exploration of Objects in Cluttered Environments.
IEEE Trans. Robotics, 2014

Natural evolution strategies.
J. Mach. Learn. Res., 2014

Policy evaluation with temporal differences: a survey and comparison.
J. Mach. Learn. Res., 2014

Generalizing Movements with Information-Theoretic Stochastic Optimal Control.
J. Aerosp. Inf. Syst., 2014

Learning modular policies for robotics.
Frontiers Comput. Neurosci., 2014

Learning strategies in table tennis using inverse reinforcement learning.
Biol. Cybern., 2014

Special issue on autonomous grasping and manipulation.
Auton. Robots, 2014

Indirect robot model learning for tracking control.
Adv. Robotics, 2014

Active Reward Learning.
Proceedings of the Robotics: Science and Systems X, 2014

Policy Search for Path Integral Control.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Bayesian Gait Optimization for Bipedal Locomotion.
Proceedings of the Learning and Intelligent Optimization, 2014

Learning to sequence movement primitives from demonstrations.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Latent space policy search for robotics.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Predicting object interactions from contact distributions.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Learning robot tactile sensing for object manipulation.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Sample-based informationl-theoretic stochastic optimal control.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Learning to predict phases of manipulation tasks as hidden states.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Multi-task policy search for robotics.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

An experimental comparison of Bayesian optimization for bipedal locomotion.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Policy search for learning robot control using sparse data.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Interaction primitives for human-robot cooperation tasks.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Multi-modal filtering for non-linear estimation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Omnidirectional Walking with a Compliant Inverted Pendulum Model.
Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

Learning Manipulation by Sequencing Motor Primitives with a Two-Armed Robot.
Proceedings of the Intelligent Autonomous Systems 13, 2014

Robust policy updates for stochastic optimal control.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Learning interaction for collaborative tasks with probabilistic movement primitives.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Tools for simulating humanoid robot dynamics: A survey based on user feedback.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Dimensionality reduction for probabilistic movement primitives.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Generalizing pouring actions between objects using warped parameters.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

2013
Probabilistic movement modeling for intention inference in human-robot interaction.
Int. J. Robotics Res., 2013

Learning to select and generalize striking movements in robot table tennis.
Int. J. Robotics Res., 2013

Reinforcement learning in robotics: A survey.
Int. J. Robotics Res., 2013

A Survey on Policy Search for Robotics.
Found. Trends Robotics, 2013

Multi-Task Policy Search.
CoRR, 2013

Probabilistic model-based imitation learning.
Adapt. Behav., 2013

Towards Robot Skill Learning: From Simple Skills to Table Tennis.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Probabilistic Movement Primitives.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Learning responsive robot behavior by imitation.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Autonomous reinforcement learning with hierarchical REPS.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Alignment-based transfer learning for robot models.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Feedback error learning for rhythmic motor primitives.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Model-based imitation learning by probabilistic trajectory matching.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Learning sequential motor tasks.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

A probabilistic approach to robot trajectory generation.
Proceedings of the 13th IEEE-RAS International Conference on Humanoid Robots, 2013

Probabilistic interactive segmentation for anthropomorphic robots in cluttered environments.
Proceedings of the 13th IEEE-RAS International Conference on Humanoid Robots, 2013

Data-Efficient Generalization of Robot Skills with Contextual Policy Search.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Online Kernel-Based Learning for Task-Space Tracking Robot Control.
IEEE Trans. Neural Networks Learn. Syst., 2012

Real-time detection of colored objects in multiple camera streams with off-the-shelf hardware components.
J. Real Time Image Process., 2012

Hierarchical Relative Entropy Policy Search.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Special issue on "Bio-inspired computing for autonomous vehicles".
Int. J. Intell. Comput. Cybern., 2012

Learning Prioritized Control of Motor Primitives
CoRR, 2012

Reinforcement learning to adjust parametrized motor primitives to new situations.
Auton. Robots, 2012

Probabilistic Modeling of Human Movements for Intention Inference.
Proceedings of the Robotics: Science and Systems VIII, 2012

Structured Apprenticeship Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Algorithms for Learning Markov Field Policies.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A brain-robot interface for studying motor learning after stroke.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Learning throwing and catching skills.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Maximally informative interaction learning for scene exploration.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Toward fast policy search for learning legged locomotion.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Learning concurrent motor skills in versatile solution spaces.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Generalization of human grasping for multi-fingered robot hands.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

A kernel-based approach to direct action perception.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Learning tracking control with forward models.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Point cloud completion using extrusions.
Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012

Preface.
Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

Robot Skill Learning.
Proceedings of the ECAI 2012, 2012

Reinforcement Learning in Robotics: A Survey.
Proceedings of the Reinforcement Learning, 2012

2011
Learning Dynamic Tactile Sensing With Robust Vision-Based Training.
IEEE Trans. Robotics, 2011

Learning grasp affordance densities.
Paladyn J. Behav. Robotics, 2011

Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning.
Neural Comput., 2011

Policy search for motor primitives in robotics.
Mach. Learn., 2011

Relative Entropy Inverse Reinforcement Learning.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Learning visual representations for perception-action systems.
Int. J. Robotics Res., 2011

Incremental online sparsification for model learning in real-time robot control.
Neurocomputing, 2011

Exploration and Curiosity in Robot Learning and Inference (Dagstuhl Seminar 11131).
Dagstuhl Reports, 2011

Model learning for robot control: a survey.
Cogn. Process., 2011

PAC-Bayesian Analysis of the Exploration-Exploitation Trade-off
CoRR, 2011

PAC-Bayesian Analysis of Martingales and Multiarmed Bandits
CoRR, 2011

A biomimetic approach to robot table tennis.
Adapt. Behav., 2011

A Non-Parametric Approach to Dynamic Programming.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Learning anticipation policies for robot table tennis.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Learning task-space tracking control with kernels.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Learning elementary movements jointly with a higher level task.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Learning robot grasping from 3-D images with Markov Random Fields.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Learning inverse kinematics with structured prediction.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Trajectory planning for optimal robot catching in real-time.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

A flexible hybrid framework for modeling complex manipulation tasks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Active exploration for robot parameter selection in episodic reinforcement learning.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Modeling Opponent Actions for Table-Tennis Playing Robot.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Balancing Safety and Exploitability in Opponent Modeling.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
From Motor Learning to Interaction Learning in Robots.
Proceedings of the From Motor Learning to Interaction Learning in Robots, 2010

Real-Time Local GP Model Learning.
Proceedings of the From Motor Learning to Interaction Learning in Robots, 2010

Imitation and Reinforcement Learning for Motor Primitives with Perceptual Coupling.
Proceedings of the From Motor Learning to Interaction Learning in Robots, 2010

Learning Continuous Grasp Affordances by Sensorimotor Exploration.
Proceedings of the From Motor Learning to Interaction Learning in Robots, 2010

Robot Learning.
Proceedings of the Encyclopedia of Machine Learning, 2010

Policy Gradient Methods.
Proceedings of the Encyclopedia of Machine Learning, 2010

Policy gradient methods.
Scholarpedia, 2010

Combining active learning and reactive control for robot grasping.
Robotics Auton. Syst., 2010

Imitation and Reinforcement Learning.
IEEE Robotics Autom. Mag., 2010

Parameter-exploring policy gradients.
Neural Networks, 2010

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
Neural Comput., 2010

Incremental Sparsification for Real-time Online Model Learning.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Recurrent policy gradients.
Log. J. IGPL, 2010

Algorithmen zum Automatischen Erlernen von Motorfähigkeiten (Policy Learning Algorithms for Motor Skills).
Autom., 2010

Closing the sensorimotor loop: Haptic feedback facilitates decoding of arm movement imagery.
Proceedings of the IEEE International Conference on Systems, 2010

Simulating Human Table Tennis with a Biomimetic Robot Setup.
Proceedings of the From Animals to Animats 11, 2010

Adapting Preshaped Grasping Movements Using Vision Descriptors.
Proceedings of the From Animals to Animats 11, 2010

Reinforcement Learning to adjust Robot Movements to New Situations.
Proceedings of the Robotics: Science and Systems VI, 2010

Movement extraction by detecting dynamics switches and repetitions.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Switched Latent Force Models for Movement Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Experiments with Motor Primitives in Table Tennis.
Proceedings of the Experimental Robotics, 2010

Learning probabilistic discriminative models of grasp affordances under limited supervision.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Using model knowledge for learning inverse dynamics.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Movement templates for learning of hitting and batting.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Grasping with Vision Descriptors and Motor Primitives.
Proceedings of the ICINCO 2010, 2010

Learning table tennis with a Mixture of Motor Primitives.
Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots, 2010

Relative Entropy Policy Search.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Robot learning [TC Spotlight].
IEEE Robotics Autom. Mag., 2009

Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Neural Networks, 2009

Policy Search for Motor Primitives.
Künstliche Intell., 2009

An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Gaussian process dynamic programming.
Neurocomputing, 2009

Guest editorial: Special issue on robot learning, Part B.
Auton. Robots, 2009

Guest editorial: Special issue on robot learning, Part A.
Auton. Robots, 2009

Model Learning with Local Gaussian Process Regression.
Adv. Robotics, 2009

Efficient Sample Reuse in EM-Based Policy Search.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Learning Visual Representations for Interactive Systems.
Proceedings of the Robotics Research - The 14th International Symposium, 2009

Towards Motor Skill Learning for Robotics.
Proceedings of the Robotics Research - The 14th International Symposium, 2009

Sparse online model learning for robot control with support vector regression.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Active learning using mean shift optimization for robot grasping.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Learning motor primitives for robotics.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

Learning complex motions by sequencing simpler motion templates.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

09341 Summary - Cognition, Control and Learning for Robot Manipulation in Human Environments.
Proceedings of the Cognition, 2009

09341 Abstracts Collection - Cognition, Control and Learning for Robot Manipulation in Human Environments.
Proceedings of the Cognition, 2009

Active Structured Learning for High-Speed Object Detection.
Proceedings of the Pattern Recognition, 2009

A Computational Model of Human Table Tennis for Robot Application.
Proceedings of the Autonome Mobile Systeme 2009, 2009

Learning New Basic Movements for Robotics.
Proceedings of the Autonome Mobile Systeme 2009, 2009

Using reward-weighted imitation for robot Reinforcement Learning.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

Efficient data reuse in value function approximation.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008
Reinforcement learning of motor skills with policy gradients.
Neural Networks, 2008

Machine Learning for motor skills in robotics.
Künstliche Intell., 2008

Learning to Control in Operational Space.
Int. J. Robotics Res., 2008

Operational Space Control: A Theoretical and Empirical Comparison.
Int. J. Robotics Res., 2008

Natural Actor-Critic.
Neurocomputing, 2008

Manifold-valued Thin-Plate Splines with Applications in Computer Graphics.
Comput. Graph. Forum, 2008

A unifying framework for robot control with redundant DOFs.
Auton. Robots, 2008

Fitness Expectation Maximization.
Proceedings of the Parallel Problem Solving from Nature, 2008

Local Gaussian Process Regression for Real Time Online Model Learning.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Fitted Q-iteration by Advantage Weighted Regression.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Using Bayesian Dynamical Systems for Motion Template Libraries.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Local Gaussian process regression for real-time model-based robot control.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Learning perceptual coupling for motor primitives.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Real-time learning of resolved velocity control on a Mitsubishi PA-10.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Episodic Reinforcement Learning by Logistic Reward-Weighted Regression.
Proceedings of the Artificial Neural Networks, 2008

Policy Gradients with Parameter-Based Exploration for Control.
Proceedings of the Artificial Neural Networks, 2008

Policy Learning - A Unified Perspective with Applications in Robotics.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Learning Inverse Dynamics: a Comparison.
Proceedings of the 16th European Symposium on Artificial Neural Networks, 2008

Model-Based Reinforcement Learning with Continuous States and Actions.
Proceedings of the 16th European Symposium on Artificial Neural Networks, 2008

Natural Evolution Strategies.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008

Computed torque control with nonparametric regression models.
Proceedings of the American Control Conference, 2008

Approximate dynamic programming with Gaussian processes.
Proceedings of the American Control Conference, 2008

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Computational Intelligence: Principles, Techniques and Applications.
Comput. J., 2007

Towards compliant humanoids-an experimental assessment of suitable task space position/orientation controllers.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Reinforcement Learning for Operational Space Control.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Policy Learning for Motor Skills.
Proceedings of the Neural Information Processing, 14th International Conference, 2007

Reinforcement learning by reward-weighted regression for operational space control.
Proceedings of the Machine Learning, 2007

Solving Deep Memory POMDPs with Recurrent Policy Gradients.
Proceedings of the Artificial Neural Networks, 2007

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning.
Proceedings of the 15th European Symposium on Artificial Neural Networks, 2007

Towards Machine Learning of Motor Skills.
Proceedings of the Autonome Mobile Systeme 2007, 2007

2006
A Bayesian Approach to Nonlinear Parameter Identification for Rigid Body Dynamics.
Proceedings of the Robotics: Science and Systems II, 2006

Learning Operational Space Control.
Proceedings of the Robotics: Science and Systems II, 2006

Policy Gradient Methods for Robotics.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Reinforcement Learning for Parameterized Motor Primitives.
Proceedings of the International Joint Conference on Neural Networks, 2006

2005
A unifying methodology for the control of robotic systems.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Comparative experiments on task space control with redundancy resolution.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Natural Actor-Critic.
Proceedings of the Machine Learning: ECML 2005, 2005

2003
Learning Movement Primitives.
Proceedings of the Robotics Research, The Eleventh International Symposium, 2003

2002
Searching a Scalable Approach to Cerebellar Based Control.
Appl. Intell., 2002


  Loading...