Nicolas Heess

CoRR, 2024

Neural Population Learning beyond Symmetric Zero-sum Games.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots.

[BibT_eX]

[DOI]

CoRR, 2023

Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities.

[BibT_eX]

[DOI]

CoRR, 2023

Replay across Experiments: A Natural Extension of Off-Policy RL.

[BibT_eX]

[DOI]

CoRR, 2023

TacticAI: an AI assistant for football tactics.

[BibT_eX]

[DOI]

CoRR, 2023

Policy composition in reinforcement learning via multi-objective policy optimization.

[BibT_eX]

[DOI]

CoRR, 2023

Towards A Unified Agent with Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2023

Barkour: Benchmarking Animal-level Agility with Quadruped Robots.

[BibT_eX]

[DOI]

CoRR, 2023

A Generalist Dynamics Model for Control.

[BibT_eX]

[DOI]

Leonard Hasenclever

CoRR, 2023

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains.

[BibT_eX]

[DOI]

Jingwei Zhang

CoRR, 2023

Coherent Soft Imitation Learning.

[BibT_eX]

[DOI]

Joe Watson

Sandy H. Huang

Wojciech Marian Czarnecki

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Language to Rewards for Robotic Skill Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

Representation Learning in Deep RL via Discrete Information Bottleneck.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

A Generalist Agent.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2022

From motor control to team play in simulated humanoid football.

[BibT_eX]

[DOI]

Sci. Robotics, 2022

Behavior Priors for Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

Arun Ahuja

J. Mach. Learn. Res., 2022

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration.

[BibT_eX]

[DOI]

CoRR, 2022

Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel.

[BibT_eX]

[DOI]

CoRR, 2022

Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach.

[BibT_eX]

[DOI]

Matt Hoffman

CoRR, 2022

Offline Distillation for Robot Lifelong Learning with Imbalanced Experience.

[BibT_eX]

[DOI]

CoRR, 2022

Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors.

[BibT_eX]

[DOI]

CoRR, 2022

Retrieval-Augmented Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Data augmentation for efficient learning from parametric experts.

[BibT_eX]

[DOI]

Alexandre Galashov

Joshua Scott Merel

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner.

[BibT_eX]

[DOI]

Konstantinos Bousmalis

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Offline Meta-Reinforcement Learning for Industrial Insertion.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Retrieval-Augmented Reinforcement Learning.

[BibT_eX]

[DOI]

Alessandro Davide Ialongo

Proceedings of the International Conference on Machine Learning, 2022

Learning transferable motor skills with hierarchical latent mixture policies.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

NeuPL: Neural Population Learning.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Evaluating Model-Based Planning and Planner Amortization for Continuous Control.

[BibT_eX]

[DOI]

Yuval Tassa

Proceedings of the Tenth International Conference on Learning Representations, 2022

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data.

[BibT_eX]

[DOI]

Proceedings of the Conference on Lifelong Learning Agents, 2022

MO2: Model-Based Offline Options.

[BibT_eX]

[DOI]

Proceedings of the Conference on Lifelong Learning Agents, 2022

2021

Game Plan: What AI can do for Football, and What Football can do for AI.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2021

Learning Dynamics Models for Model Predictive Agents.

[BibT_eX]

[DOI]

CoRR, 2021

Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration.

[BibT_eX]

[DOI]

CoRR, 2021

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning.

[BibT_eX]

[DOI]

Shruti Mishra

Dhruva TB

Konstantinos Bousmalis

CoRR, 2021

Entropic Desired Dynamics for Intrinsic Control.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Production Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Data-efficient Hindsight Off-policy Option Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Counterfactual Credit Assignment in Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Collect & Infer - a fresh look at data-efficient Reinforcement Learning.

[BibT_eX]

[DOI]

Roland Hafner

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

A Constrained Multi-Objective Reinforcement Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Towards Real Robot Learning in the Wild: A Case Study in Bipedal Locomotion.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020

Catch & Carry: reusable neural controllers for vision-guided whole-body tasks.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2020

dm_control: Software and tasks for continuous control.

[BibT_eX]

[DOI]

Softw. Impacts, 2020

Counterfactual Credit Assignment in Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Dexterous Manipulation from Suboptimal Experts.

[BibT_eX]

[DOI]

Rae Jeong

CoRR, 2020

Local Search for Policy Iteration in Continuous Control.

[BibT_eX]

[DOI]

CoRR, 2020

Temporal Difference Uncertainties as a Signal for Exploration.

[BibT_eX]

[DOI]

CoRR, 2020

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban.

[BibT_eX]

[DOI]

CoRR, 2020

Learning to swim in potential flow.

[BibT_eX]

[DOI]

CoRR, 2020

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Importance Weighted Policy Learning and Adaption.

[BibT_eX]

[DOI]

CoRR, 2020

Action and Perception as Divergence Minimization.

[BibT_eX]

[DOI]

CoRR, 2020

RL Unplugged: Benchmarks for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Sergio Gómez Colmenarejo

CoRR, 2020

Simple Sensor Intentions for Exploration.

[BibT_eX]

[DOI]

Tim Hertweck

Michael Bloesch

Giambattista Parascandolo

CoRR, 2020

Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning.

[BibT_eX]

[DOI]

CoRR, 2020

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Noah Y. Siegel

CoRR, 2020

Compositional Transfer in Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Markus Wulfmeier

Roland Hafner

Proceedings of the Robotics: Science and Systems XVI, 2020

Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Sergio Gómez Colmenarejo

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Value-driven Hindsight Modelling.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Critic Regularized Regression.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stabilizing Transformers for Reinforcement Learning.

[BibT_eX]

[DOI]

Siddhant M. Jayakumar

Max Jaderberg

Raphaël Lopez Kaufman

Proceedings of the 37th International Conference on Machine Learning, 2020

CoMic: Complementary Task Learning & Mimicry for Reusable Skills.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

A distributional view on multi-objective policy optimization.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control.

[BibT_eX]

[DOI]

H. Francis Song

Proceedings of the 8th International Conference on Learning Representations, 2020

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Noah Y. Siegel

Proceedings of the 8th International Conference on Learning Representations, 2020

A Generalized Training Approach for Multiagent Learning.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Learning Dexterous Manipulation from Suboptimal Experts.

[BibT_eX]

[DOI]

Rae Jeong

Proceedings of the 4th Conference on Robot Learning, 2020

Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions.

[BibT_eX]

[DOI]

Lars Buesing

Theophane Weber

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019

Reusable neural skill embeddings for vision-guided whole body movement and object manipulation.

[BibT_eX]

[DOI]

CoRR, 2019

Quinoa: a Q-function You Infer Normalized Over Actions.

[BibT_eX]

[DOI]

Jonas Degrave

CoRR, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.

[BibT_eX]

[DOI]

CoRR, 2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics.

[BibT_eX]

[DOI]

Markus Wulfmeier

Roland Hafner

CoRR, 2019

Meta reinforcement learning as task inference.

[BibT_eX]

[DOI]

CoRR, 2019

Meta-learning of Sequential Strategies.

[BibT_eX]

[DOI]

CoRR, 2019

Exploiting Hierarchy for Learning and Transfer in KL-regularized RL.

[BibT_eX]

[DOI]

CoRR, 2019

Value constrained model-free continuous control.

[BibT_eX]

[DOI]

CoRR, 2019

Self-supervised Learning of Image Embedding for Continuous Control.

[BibT_eX]

[DOI]

Carlos Florensa

Jonas Degrave

Krishnamurthy (Dj) Dvijotham

CoRR, 2019

Hindsight Credit Assignment.

[BibT_eX]

[DOI]

Anna Harutyunyan

Will Dabney

Thomas Mesnard

Mohammad Gheshlaghi Azar

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Reinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Artificial Life, 2019

Composing Entropic Policies using Divergence Correction.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures.

[BibT_eX]

[DOI]

Pushmeet Kohli

Proceedings of the 7th International Conference on Learning Representations, 2019

Neural Probabilistic Motor Primitives for Humanoid Control.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Hierarchical Visuomotor Control of Humanoids.

[BibT_eX]

[DOI]

Arun Ahuja

Vu Pham

Proceedings of the 7th International Conference on Learning Representations, 2019

Emergent Coordination Through Competition.

[BibT_eX]

[DOI]

Siqi Liu

Guy Lever

Thore Graepel

Proceedings of the 7th International Conference on Learning Representations, 2019

Information asymmetry in KL-regularized RL.

[BibT_eX]

[DOI]

Alexandre Galashov

Siddhant M. Jayakumar

Wojciech M. Czarnecki

Razvan Pascanu

Proceedings of the 7th International Conference on Learning Representations, 2019

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search.

[BibT_eX]

[DOI]

Jean-Baptiste Lespiau

Proceedings of the 7th International Conference on Learning Representations, 2019

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics.

[BibT_eX]

[DOI]

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Tranferable Latent Dynamics Models.

[BibT_eX]

[DOI]

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Observational Learning by Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

The Body is Not a Given: Joint Agent Policy Learning and Morphology Evolution.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Credit Assignment Techniques in Stochastic Computation Graphs.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

The Termination Critic.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

Relative Entropy Regularized Policy Iteration.

[BibT_eX]

[DOI]

CoRR, 2018

Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction.

[BibT_eX]

[DOI]

CoRR, 2018

Relational inductive biases, deep learning, and graph networks.

[BibT_eX]

[DOI]

CoRR, 2018

Reinforcement and Imitation Learning for Diverse Visuomotor Skills.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XIV, 2018

Graph Networks as Learnable Physics Engines for Inference and Control.

[BibT_eX]

[DOI]

Alvaro Sanchez-Gonzalez

Proceedings of the 35th International Conference on Machine Learning, 2018

Learning by Playing Solving Sparse Reward Tasks from Scratch.

[BibT_eX]

[DOI]

Wojciech Marian Czarnecki

Proceedings of the 35th International Conference on Machine Learning, 2018

Mix & Match Agent Curricula for Reinforcement Learning.

[BibT_eX]

[DOI]

Siddhant M. Jayakumar

Proceedings of the 35th International Conference on Machine Learning, 2018

Learning an Embedding Space for Transferable Robot Skills.

[BibT_eX]

[DOI]

Karol Hausman

Ziyu Wang

Proceedings of the 6th International Conference on Learning Representations, 2018

Distributed Distributional Deterministic Policy Gradients.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Maximum a Posteriori Policy Optimisation.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Imagination-Augmented Agents for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Danilo Jimenez Rezende

CoRR, 2017

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards.

[BibT_eX]

[DOI]

CoRR, 2017

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation.

[BibT_eX]

[DOI]

CoRR, 2017

Learning model-based planning from scratch.

[BibT_eX]

[DOI]

CoRR, 2017

Learning human behaviors from motion capture by adversarial imitation.

[BibT_eX]

[DOI]

CoRR, 2017

Emergence of Locomotion Behaviours in Rich Environments.

[BibT_eX]

[DOI]

CoRR, 2017

Distral: Robust multitask reinforcement learning.

[BibT_eX]

[DOI]

Victor Bapst

Wojciech M. Czarnecki

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Danilo Jimenez Rezende

Alexander Sasha Vezhnevets

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Filtering Variational Objectives.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning Hierarchical Information Flow with Recurrent Neural Modules.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Robust Imitation of Diverse Behaviors.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

FeUdal Networks for Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Particle Value Functions.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Metacontrol for Adaptive Imagination-Based Optimization.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Sample Efficient Actor-Critic with Experience Replay.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Sim-to-Real Robot Learning from Pixels with Progressive Nets.

[BibT_eX]

[DOI]

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

2016

Continuous control with deep reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Learning Representations, 2016

Learning and Transfer of Modulated Locomotor Controllers.

[BibT_eX]

[DOI]

CoRR, 2016

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models.

[BibT_eX]

[DOI]

CoRR, 2016

Unsupervised Learning of 3D Structure from Images.

[BibT_eX]

[DOI]

Danilo Jimenez Rezende

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

Passing Expectation Propagation Messages with Kernel Methods.

[BibT_eX]

[DOI]

Wittawat Jitkrittum

Arthur Gretton

CoRR, 2015

Memory-based control with recurrent neural networks.

[BibT_eX]

[DOI]

CoRR, 2015

Kernel-Based Just-In-Time Learning for Passing Expectation Propagation Messages.

[BibT_eX]

[DOI]

Balaji Lakshminarayanan

Dino Sejdinovic

Zoltán Szabó

Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Gradient Estimation Using Stochastic Computation Graphs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning Continuous Control Policies by Stochastic Value Gradients.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014

The Shape Boltzmann Machine: A Strong Model of Object Shape.

[BibT_eX]

[DOI]

S. M. Ali Eslami

Christopher K. I. Williams

John M. Winn

Int. J. Comput. Vis., 2014

Recurrent Models of Visual Attention.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Bayes-Adaptive Simulation-based Search with Value Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Deterministic Policy Gradient Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

Visual Boundary Prediction: A Deep Neural Prediction Network and Quality Dissection.

[BibT_eX]

[DOI]

Jyri J. Kivinen

Christopher K. I. Williams

Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

2013

Learning to Pass Expectation Propagation Messages.

[BibT_eX]

[DOI]

Nicolas Manfred Otto Heess

Daniel Tarlow

John M. Winn

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012

Learning generative models of mid-level structure in natural images.

[BibT_eX]

[DOI]

PhD thesis, 2012

Searching for objects driven by context.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Actor-Critic Reinforcement Learning with Energy-Based Policies.

[BibT_eX]

[DOI]

David Silver

Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

The Shape Boltzmann Machine: A strong model of object shape.

[BibT_eX]

[DOI]

S. M. Ali Eslami