We stand with Ukraine

We stand with Ukraine

Sergey Levine

Orcid: 0000-0001-6764-2743

According to our database¹, Sergey Levine authored at least 552 papers between 2009 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

SACSoN: Scalable Autonomous Control for Social Navigation.

[BibT_eX]

[DOI]

,

,

,

IEEE Robotics Autom. Lett., January, 2024

Multistage Cable Routing Through Hierarchical Imitation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Robotics, 2024

Yell At Your Robot: Improving On-the-Fly from Language Corrections.

[BibT_eX]

[DOI]

Lucy Xiaoyang Shi

,

,

,

,

,

,

,

CoRR, 2024

Unfamiliar Finetuning Examples Control How Language Models Hallucinate.

[BibT_eX]

[DOI]

,

,

Claire J. Tomlin

,

,

CoRR, 2024

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

Ruslan Salakhutdinov

,

CoRR, 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL.

[BibT_eX]

[DOI]

Jesse Farebrother

,

,

,

Adrien Ali Taïga

,

Yevgen Chebotar

,

,

,

,

Pablo Samuel Castro

,

Aleksandra Faust

,

,

Rishabh Agarwal

CoRR, 2024

MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation.

[BibT_eX]

[DOI]

,

,

Kyle Stachowicz

,

,

CoRR, 2024

ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation.

[BibT_eX]

[DOI]

,

Catherine Glossop

,

,

,

,

,

,

CoRR, 2024

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Feedback Efficient Online Fine-Tuning of Diffusion Models.

[BibT_eX]

[DOI]

Masatoshi Uehara

,

,

,

Ehsan Hajiramezanali

,

Gabriele Scalia

,

Nathaniel Lee Diamant

,

,

,

Tommaso Biancalani

CoRR, 2024

Foundation Policies with Hilbert Representations.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control.

[BibT_eX]

[DOI]

Masatoshi Uehara

,

,

,

Ehsan Hajiramezanali

,

Gabriele Scalia

,

Nathaniel Lee Diamant

,

,

Tommaso Biancalani

,

CoRR, 2024

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs.

[BibT_eX]

[DOI]

Soroush Nasiriany

,

,

,

,

,

Ishita Dasgupta

,

,

,

,

,

,

,

Tsang-Wei Edward Lee

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Vision-Language Models Provide Promptable Representations for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control.

[BibT_eX]

[DOI]

,

,

,

,

,

Koushil Sreenath

CoRR, 2024

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents.

[BibT_eX]

[DOI]

CoRR, 2024

FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization.

[BibT_eX]

[DOI]

Jakub Grudzien Kuba

,

Masatoshi Uehara

,

,

CoRR, 2024

2023

Improving Generalization with Approximate Factored Value Functions.

[BibT_eX]

[DOI]

,

,

Trans. Mach. Learn. Res., 2023

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2023

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

RLIF: Interactive Imitation Learning as Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Offline RL with Observation Histories: Analyzing and Improving Sample Complexity.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models.

[BibT_eX]

[DOI]

,

Mitsuhiko Nakamoto

,

,

,

,

,

CoRR, 2023

Latent Conservative Objective Models for Data-Driven Crystal Structure Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias.

[BibT_eX]

[DOI]

,

,

,

Rafael Rafailov

,

,

CoRR, 2023

NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration.

[BibT_eX]

[DOI]

,

,

Catherine Glossop

,

CoRR, 2023

Deep Neural Networks Tend To Extrapolate Predictably.

[BibT_eX]

[DOI]

,

,

Claire J. Tomlin

,

CoRR, 2023

Robotic Offline RL from Internet Videos via Value-Function Pre-Training.

[BibT_eX]

[DOI]

Chethan Bhateja

,

,

,

,

,

,

Yevgen Chebotar

,

,

CoRR, 2023

Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions.

[BibT_eX]

[DOI]

CoRR, 2023

A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

,

Ruslan Salakhutdinov

CoRR, 2023

Multi-Stage Cable Routing through Hierarchical Imitation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

Confidence-Based Model Selection: When to Take Shortcuts for Subpopulation Shifts.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Stabilizing Contrastive RL: Techniques for Offline Goal Reaching.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

,

,

,

Ruslan Salakhutdinov

,

CoRR, 2023

SACSoN: Scalable Autonomous Data Collection for Social Navigation.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

The False Promise of Imitating Proprietary LLMs.

[BibT_eX]

[DOI]

Arnav Gudibande

,

,

,

,

,

,

,

CoRR, 2023

Training Diffusion Models with Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators.

[BibT_eX]

[DOI]

CoRR, 2023

IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies.

[BibT_eX]

[DOI]

Philippe Hansen-Estruch

,

,

,

Jakub Grudzien Kuba

,

CoRR, 2023

Neural Constraint Satisfaction: Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement.

[BibT_eX]

[DOI]

,

Alyssa L. Dayan

,

Franziska Meier

,

Thomas L. Griffiths

,

,

CoRR, 2023

Ignorance is Bliss: Robust Control via Information Gating.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning.

[BibT_eX]

[DOI]

Mitsuhiko Nakamoto

,

,

,

,

,

,

,

CoRR, 2023

Grounded Decoding: Guiding Text Generation with Grounded Models for Robot Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Robust and Versatile Bipedal Jumping Control through Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Koushil Sreenath

CoRR, 2023

Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models.

[BibT_eX]

[DOI]

,

,

Pierre Sermanet

,

,

,

,

,

Jonathan Tompson

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Learning and Adapting Agile Locomotion Skills by Transferring Experience.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Robust and Versatile Bipedal Jumping Control through Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Koushil Sreenath

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Pre-Training for Robots: Offline RL Enables Learning New Tasks in a Handful of Trials.

[BibT_eX]

[DOI]

,

,

Frederik D. Ebert

,

Mitsuhiko Nakamoto

,

,

,

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Demonstrating A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

RT-1: Robotics Transformer for Real-World Control at Scale.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Ignorance is Bliss: Robust Control via Information Gating.

[BibT_eX]

[DOI]

,

,

Matthew E. Taylor

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ReDS: Offline RL With Heteroskedastic Datasets via Support Constraints.

[BibT_eX]

[DOI]

,

,

,

Yevgen Chebotar

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HIQL: Offline Goal-Conditioned RL with Latent States as Actions.

[BibT_eX]

[DOI]

,

,

Benjamin Eysenbach

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning.

[BibT_eX]

[DOI]

Mitsuhiko Nakamoto

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Accelerating Exploration with Unlabeled Prior Data.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning to Influence Human Behavior with Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-Task Imitation Learning for Linear Dynamical Systems.

[BibT_eX]

[DOI]

Thomas T. C. K. Zhang

,

,

,

Claire J. Tomlin

,

,

,

Proceedings of the Learning for Dynamics and Control Conference, 2023

Contrastive Example-Based Control.

[BibT_eX]

[DOI]

Kyle Beltran Hatch

,

Benjamin Eysenbach

,

Rafael Rafailov

,

,

Ruslan Salakhutdinov

,

,

Proceedings of the Learning for Dynamics and Control Conference, 2023

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios.

[BibT_eX]

[DOI]

,

,

,

,

,

Rebecca Roelofs

,

,

,

Aleksandra Faust

,

Shimon Whiteson

,

Dragomir Anguelov

,

IROS, 2023

Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

Siddharth Reddy

,

,

,

IROS, 2023

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

GNM: A General Navigation Model to Drive Any Robot.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision.

[BibT_eX]

[DOI]

,

,

Gokul Narayanan

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

ExAug: Robot-Conditioned Navigation Policies via Geometric Experience Augmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Adversarial Policies Beat Superhuman Go AIs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Michael D. Dennis

,

,

Viktor Pogrebniak

,

,

Proceedings of the International Conference on Machine Learning, 2023

Jump-Start Reinforcement Learning.

[BibT_eX]

[DOI]

Ikechukwu Uchendu

,

,

,

,

,

Joséphine Simon

,

Matthew Bennice

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Predictable MDP Abstraction for Unsupervised Model-Based RL.

[BibT_eX]

[DOI]

,

Proceedings of the International Conference on Machine Learning, 2023

Understanding the Complexity Gains of Single-Task RL with a Curriculum.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Reinforcement Learning from Passive Data via Latent Intentions.

[BibT_eX]

[DOI]

,

Chethan Anand Bhateja

,

Proceedings of the International Conference on Machine Learning, 2023

A Connection between One-Step RL and Critic Regularization in Reinforcement Learning.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

,

Ruslan Salakhutdinov

Proceedings of the International Conference on Machine Learning, 2023

PaLM-E: An Embodied Multimodal Language Model.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Efficient Online Reinforcement Learning with Offline Data.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Offline RL for Natural Language Generation with Implicit Language Q Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts.

[BibT_eX]

[DOI]

,

Don Kurian Dennis

,

Benjamin Eysenbach

,

Aditi Raghunathan

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient Deep Reinforcement Learning Requires Regulating Overfitting.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes.

[BibT_eX]

[DOI]

,

Rishabh Agarwal

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Confidence-Conditioned Value Functions for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective.

[BibT_eX]

[DOI]

,

Homanga Bharadhwaj

,

Benjamin Eysenbach

,

,

Russ Salakhutdinov

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement.

[BibT_eX]

[DOI]

,

Alyssa L. Dayan

,

Franziska Meier

,

Thomas L. Griffiths

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

BridgeData V2: A Dataset for Robot Learning at Scale.

[BibT_eX]

[DOI]

Homer Rich Walke

,

,

,

,

,

Philippe Hansen-Estruch

,

,

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2023

FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing.

[BibT_eX]

[DOI]

Kyle Stachowicz

,

,

,

,

Proceedings of the Conference on Robot Learning, 2023

ViNT: A Foundation Model for Visual Navigation.

[BibT_eX]

[DOI]

,

,

,

Kyle Stachowicz

,

,

,

Proceedings of the Conference on Robot Learning, 2023

Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning.

[BibT_eX]

[DOI]

,

Michael Robert Equi

,

,

,

,

Proceedings of the Conference on Robot Learning, 2023

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control.

[BibT_eX]

[DOI]

,

,

,

Homer Rich Walke

,

Philippe Hansen-Estruch

,

,

Mihai Jalobeanu

,

,

,

Proceedings of the Conference on Robot Learning, 2023

Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2023

REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2023

Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

2022

ASE: large-scale reusable adversarial skill embeddings for physically simulated characters.

[BibT_eX]

[DOI]

,

,

,

,

ACM Trans. Graph., 2022

Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results.

[BibT_eX]

[DOI]

,

CoRR, 2022

Dual Generator Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Yevgen Chebotar

CoRR, 2022

Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints.

[BibT_eX]

[DOI]

,

,

,

Yevgen Chebotar

,

CoRR, 2022

Adversarial Policies Beat Professional-Level Go AIs.

[BibT_eX]

[DOI]

,

,

,

,

,

Michael D. Dennis

,

,

Viktor Pogrebniak

,

,

CoRR, 2022

FCM: Forgetful Causal Masking Makes Causal Language Models Better Zero-Shot Learners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2022

Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

Offline RL for Natural Language Generation with Implicit Language Q Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Multimodal Masked Autoencoders Learn Transferable Representations.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

Context-Aware Language Modeling for Goal-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Siddharth Verma

,

,

,

CoRR, 2022

When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?

[BibT_eX]

[DOI]

,

,

,

CoRR, 2022

Do As I Can, Not As I Say: Grounding Language in Robotic Affordances.

[BibT_eX]

[DOI]

CoRR, 2022

Fully Online Meta-Learning Without Task Boundaries.

[BibT_eX]

[DOI]

Jathushan Rajasegaran

,

,

CoRR, 2022

ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints.

[BibT_eX]

[DOI]

,

Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

Bridge Data: Boosting Generalization of Robotic Skills with Cross-Domain Datasets.

[BibT_eX]

[DOI]

,

,

Karl Schmeckpeper

,

Bernadette Bucher

,

Georgios Georgakis

,

Kostas Daniilidis

,

,

Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

MEMO: Test Time Robustness via Adaptation and Augmentation.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DASCO: Dual-Generator Adversarial Support Constrained Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Yevgen Chebotar

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adversarial Unlearning: Reducing Confidence Along Adversarial Directions.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Data-Driven Offline Decision-Making via Invariant Representation Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Contrastive Learning as Goal-Conditioned Reinforcement Learning.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

,

Ruslan Salakhutdinov

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Imitating Past Successes can be Very Suboptimal.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

Russ Salakhutdinov

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mismatched No More: Joint Model-Policy Optimization for Model-Based RL.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

Alexander Khazatsky

,

,

Ruslan Salakhutdinov

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

You Only Live Once: Single-Life Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Distributionally Adaptive Meta Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Object Representations as Fixed Points: Training Iterative Refinement Algorithms with Implicit Differentiation.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Siddharth Verma

,

,

,

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Context-Aware Language Modeling for Goal-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Hierarchical Reinforcement Learning for Precise Soccer Shooting Skills using a Quadrupedal Robot.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Koushil Sreenath

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Offline Meta-Reinforcement Learning for Industrial Insertion.

[BibT_eX]

[DOI]

,

,

,

Rugile Pevceviciute

,

,

,

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Control-Aware Prediction Objectives for Autonomous Driving.

[BibT_eX]

[DOI]

Rowan McAllister

,

,

,

,

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Hybrid Imitative Planning with Geometric and Predictive Costs in Off-road Environments.

[BibT_eX]

[DOI]

,

,

,

Henry A. Leopold

,

,

Ali-Akbar Agha-Mohammadi

,

Nicholas Rhinehart

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Siddharth Reddy

,

,

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

How to Leverage Unlabeled Data in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Yevgen Chebotar

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization.

[BibT_eX]

[DOI]

Brandon Trabucco

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Offline Meta-Reinforcement Learning with Online Self-Supervision.

[BibT_eX]

[DOI]

Vitchyr H. Pong

,

,

,

Catherine Huang

,

Proceedings of the International Conference on Machine Learning, 2022

Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control.

[BibT_eX]

[DOI]

,

,

,

,

Claire J. Tomlin

,

Proceedings of the International Conference on Machine Learning, 2022

Planning with Diffusion for Flexible Behavior Synthesis.

[BibT_eX]

[DOI]

,

,

Joshua B. Tenenbaum

,

Proceedings of the International Conference on Machine Learning, 2022

Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning.

[BibT_eX]

[DOI]

Philippe Hansen-Estruch

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Offline RL Policies Should Be Trained to be Adaptive.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

Ruslan Salakhutdinov

,

,

Joseph E. Gonzalez

Proceedings of the Tenth International Conference on Learning Representations, 2022

TRAIL: Near-Optimal Imitation Learning with Suboptimal Data.

[BibT_eX]

[DOI]

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Autonomous Reinforcement Learning: Formalism and Benchmarking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

Alexander Toshev

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Extending the WILDS Benchmark for Unsupervised Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Data-Driven Offline Optimization for Architecting Hardware Accelerators.

[BibT_eX]

[DOI]

,

Amir Yazdanbakhsh

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Should I Run Offline Reinforcement Learning or Behavioral Cloning?

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization.

[BibT_eX]

[DOI]

,

Rishabh Agarwal

,

,

Aaron C. Courville

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Offline Reinforcement Learning with Implicit Q-Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

The Information Geometry of Unsupervised Reinforcement Learning.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

Ruslan Salakhutdinov

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Maximum Entropy RL (Provably) Solves Some Robust RL Problems.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

RvS: What is Essential for Offline RL via Supervised Learning?

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Information Prioritization through Empowerment in Visual Model-based RL.

[BibT_eX]

[DOI]

Homanga Bharadhwaj

,

Mohammad Babaeizadeh

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

CoMPS: Continual Meta Policy Search.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2022

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Conference on Robot Learning, 2022

Offline Reinforcement Learning for Visual Navigation.

[BibT_eX]

[DOI]

,

,

,

,

Nicholas Rhinehart

,

Proceedings of the Conference on Robot Learning, 2022

Is Anyone There? Learning a Planner Contingent on Perceptual Uncertainty.

[BibT_eX]

[DOI]

,

Nicholas Rhinehart

,

Rowan Thomas McAllister

,

Matthew A. Wright

,

,

,

,

Joseph E. Gonzalez

Proceedings of the Conference on Robot Learning, 2022

Do As I Can, Not As I Say: Grounding Language in Robotic Affordances.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2022

Inner Monologue: Embodied Reasoning through Planning with Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Jonathan Tompson

,

,

Yevgen Chebotar

,

Pierre Sermanet

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2022

GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots.

[BibT_eX]

[DOI]

,

,

,

,

Bhuvan Basireddy

,

,

,

,

,

Koushil Sreenath

,

Proceedings of the Conference on Robot Learning, 2022

Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2022

2021

AMP: adversarial motion priors for stylized physics-based character control.

[BibT_eX]

[DOI]

,

,

,

,

Angjoo Kanazawa

ACM Trans. Graph., 2021

LaND: Learning to Navigate From Disengagements.

[BibT_eX]

[DOI]

,

,

IEEE Robotics Autom. Lett., 2021

BADGR: An Autonomous Self-Supervised Learning-Based Navigation System.

[BibT_eX]

[DOI]

,

,

IEEE Robotics Autom. Lett., 2021

Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads.

[BibT_eX]

[DOI]

Suneel Belkhale

,

,

,

Rowan McAllister

,

Roberto Calandra

,

IEEE Robotics Autom. Lett., 2021

How to train your robot with deep reinforcement learning: lessons we have learned.

[BibT_eX]

[DOI]

,

,

,

Mrinal Kalakrishnan

,

,

Int. J. Robotics Res., 2021

AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale.

[BibT_eX]

[DOI]

,

,

Yevgen Chebotar

,

,

,

Alexander Herzog

,

,

,

,

Dmitry Kalashnikov

,

CoRR, 2021

Training on Test Data with Bayesian Adaptation for Covariate Shift.

[BibT_eX]

[DOI]

,

CoRR, 2021

ReLMM: Practical RL for Learning Mobile Manipulation Skills Using Only Onboard Sensors.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2021

Persistent Reinforcement Learning via Subgoal Curricula.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2021

Explore and Control with Adversarial Surprise.

[BibT_eX]

[DOI]

Arnaud Fickinger

,

,

Samyak Parajuli

,

,

Nicholas Rhinehart

,

,

,

CoRR, 2021

Multi-Robot Deep Reinforcement Learning for Mobile Navigation.

[BibT_eX]

[DOI]

,

,

CoRR, 2021

FitVid: Overfitting in Pixel-Level Video Prediction.

[BibT_eX]

[DOI]

Mohammad Babaeizadeh

,

Mohammad Taghi Saffar

,

,

,

,

CoRR, 2021

Reinforcement Learning as One Big Sequence Modeling Problem.

[BibT_eX]

[DOI]

,

,

CoRR, 2021

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Shixiang Shane Gu

CoRR, 2021

MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale.

[BibT_eX]

[DOI]

Dmitry Kalashnikov

,

,

Yevgen Chebotar

,

Benjamin Swanson

,

Rico Jonschkowski

,

,

,

CoRR, 2021

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills.

[BibT_eX]

[DOI]

Yevgen Chebotar

,

,

,

,

Dmitry Kalashnikov

,

,

,

Benjamin Eysenbach

,

,

,

CoRR, 2021

RECON: Rapid Exploration for Open-World Navigation with Latent Goal Models.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

Nicholas Rhinehart

,

CoRR, 2021

How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned.

[BibT_eX]

[DOI]

,

,

,

Mrinal Kalakrishnan

,

,

CoRR, 2021

Bayesian Adaptation for Covariate Shift.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adaptive Risk Minimization: Learning to Adapt to Domain Shift.

[BibT_eX]

[DOI]

,

Henrik Marklund

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

COMBO: Conservative Offline Model-Based Policy Optimization.

[BibT_eX]

[DOI]

,

,

Rafael Rafailov

,

Aravind Rajeswaran

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Conservative Data Sharing for Multi-Task Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Yevgen Chebotar

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Autonomous Reinforcement Learning via Subgoal Curricula.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Outcome-Driven Reinforcement Learning via Variational Inference.

[BibT_eX]

[DOI]

Tim G. J. Rudner

,

,

Rowan McAllister

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Information is Power: Intrinsic Control via Information Capture.

[BibT_eX]

[DOI]

Nicholas Rhinehart

,

,

,

John D. Co-Reyes

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Pragmatic Image Compression for Human-in-the-Loop Decision-Making.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Which Mutual-Information Representation Learning Objectives are Sufficient for Control?

[BibT_eX]

[DOI]

,

,

Carlos Florensa

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Offline Reinforcement Learning as One Big Sequence Modeling Problem.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust Predictable Control.

[BibT_eX]

[DOI]

,

Ruslan Salakhutdinov

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification.

[BibT_eX]

[DOI]

,

,

Ruslan Salakhutdinov

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ViNG: Learning Open-World Navigation with Visual Goals.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

,

Nicholas Rhinehart

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Contingencies from Observations: Tractable Contingency Planning with Learned Behavior Models.

[BibT_eX]

[DOI]

Nicholas Rhinehart

,

,

,

Matthew A. Wright

,

Rowan McAllister

,

Joseph E. Gonzalez

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies.

[BibT_eX]

[DOI]

Soroush Nasiriany

,

Vitchyr H. Pong

,

,

Alexander Khazatsky

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Koushil Sreenath

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

What Can I Do Here? Learning New Skills by Imagining Visual Affordances.

[BibT_eX]

[DOI]

Alexander Khazatsky

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

SimGAN: Hybrid Simulator Identification for Domain Adaptation via Adversarial Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation.

[BibT_eX]

[DOI]

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Conservative Objective Models for Effective Offline Model-Based Optimization.

[BibT_eX]

[DOI]

Brandon Trabucco

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Model-Based Reinforcement Learning via Latent-Space Collocation.

[BibT_eX]

[DOI]

,

,

Anusha Nagabandi

,

Kostas Daniilidis

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Simple and Effective VAE Training with Calibrated Decoders.

[BibT_eX]

[DOI]

,

Kostas Daniilidis

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Emergent Social Learning via Multi-agent Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Offline Meta-Reinforcement Learning with Advantage Weighting.

[BibT_eX]

[DOI]

,

Rafael Rafailov

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Vitchyr H. Pong

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

WILDS: A Benchmark of in-the-Wild Distribution Shifts.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

Tatsuya Matsushima

,

,

,

,

,

Shixiang Shane Gu

Proceedings of the 38th International Conference on Machine Learning, 2021

PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Gregory Farquhar

Proceedings of the 38th International Conference on Machine Learning, 2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Shixiang Shane Gu

Proceedings of the 38th International Conference on Machine Learning, 2021

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills.

[BibT_eX]

[DOI]

Yevgen Chebotar

,

,

,

,

Dmitry Kalashnikov

,

,

,

Benjamin Eysenbach

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment.

[BibT_eX]

[DOI]

,

Sidhant Kaushik

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Model-Based Visual Planning with Self-Supervised Functional Distances.

[BibT_eX]

[DOI]

,

,

,

,

Benjamin Eysenbach

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Nicholas Rhinehart

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

Rishabh Agarwal

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Recurrent Independent Mechanisms.

[BibT_eX]

[DOI]

,

,

Jordan Hoffmann

,

,

,

,

Bernhard Schölkopf

Proceedings of the 9th International Conference on Learning Representations, 2021

Factorizing Declarative and Procedural Knowledge in Structured, Dynamical Environments.

[BibT_eX]

[DOI]

,

,

Phanideep Gampa

,

Philippe Beaudoin

,

Charles Blundell

,

,

,

Michael Curtis Mozer

Proceedings of the 9th International Conference on Learning Representations, 2021

Learning to Reach Goals via Iterated Supervised Learning.

[BibT_eX]

[DOI]

,

,

,

,

Coline Manon Devin

,

Benjamin Eysenbach

,

Proceedings of the 9th International Conference on Learning Representations, 2021

X2T: Training an X-to-Text Typing Interface with Online Learning from User Feedback.

[BibT_eX]

[DOI]

,

Siddharth Reddy

,

,

,

Nikhilesh Natraj

,

Karunesh Ganguly

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation.

[BibT_eX]

[DOI]

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Benchmarks for Deep Off-Policy Evaluation.

[BibT_eX]

[DOI]

,

Mohammad Norouzi

,

,

,

,

Alexander Novikov

,

,

Michael R. Zhang

,

,

,

Cosmin Paduraru

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

C-Learning: Learning to Achieve Goals via Recursive Classification.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

Ruslan Salakhutdinov

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

Shreyas Chaudhari

,

,

,

Ruslan Salakhutdinov

Proceedings of the 9th International Conference on Learning Representations, 2021

Evolving Reinforcement Learning Algorithms.

[BibT_eX]

[DOI]

John D. Co-Reyes

,

,

,

,

,

,

,

Aleksandra Faust

Proceedings of the 9th International Conference on Learning Representations, 2021

Conservative Safety Critics for Exploration.

[BibT_eX]

[DOI]

Homanga Bharadhwaj

,

,

Nicholas Rhinehart

,

,

Florian Shkurti

,

Proceedings of the 9th International Conference on Learning Representations, 2021

SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments.

[BibT_eX]

[DOI]

,

,

Coline Manon Devin

,

Nicholas Rhinehart

,

,

Dinesh Jayaraman

,

Proceedings of the 9th International Conference on Learning Representations, 2021

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Learning Invariant Representations for Reinforcement Learning without Reconstruction.

[BibT_eX]

[DOI]

,

Rowan Thomas McAllister

,

Roberto Calandra

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation.

[BibT_eX]

[DOI]

,

,

Coline Manon Devin

,

,

,

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Rapid Exploration for Open-World Navigation with Latent Goal Models.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

Nicholas Rhinehart

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

AW-Opt: Learning Robotic Skills with Imitation andReinforcement at Scale.

[BibT_eX]

[DOI]

,

,

Yevgen Chebotar

,

,

,

Alexander Herzog

,

,

,

,

Dmitry Kalashnikov

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Understanding the World Through Action.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

A Workflow for Offline Model-Free Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots.

[BibT_eX]

[DOI]

,

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Scaling Up Multi-Task Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

Dmitry Kalashnikov

,

,

Yevgen Chebotar

,

Benjamin Swanson

,

Rico Jonschkowski

,

,

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

BC-Z: Zero-Shot Task Generalization with Robotic Imitation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020

Morphology-Agnostic Visual Robotic Control.

[BibT_eX]

[DOI]

,

Dinesh Jayaraman

,

,

Alexei A. Efros

,

IEEE Robotics Autom. Lett., 2020

Safety Augmented Value Estimation From Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks.

[BibT_eX]

[DOI]

Brijen Thananjeyan

,

Ashwin Balakrishna

,

,

,

Rowan McAllister

,

Joseph E. Gonzalez

,

,

Francesco Borrelli

,

IEEE Robotics Autom. Lett., 2020

Cognitive Mapping and Planning for Visual Navigation.

[BibT_eX]

[DOI]

,

,

,

,

Rahul Sukthankar

,

Int. J. Comput. Vis., 2020

Variable-Shot Adaptation for Online Meta-Learning.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

WILDS: A Benchmark of in-the-Wild Distribution Shifts.

[BibT_eX]

[DOI]

,

,

Henrik Marklund

,

Sang Michael Xie

,

,

Akshay Balsubramani

,

,

Michihiro Yasunaga

,

Richard Lanas Phillips

,

,

,

,

,

,

,

CoRR, 2020

Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Mohammad Babaeizadeh

,

Mohammad Taghi Saffar

,

,

,

,

,

CoRR, 2020

Amortized Conditional Normalized Maximum Likelihood.

[BibT_eX]

[DOI]

,

CoRR, 2020

Rearrangement: A Challenge for Embodied AI.

[BibT_eX]

[DOI]

,

,

,

Andrew J. Davison

,

,

,

,

,

,

Roozbeh Mottaghi

,

,

CoRR, 2020

COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2020

γ-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

MELD: Meta-Reinforcement Learning from Images via Latent State Models.

[BibT_eX]

[DOI]

,

Anusha Nagabandi

,

,

,

CoRR, 2020

Multi-agent Social Reinforcement Learning Improves Generalization.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift.

[BibT_eX]

[DOI]

,

Henrik Marklund

,

,

,

CoRR, 2020

Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems.

[BibT_eX]

[DOI]

,

,

Phanideep Gampa

,

Philippe Beaudoin

,

,

Charles Blundell

,

,

CoRR, 2020

Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors.

[BibT_eX]

[DOI]

,

,

,

,

Dinesh Jayaraman

,

CoRR, 2020

Ecological Reinforcement Learning.

[BibT_eX]

[DOI]

John D. Co-Reyes

,

Suvansh Sanjeev

,

,

,

CoRR, 2020

Accelerating Online Reinforcement Learning with Offline Datasets.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling.

[BibT_eX]

[DOI]

Russell Mendonca

,

,

,

CoRR, 2020

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Efficient Adaptation for End-to-End Vision-Based Robotic Manipulation.

[BibT_eX]

[DOI]

,

Benjamin Swanson

,

Gaurav S. Sukhatme

,

,

,

CoRR, 2020

D4RL: Datasets for Deep Data-Driven Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

Unsupervised Sequence Forecasting of 100,000 Points for Unsupervised Trajectory Forecasting.

[BibT_eX]

[DOI]

,

,

,

,

Nicholas Rhinehart

CoRR, 2020

AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Robotics: Science and Systems XVI, 2020

Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Robotics: Science and Systems XVI, 2020

Learning Agile Robotic Locomotion Skills by Imitating Animals.

[BibT_eX]

[DOI]

,

,

,

Tsang-Wei Edward Lee

,

,

Proceedings of the Robotics: Science and Systems XVI, 2020

MOPO: Model-based Offline Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Gradient Surgery for Multi-Task Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Continual Learning of Control Primitives : Skill Discovery via Reset-Games.

[BibT_eX]

[DOI]

,

Siddharth Verma

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors.

[BibT_eX]

[DOI]

,

,

,

,

Dinesh Jayaraman

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model.

[BibT_eX]

[DOI]

,

Anusha Nagabandi

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Conservative Q-Learning for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model Inversion Networks for Model-Based Optimization.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement.

[BibT_eX]

[DOI]

,

,

,

Ruslan Salakhutdinov

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design.

[BibT_eX]

[DOI]

,

,

Eugene Vinitsky

,

Alexandre M. Bayen

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Reinforcement Learning for Robotic Industrial Insertion Tasks.

[BibT_eX]

[DOI]

Gerrit Schoettler

,

,

Juan Aparicio Ojea

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards.

[BibT_eX]

[DOI]

Gerrit Schoettler

,

,

,

,

Juan Aparicio Ojea

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Scaled Autonomy: Enabling Human Operators to Control Robot Fleets.

[BibT_eX]

[DOI]

,

Siddharth Reddy

,

,

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Scalable Multi-Task Imitation Learning with Autonomous Improvement.

[BibT_eX]

[DOI]

,

,

Alexander Irpan

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

OmniTact: A Multi-Directional High-Resolution Touch Sensor.

[BibT_eX]

[DOI]

Akhil Padmanabha

,

,

,

Roberto Calandra

,

,

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

TRASS: Time Reversal as Self-Supervision.

[BibT_eX]

[DOI]

,

Mohammad Babaeizadeh

,

,

,

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings.

[BibT_eX]

[DOI]

,

,

,

,

Dinesh Jayaraman

Proceedings of the 37th International Conference on Machine Learning, 2020

Learning Human Objectives by Evaluating Hypothetical Behavior.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Skew-Fit: State-Covering Self-Supervised Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts?

[BibT_eX]

[DOI]

,

Panagiotis Tigas

,

Rowan McAllister

,

Nicholas Rhinehart

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions.

[BibT_eX]

[DOI]

,

Sidhant Kaushik

,

S. Matthew Weinberg

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

The Ingredients of Real World Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Kristian Hartikainen

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards.

[BibT_eX]

[DOI]

,

,

,

Alexander Herzog

,

,

,

,

Mrinal Kalakrishnan

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Meta-Learning without Memorization.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Thinking While Moving: Deep Reinforcement Learning with Concurrent Control.

[BibT_eX]

[DOI]

,

,

Dmitry Kalashnikov

,

,

,

,

Alexander Herzog

Proceedings of the 8th International Conference on Learning Representations, 2020

Dynamics-Aware Unsupervised Discovery of Skills.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Deep Imitative Models for Flexible Inference, Planning, and Control.

[BibT_eX]

[DOI]

Nicholas Rhinehart

,

Rowan McAllister

,

Proceedings of the 8th International Conference on Learning Representations, 2020

SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation.

[BibT_eX]

[DOI]

,

Mohammad Babaeizadeh

,

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Model Based Reinforcement Learning for Atari.

[BibT_eX]

[DOI]

,

Mohammad Babaeizadeh

,

,

,

Roy H. Campbell

,

Konrad Czechowski

,

,

,

Piotr Kozakowski

,

,

Afroz Mohiuddin

,

,

,

Henryk Michalewski

Proceedings of the 8th International Conference on Learning Representations, 2020

Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery.

[BibT_eX]

[DOI]

Kristian Hartikainen

,

,

Tuomas Haarnoja

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

The Variational Bandwidth Bottleneck: Stochastic Evaluation on an Information Budget.

[BibT_eX]

[DOI]

,

,

Matthew M. Botvinick

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Adversarial Policies: Attacking Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Learning Predictive Models from Observation and Interaction.

[BibT_eX]

[DOI]

Karl Schmeckpeper

,

,

,

,

Kostas Daniilidis

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

RL-CycleGAN: Reinforcement Learning Aware Simulation-to-Real.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MELD: Meta-Reinforcement Learning from Images via Latent State Models.

[BibT_eX]

[DOI]

,

Anusha Nagabandi

,

,

,

Proceedings of the 4th Conference on Robot Learning, 2020

Inverting the Pose Forecasting Pipeline with SPF2: Sequential Pointcloud Forecasting for Sequential Pose Forecasting.

[BibT_eX]

[DOI]

,

,

,

,

Nicholas Rhinehart

Proceedings of the 4th Conference on Robot Learning, 2020

Chaining Behaviors from Data with Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 4th Conference on Robot Learning, 2020

Reinforcement Learning with Videos: Combining Offline Observations with Interaction.

[BibT_eX]

[DOI]

Karl Schmeckpeper

,

,

Kostas Daniilidis

,

,

Proceedings of the 4th Conference on Robot Learning, 2020

Assisted Perception: Optimizing Observations to Communicate State.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

Proceedings of the 4th Conference on Robot Learning, 2020

Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

,

Benjamin Swanson

,

Gaurav S. Sukhatme

,

,

,

Proceedings of the 4th Conference on Robot Learning, 2020

Learning to Walk in the Real World with Minimal Human Effort.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 4th Conference on Robot Learning, 2020

Unsupervised Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Nathan O. Lambert

,

,

Joseph Yaconelli

,

,

Roberto Calandra

,

Kristofer S. J. Pister

IEEE Robotics Autom. Lett., 2019

Reward-Conditioned Policies.

[BibT_eX]

[DOI]

,

,

CoRR, 2019

Learning To Reach Goals Without Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Benjamin Eysenbach

,

CoRR, 2019

SMiRL: Surprise Minimizing RL in Dynamic Environments.

[BibT_eX]

[DOI]

,

,

,

,

Dinesh Jayaraman

,

CoRR, 2019

Plan Arithmetic: Compositional Plan Vectors for Multi-Task Control.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2019

If MaxEnt RL is the Answer, What is the Question?

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

CoRR, 2019

Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2019

Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2019

Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery.

[BibT_eX]

[DOI]

Kristian Hartikainen

,

,

Tuomas Haarnoja

,

CoRR, 2019

Efficient Exploration via State Marginal Matching.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

Emilio Parisotto

,

,

,

Ruslan Salakhutdinov

CoRR, 2019

Learning Powerful Policies by Using Consistent Dynamics Model.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2019

Watch, Try, Learn: Meta-Learning from Demonstrations and Reward.

[BibT_eX]

[DOI]

,

,

,

Alexander Herzog

,

,

,

,

Mrinal Kalakrishnan

,

,

CoRR, 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2019

Extending Deep Model Predictive Control with Safety Augmented Value Estimation from Demonstrations.

[BibT_eX]

[DOI]

Brijen Thananjeyan

,

Ashwin Balakrishna

,

,

,

Rowan McAllister

,

Joseph E. Gonzalez

,

,

Francesco Borrelli

,

CoRR, 2019

SQIL: Imitation Learning via Regularized Behavioral Cloning.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

CoRR, 2019

REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning.

[BibT_eX]

[DOI]

,

,

,

,

Dinesh Jayaraman

CoRR, 2019

End-to-End Robotic Reinforcement Learning without Reward Engineering.

[BibT_eX]

[DOI]

,

,

Kristian Hartikainen

,

,

CoRR, 2019

VideoFlow: A Flow-Based Generative Model for Video.

[BibT_eX]

[DOI]

,

Mohammad Babaeizadeh

,

,

,

,

,

CoRR, 2019

Model-Based Reinforcement Learning for Atari.

[BibT_eX]

[DOI]

,

Mohammad Babaeizadeh

,

,

,

Roy H. Campbell

,

Konrad Czechowski

,

,

,

Piotr Kozakowski

,

,

,

,

Henryk Michalewski

CoRR, 2019

Artificial Intelligence for Prosthetics - challenge solutions.

[BibT_eX]

[DOI]

CoRR, 2019

Improvisation through Physical Understanding: Using Novel Objects As Tools with Visual Foresight.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Robotics: Science and Systems XV, 2019

End-To-End Robotic Reinforcement Learning without Reward Engineering.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Robotics: Science and Systems XV, 2019

Learning to Walk Via Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

,

,

,

,

Proceedings of the Robotics: Science and Systems XV, 2019

Meta-Learning with Implicit Gradients.

[BibT_eX]

[DOI]

Aravind Rajeswaran

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Wasserstein Dependency Measure for Representation Learning.

[BibT_eX]

[DOI]

,

,

,

Aäron van den Oord

,

,

Pierre Sermanet

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Planning with Goal-Conditioned Policies.

[BibT_eX]

[DOI]

Soroush Nasiriany

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Guided Meta-Policy Search.

[BibT_eX]

[DOI]

Russell Mendonca

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

When to Trust Your Model: Model-Based Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Curricula for Visual Meta-Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Off-Policy Evaluation via Off-Policy Classification.

[BibT_eX]

[DOI]

Alexander Irpan

,

,

Konstantinos Bousmalis

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Causal Confusion in Imitation Learning.

[BibT_eX]

[DOI]

,

Dinesh Jayaraman

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Search on the Replay Buffer: Bridging Planning and Reinforcement Learning.

[BibT_eX]

[DOI]

,

Ruslan Salakhutdinov

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Compositional Plan Vectors.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

One-Shot Composition of Vision-Based Skills from Demonstration.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost.

[BibT_eX]

[DOI]

,

,

Aravind Rajeswaran

,

,

Proceedings of the International Conference on Robotics and Automation, 2019

REPLAB: A Reproducible Low-Cost Arm Benchmark for Robotic Learning.

[BibT_eX]

[DOI]

,

Dinesh Jayaraman

,

,

Proceedings of the International Conference on Robotics and Automation, 2019

Manipulation by Feel: Touch-Based Control with Deep Predictive Models.

[BibT_eX]

[DOI]

,

,

Dinesh Jayaraman

,

Mayur Mudigonda

,

,

Roberto Calandra

,

Proceedings of the International Conference on Robotics and Automation, 2019

Robustness to Out-of-Distribution Inputs via Task-Aware Generative Uncertainty.

[BibT_eX]

[DOI]

Rowan McAllister

,

,

,

Proceedings of the International Conference on Robotics and Automation, 2019

Learning to Identify Object Instances by Touch: Tactile Recognition via Multimodal Matching.

[BibT_eX]

[DOI]

,

Roberto Calandra

,

Proceedings of the International Conference on Robotics and Automation, 2019

Data-efficient Learning of Morphology and Controller for a Microrobot.

[BibT_eX]

[DOI]

,

,

,

,

Kristofer S. J. Pister

,

,

Roberto Calandra

Proceedings of the International Conference on Robotics and Automation, 2019

Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight.

[BibT_eX]

[DOI]

,

Suneel Belkhale

,

,

,

Proceedings of the International Conference on Robotics and Automation, 2019

Residual Reinforcement Learning for Robot Control.

[BibT_eX]

[DOI]

Tobias Johannink

,

,

,

,

,

Matthias Loskyll

,

Juan Aparicio Ojea

,

,

Proceedings of the International Conference on Robotics and Automation, 2019

SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Matthew J. Johnson

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Learning a Prior over Intent via Meta-Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables.

[BibT_eX]

[DOI]

,

,

,

,

Deirdre Quillen

Proceedings of the 36th International Conference on Machine Learning, 2019

EMI: Exploration with Mutual Information.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Diagnosing Bottlenecks in Deep Q-learning Algorithms.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Online Meta-Learning.

[BibT_eX]

[DOI]

,

Aravind Rajeswaran

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow.

[BibT_eX]

[DOI]

,

Angjoo Kanazawa

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Deep Online Learning Via Meta-Learning: Continual Adaptation for Model-Based RL.

[BibT_eX]

[DOI]

Anusha Nagabandi

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning.

[BibT_eX]

[DOI]

Anusha Nagabandi

,

,

,

Ronald S. Fearing

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning.

[BibT_eX]

[DOI]

,

Kumar Krishna Agrawal

,

Debidatta Dwibedi

,

,

Jonathan Tompson

Proceedings of the 7th International Conference on Learning Representations, 2019

Time-Agnostic Prediction: Predicting Predictable Video Frames.

[BibT_eX]

[DOI]

Dinesh Jayaraman

,

,

Alexei A. Efros

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Reasoning About Physical Interactions with Object-Oriented Prediction and Planning.

[BibT_eX]

[DOI]

,

,

William T. Freeman

,

Joshua B. Tenenbaum

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Unsupervised Learning via Meta-Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

InfoBot: Transfer and Exploration via the Information Bottleneck.

[BibT_eX]

[DOI]

,

,

,

,

Hugo Larochelle

,

Matthew M. Botvinick

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Recall Traces: Backtracking Models for Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

,

Philemon Brakel

,

,

,

Timothy P. Lillicrap

,

,

Hugo Larochelle

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Learning Actionable Representations with Goal Conditioned Policies.

[BibT_eX]

[DOI]

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following.

[BibT_eX]

[DOI]

,

Anoop Korattikara

,

,

Sergio Guadarrama

Proceedings of the 7th International Conference on Learning Representations, 2019

Diversity is All You Need: Learning Skills without a Reward Function.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Guiding Policies with Language via Meta-Learning.

[BibT_eX]

[DOI]

John D. Co-Reyes

,

,

Suvansh Sanjeev

,

,

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Automatically Composing Representation Transformations as a Means for Generalization.

[BibT_eX]

[DOI]

,

,

,

Thomas L. Griffiths

Proceedings of the 7th International Conference on Learning Representations, 2019

PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings.

[BibT_eX]

[DOI]

Nicholas Rhinehart

,

Rowan McAllister

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sim-To-Real via Sim-To-Sim: Data-Efficient Robotic Grasping via Randomized-To-Canonical Adaptation Networks.

[BibT_eX]

[DOI]

,

,

Mrinal Kalakrishnan

,

Dmitry Kalashnikov

,

,

,

,

,

Konstantinos Bousmalis

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning.

[BibT_eX]

[DOI]

,

Deirdre Quillen

,

,

,

,

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Entity Abstraction in Visual Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Rishi Veerapaneni

,

John D. Co-Reyes

,

,

,

,

,

Joshua B. Tenenbaum

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Contextual Imagined Goals for Self-Supervised Robotic Learning.

[BibT_eX]

[DOI]

,

,

Alexander Khazatsky

,

,

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Deep Dynamics Models for Learning Dexterous Manipulation.

[BibT_eX]

[DOI]

Anusha Nagabandi

,

,

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Learning Latent Plans from Play.

[BibT_eX]

[DOI]

,

,

,

,

Jonathan Tompson

,

,

Pierre Sermanet

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

RoboNet: Large-Scale Multi-Robot Learning.

[BibT_eX]

[DOI]

,

,

,

,

Bernadette Bucher

,

Karl Schmeckpeper

,

Siddharth Singh

,

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

ROBEL: Robotics Benchmarks for Learning with Low-Cost Robots.

[BibT_eX]

[DOI]

,

,

Kristian Hartikainen

,

,

,

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018

SFV: reinforcement learning of physical skills from videos.

[BibT_eX]

[DOI]

,

Angjoo Kanazawa

,

,

,

ACM Trans. Graph., 2018

DeepMimic: example-guided deep reinforcement learning of physics-based character skills.

[BibT_eX]

[DOI]

,

,

,

Michiel van de Panne

ACM Trans. Graph., 2018

Learning Flexible and Reusable Locomotion Primitives for a Microrobot.

[BibT_eX]

[DOI]

,

,

Roberto Calandra

,

Daniel Contreras

,

,

Kristofer S. J. Pister

IEEE Robotics Autom. Lett., 2018

More Than a Feeling: Learning to Grasp and Regrasp Using Vision and Touch.

[BibT_eX]

[DOI]

Roberto Calandra

,

,

Dinesh Jayaraman

,

,

,

,

Edward H. Adelson

,

IEEE Robotics Autom. Lett., 2018

Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection.

[BibT_eX]

[DOI]

,

,

Alex Krizhevsky

,

,

Deirdre Quillen

Int. J. Robotics Res., 2018

Soft Actor-Critic Algorithms and Applications.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

,

Kristian Hartikainen

,

,

,

,

,

,

,

,

CoRR, 2018

Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2018

Hierarchical Policy Design for Sample-Efficient Learning of Robot Table Tennis Through Self-Play.

[BibT_eX]

[DOI]

Reza Mahjourian

,

,

,

,

Risto Miikkulainen

CoRR, 2018

Guiding Policies with Language via Meta-Learning.

[BibT_eX]

[DOI]

John D. Co-Reyes

,

,

Suvansh Sanjeev

,

,

,

,

CoRR, 2018

One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2018

EMI: Exploration with Mutual Information Maximizing State and Action Embeddings.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2018

Time Reversal as Self-Supervision.

[BibT_eX]

[DOI]

,

Mohammad Babaeizadeh

,

,

,

CoRR, 2018

Addressing Sample Inefficiency and Reward Bias in Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

,

Kumar Krishna Agrawal

,

,

Jonathan Tompson

CoRR, 2018

SOLAR: Deep Structured Latent Representations for Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Matthew J. Johnson

,

CoRR, 2018

QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation.

[BibT_eX]

[DOI]

Dmitry Kalashnikov

,

,

,

,

Alexander Herzog

,

,

Deirdre Quillen

,

,

Mrinal Kalakrishnan

,

Vincent Vanhoucke

,

CoRR, 2018

Few-Shot Segmentation Propagation with Guided Networks.

[BibT_eX]

[DOI]

,

,

,

Alexei A. Efros

,

CoRR, 2018

Unsupervised Meta-Learning for Reinforcement Learning.

[BibT_eX]

[DOI]

,

Benjamin Eysenbach

,

,

CoRR, 2018

Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review.

[BibT_eX]

[DOI]

CoRR, 2018

Stochastic Adversarial Video Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2018

Universal Planning Networks.

[BibT_eX]

[DOI]

Aravind Srinivas

,

,

,

,

CoRR, 2018

Recall Traces: Backtracking Models for Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

,

Philemon Brakel

,

,

Timothy P. Lillicrap

,

,

Hugo Larochelle

,

CoRR, 2018

Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning.

[BibT_eX]

[DOI]

Lukasz Kidzinski

,

Sharada P. Mohanty

,

Carmichael F. Ong

,

Jennifer L. Hicks

,

Sean F. Carroll

,

,

Marcel Salathé

,

CoRR, 2018

Learning to Adapt: Meta-Learning for Model-Based Control.

[BibT_eX]

[DOI]

,

Anusha Nagabandi

,

Ronald S. Fearing

,

,

,

CoRR, 2018

Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

Vladimir Feinberg

,

,

,

Michael I. Jordan

,

Joseph E. Gonzalez

,

CoRR, 2018

One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Robotics: Science and Systems XIV, 2018

Shared Autonomy via Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

Proceedings of the Robotics: Science and Systems XIV, 2018

Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations.

[BibT_eX]

[DOI]

Aravind Rajeswaran

,

,

,

,

,

Emanuel Todorov

,

Proceedings of the Robotics: Science and Systems XIV, 2018

Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior.

[BibT_eX]

[DOI]

Siddharth Reddy

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Visual Reinforcement Learning with Imagined Goals.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Data-Efficient Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Visual Memory for Robust Path Following.

[BibT_eX]

[DOI]

,

,

David F. Fouhey

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Meta-Reinforcement Learning of Structured Exploration Strategies.

[BibT_eX]

[DOI]

,

Russell Mendonca

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Probabilistic Model-Agnostic Meta-Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models.

[BibT_eX]

[DOI]

,

Roberto Calandra

,

Rowan McAllister

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning with Latent Language.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Learning Image-Conditioned Dynamics Models for Control of Underactuated Legged Millirobots.

[BibT_eX]

[DOI]

Anusha Nagabandi

,

,

,

,

,

,

Ronald S. Fearing

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Time-Contrastive Networks: Self-Supervised Learning from Video.

[BibT_eX]

[DOI]

Pierre Sermanet

,

,

Yevgen Chebotar

,

,

,

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-to-End Learning from Demonstration.

[BibT_eX]

[DOI]

Rouhollah Rahmatizadeh

,

Pooya Abolghasemi

,

Ladislau Bölöni

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods.

[BibT_eX]

[DOI]

Deirdre Quillen

,

,

,

,

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning.

[BibT_eX]

[DOI]

Anusha Nagabandi

,

,

Ronald S. Fearing

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Self-Supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Composable Deep Reinforcement Learning for Robotic Manipulation.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

,

,

,

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Deep Object-Centric Representations for Generalizable Robot Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping.

[BibT_eX]

[DOI]

Konstantinos Bousmalis

,

,

,

,

,

Mrinal Kalakrishnan

,

,

,

,

,

,

Vincent Vanhoucke

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control.

[BibT_eX]

[DOI]

Aravind Srinivas

,

,

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

Latent Space Policies for Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

Kristian Hartikainen

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings.

[BibT_eX]

[DOI]

John D. Co-Reyes

,

,

,

Benjamin Eysenbach

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.

[BibT_eX]

[DOI]

,

Surya Bhupatiraju

,

,

Richard E. Turner

,

Zoubin Ghahramani

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Conditional Networks for Few-Shot Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

Alyosha A. Efros

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Temporal Difference Models: Model-Free Deep RL for Model-Based Control.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Regret Minimization for Partially Observable Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Recasting Gradient-Based Meta-Learning as Hierarchical Bayes.

[BibT_eX]

[DOI]

,

,

,

,

Thomas L. Griffiths

Proceedings of the 6th International Conference on Learning Representations, 2018

Divide-and-Conquer Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Aravind Rajeswaran

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Reinforcement Learning from Imperfect Demonstrations.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Learning Robust Rewards with Adverserial Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm.

[BibT_eX]

[DOI]

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning.

[BibT_eX]

[DOI]

Benjamin Eysenbach

,

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Stochastic Variational Video Prediction.

[BibT_eX]

[DOI]

Mohammad Babaeizadeh

,

,

,

Roy H. Campbell

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control.

[BibT_eX]

[DOI]

Fereshteh Sadeghi

,

Alexander Toshev

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Instance Segmentation by Interaction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Few-Shot Goal Inference for Visuomotor Learning and Planning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation.

[BibT_eX]

[DOI]

Dmitry Kalashnikov

,

,

,

,

Alexander Herzog

,

,

Deirdre Quillen

,

,

Mrinal Kalakrishnan

,

Vincent Vanhoucke

,

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping.

[BibT_eX]

[DOI]

,

,

Vincent Vanhoucke

,

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017

Unifying Map and Landmark Based Representations for Visual Navigation.

[BibT_eX]

[DOI]

,

David F. Fouhey

,

,

CoRR, 2017

Sim2Real View Invariant Visual Servoing by Recurrent Control.

[BibT_eX]

[DOI]

Fereshteh Sadeghi

,

Alexander Toshev

,

,

CoRR, 2017

Neural Network Dynamics Models for Control of Under-actuated Legged Millirobots.

[BibT_eX]

[DOI]

Anusha Nagabandi

,

,

,

,

,

Ronald S. Fearing

CoRR, 2017

Learning Robust Rewards with Adversarial Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2017

Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations.

[BibT_eX]

[DOI]

Aravind Rajeswaran

,

,

,

,

Emanuel Todorov

,

CoRR, 2017

MBMF: Model-Based Priors for Model-Free Reinforcement Learning.

[BibT_eX]

[DOI]

,

Roberto Calandra

,

,

Claire J. Tomlin

CoRR, 2017

Uncertainty-Aware Reinforcement Learning for Collision Avoidance.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2017

Unsupervised Perceptual Rewards for Imitation Learning.

[BibT_eX]

[DOI]

Pierre Sermanet

,

,

Proceedings of the Robotics: Science and Systems XIII, 2017

CAD2RL: Real Single-Image Flight Without a Single Real Image.

[BibT_eX]

[DOI]

Fereshteh Sadeghi

,

Proceedings of the Robotics: Science and Systems XIII, 2017

Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Richard E. Turner

,

Zoubin Ghahramani

,

Bernhard Schölkopf

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

EX2: Exploration with Exemplar Models for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

John D. Co-Reyes

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Collective robot reinforcement learning with distributed asynchronous guided policy search.

[BibT_eX]

[DOI]

,

,

Mrinal Kalakrishnan

,

Yevgen Chebotar

,

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Deep reinforcement learning for tensegrity robot locomotion.

[BibT_eX]

[DOI]

,

,

,

,

Massimo Vespignani

,

Vytas SunSpiral

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Learning from the hindsight plan - Episodic MPC improvement.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Combining self-supervised learning and imitation for vision-based rope manipulation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Reset-free guided policy search: Efficient deep reinforcement learning with stochastic initial states.

[BibT_eX]

[DOI]

William Montgomery

,

,

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

PLATO: Policy learning using adaptive trajectory optimization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates.

[BibT_eX]

[DOI]

,

,

Timothy P. Lillicrap

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Deep visual foresight for planning robot motion.

[BibT_eX]

[DOI]

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Learning modular neural network policies for multi-task and multi-robot transfer.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Path integral guided policy search.

[BibT_eX]

[DOI]

Yevgen Chebotar

,

Mrinal Kalakrishnan

,

,

,

,

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Reinforcement Learning with Deep Energy-Based Policies.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks.

[BibT_eX]

[DOI]

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning.

[BibT_eX]

[DOI]

Yevgen Chebotar

,

,

,

Gaurav S. Sukhatme

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

Modular Multitask Reinforcement Learning with Policy Sketches.

[BibT_eX]

[DOI]

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

EPOpt: Learning Robust Neural Network Policies Using Model Ensembles.

[BibT_eX]

[DOI]

Aravind Rajeswaran

,

Sarvjeet Ghotra

,

Balaraman Ravindran

,

Proceedings of the 5th International Conference on Learning Representations, 2017

Learning Visual Servoing with Deep Features and Fitted Q-Iteration.

[BibT_eX]

[DOI]

,

,

Proceedings of the 5th International Conference on Learning Representations, 2017

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic.

[BibT_eX]

[DOI]

,

Timothy P. Lillicrap

,

Zoubin Ghahramani

,

Richard E. Turner

,

Proceedings of the 5th International Conference on Learning Representations, 2017

Generalizing Skills with Semi-Supervised Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 5th International Conference on Learning Representations, 2017

Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 5th International Conference on Learning Representations, 2017

GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Computer Vision, 2017

Time-Contrastive Networks: Self-Supervised Learning from Multi-view Observation.

[BibT_eX]

[DOI]

Pierre Sermanet

,

,

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Cognitive Mapping and Planning for Visual Navigation.

[BibT_eX]

[DOI]

,

,

,

Rahul Sukthankar

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Robotic Manipulation of Granular Media.

[BibT_eX]

[DOI]

,

Jonathan Tompson

,

,

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

End-to-End Learning of Semantic Grasping.

[BibT_eX]

[DOI]

,

Sudheendra Vijayanarasimhan

,

,

,

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

One-Shot Visual Imitation Learning via Meta-Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Self-Supervised Visual Planning with Temporal Skip Connections.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

The Feeling of Success: Does Touch Sensing Help Predict Grasp Outcomes?

[BibT_eX]

[DOI]

Roberto Calandra

,

,

,

,

,

Edward H. Adelson

,

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Goal-driven dynamics learning via Bayesian optimization.

[BibT_eX]

[DOI]

,

Roberto Calandra

,

,

,

Claire J. Tomlin

Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

2016

End-to-End Training of Deep Visuomotor Policies.

[BibT_eX]

[DOI]

,

,

,

J. Mach. Learn. Res., 2016

Value Iteration Networks.

[BibT_eX]

[DOI]

,

,

CoRR, 2016

High-Dimensional Continuous Control Using Generalized Advantage Estimation.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

,

Proceedings of the 4th International Conference on Learning Representations, 2016

(CAD)$^2$RL: Real Single-Image Flight without a Single Real Image.

[BibT_eX]

[DOI]

Fereshteh Sadeghi

,

CoRR, 2016

Guided Policy Search as Approximate Mirror Descent.

[BibT_eX]

[DOI]

William Montgomery

,

CoRR, 2016

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection.

[BibT_eX]

[DOI]

,

,

Alex Krizhevsky

,

Deirdre Quillen

CoRR, 2016

Learning Dexterous Manipulation Policies from Experience and Imitation.

[BibT_eX]

[DOI]

,

,

Emanuel Todorov

,

CoRR, 2016

Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstration.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2016

MuProp: Unbiased Backpropagation for Stochastic Neural Networks.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 4th International Conference on Learning Representations, 2016

Deep Reinforcement Learning for Robotic Manipulation.

[BibT_eX]

[DOI]

,

,

Timothy P. Lillicrap

,

CoRR, 2016

Learning Visual Predictive Models of Physics for Playing Billiards.

[BibT_eX]

[DOI]

Katerina Fragkiadaki

,

,

,

Proceedings of the 4th International Conference on Learning Representations, 2016

A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models.

[BibT_eX]

[DOI]

,

Paul F. Christiano

,

,

CoRR, 2016

Learning to Poke by Poking: Experiential Learning of Intuitive Physics.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2016

Adapting Deep Visuomotor Representations with Weak Pairwise Constraints.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Algorithmic Foundations of Robotics XII, 2016

Value Iteration Networks.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Guided Policy Search via Approximate Mirror Descent.

[BibT_eX]

[DOI]

William H. Montgomery

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Backprop KF: Learning Discriminative Deterministic State Estimators.

[BibT_eX]

[DOI]

Tuomas Haarnoja

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Unsupervised Learning for Physical Interaction through Video Prediction.

[BibT_eX]

[DOI]

,

Ian J. Goodfellow

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning Hand-Eye Coordination for Robotic Grasping with Large-Scale Data Collection.

[BibT_eX]

[DOI]

,

,

Alex Krizhevsky

,

Deirdre Quillen

Proceedings of the International Symposium on Experimental Robotics, 2016

Learning dexterous manipulation for a soft robotic hand from human demonstrations.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

One-shot learning of manipulation skills with online dynamics adaptation and neural network priors.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Learning deep neural network policies with continuous memory states.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Model-based reinforcement learning with parametrized physical models and optimism-driven exploration.

[BibT_eX]

[DOI]

,

,

Teodor Mihai Moldovan

,

,

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Optimal control with learned local models: Application to dexterous manipulation.

[BibT_eX]

[DOI]

,

Emanuel Todorov

,

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Deep spatial autoencoders for visuomotor learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Continuous Deep Q-Learning with Model-based Acceleration.

[BibT_eX]

[DOI]

,

Timothy P. Lillicrap

,

,

Proceedings of the 33nd International Conference on Machine Learning, 2016

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization.

[BibT_eX]

[DOI]

,

,

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Policy Learning with Continuous Memory States for Partially Observed Robotic Control.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2015

Towards Adapting Deep Visuomotor Representations from Simulated to Real Environments.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2015

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models.

[BibT_eX]

[DOI]

Bradly C. Stadie

,

,

CoRR, 2015

Recurrent Network Models for Kinematic Tracking.

[BibT_eX]

[DOI]

Katerina Fragkiadaki

,

,

CoRR, 2015

Learning Visual Feature Spaces for Robotic Manipulation with Deep Spatial Autoencoders.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2015

Learning from multiple demonstrations using trajectory-aware non-rigid registration with applications to deformable object manipulation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Learning compound multi-step controllers under unknown dynamics.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Optimism-driven exploration for nonlinear systems.

[BibT_eX]

[DOI]

Teodor Mihai Moldovan

,

,

Michael I. Jordan

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Learning contact-rich manipulation skills with guided policy search.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Learning force-based manipulation of deformable objects from multiple demonstrations.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Trust Region Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

Michael I. Jordan

,

Proceedings of the 32nd International Conference on Machine Learning, 2015

Recurrent Network Models for Human Dynamics.

[BibT_eX]

[DOI]

Katerina Fragkiadaki

,

,

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Motor skill learning with local trajectory methods.

[BibT_eX]

[DOI]

PhD thesis, 2014

Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Learning Complex Neural Network Policies with Trajectory Optimization.

[BibT_eX]

[DOI]

,

Proceedings of the 31th International Conference on Machine Learning, 2014

Offline policy evaluation across representations with applications to educational games.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013

Exploring Deep and Recurrent Architectures for Optimal Control.

[BibT_eX]

[DOI]

CoRR, 2013

Variational Policy Search via Trajectory Optimization.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Guided Policy Search.

[BibT_eX]

[DOI]

,

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

Continuous character control with low-dimensional embeddings.

[BibT_eX]

[DOI]

,

,

,

,

ACM Trans. Graph., 2012

Physically Plausible Simulation for Character Animation.

[BibT_eX]

[DOI]

,

Proceedings of the 2012 Eurographics/ACM SIGGRAPH Symposium on Computer Animation, 2012

Continuous Inverse Optimal Control with Locally Optimal Examples.

[BibT_eX]

[DOI]

,

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Space-time planning with parameterized locomotion controllers.

[BibT_eX]

[DOI]

,

,

,

ACM Trans. Graph., 2011

Nonlinear Inverse Reinforcement Learning with Gaussian Processes.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

2010

Gesture controllers.

[BibT_eX]

[DOI]

,

Philipp Krähenbühl

,

Sebastian Thrun

,

ACM Trans. Graph., 2010

Feature Construction for Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

2009

Real-time prosody-driven synthesis of body language.

[BibT_eX]

[DOI]

,

Christian Theobalt

,

ACM Trans. Graph., 2009

Loading...