Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms.

[BibT_eX]

[DOI]

Hiroki Furuta

Tadashi Kozuno

Tatsuya Matsushima

Yutaka Matsuo

Shixiang Shane Gu

CoRR, 2021

Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Minimalist Approach to Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Scott Fujimoto

Shixiang Shane Gu

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL.

[BibT_eX]

[DOI]

Seyed Kamyar Seyed Ghasemipour

Dale Schuurmans

Shixiang Shane Gu

Proceedings of the 38th International Conference on Machine Learning, 2021

Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XVI, 2020

Weakly-Supervised Reinforcement Learning for Controllable Behavior.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dynamics-Aware Unsupervised Discovery of Skills.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Human-centric dialog training via offline reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Sample-efficient deep reinforcement learning for continuous control.

[BibT_eX]

[DOI]

Shixiang Gu

PhD thesis, 2019

Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

[BibT_eX]

[DOI]

CoRR, 2019

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog.

[BibT_eX]

[DOI]

CoRR, 2019

Language as an Abstraction for Hierarchical Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies.

[BibT_eX]

[DOI]

Seyed Kamyar Seyed Ghasemipour

Shixiang Gu

Richard S. Zemel

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real.

[BibT_eX]

[DOI]

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

A Divergence Minimization Perspective on Imitation Learning Methods.

[BibT_eX]

[DOI]

Seyed Kamyar Seyed Ghasemipour

Richard S. Zemel

Shixiang Gu

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018

Data-Efficient Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Temporal Difference Models: Model-Free Deep RL for Model-Based Control.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control.

[BibT_eX]

[DOI]

Natasha Jaques

Shixiang Gu

Dzmitry Bahdanau

José Miguel Hernández-Lobato

Richard E. Turner

Douglas Eck

Proceedings of the 34th International Conference on Machine Learning, 2017

Tuning Recurrent Neural Networks with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Categorical Reparameterization with Gumbel-Softmax.

[BibT_eX]

[DOI]

Eric Jang

Shixiang Gu

Ben Poole

Proceedings of the 5th International Conference on Learning Representations, 2017

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

MuProp: Unbiased Backpropagation for Stochastic Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Learning Representations, 2016

Deep Reinforcement Learning for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2016

Continuous Deep Q-Learning with Model-based Acceleration.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Towards Deep Neural Network Architectures Robust to Adversarial Examples.

[BibT_eX]

[DOI]

Shixiang Gu

Luca Rigazio

Proceedings of the 3rd International Conference on Learning Representations, 2015

Particle Gibbs for Infinite Hidden Markov Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Neural Adaptive Sequential Monte Carlo.

[BibT_eX]

[DOI]

Shixiang Gu

Zoubin Ghahramani

Richard E. Turner

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2012

Realtime HDR (High Dynamic Range) video for eyetap wearable computers, FPGA-based seeing aids, and glasseyes (EyeTaps).

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE Canadian Conference on Electrical and Computer Engineering, 2012

Shixiang Gu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...