Shixiang Gu

According to our database, Shixiang Gu authored at least 19 papers between 2012 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives.
Proceedings of the 7th International Conference on Learning Representations, 2019

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Data-Efficient Hierarchical Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

Temporal Difference Models: Model-Free Deep RL for Model-Based Control.
Proceedings of the 6th International Conference on Learning Representations, 2018

Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control.
Proceedings of the 34th International Conference on Machine Learning, 2017

Tuning Recurrent Neural Networks with Reinforcement Learning.
Proceedings of the 5th International Conference on Learning Representations, 2017

Categorical Reparameterization with Gumbel-Softmax.
Proceedings of the 5th International Conference on Learning Representations, 2017

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
MuProp: Unbiased Backpropagation for Stochastic Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Continuous Deep Q-Learning with Model-based Acceleration.
Proceedings of the 33rd International Conference on Machine Learning, 2016

2015
Towards Deep Neural Network Architectures Robust to Adversarial Examples.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Particle Gibbs for Infinite Hidden Markov Models.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Neural Adaptive Sequential Monte Carlo.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2012
Realtime HDR (High Dynamic Range) video for eyetap wearable computers, FPGA-based seeing aids, and glasseyes (EyeTaps).
Proceedings of the 25th IEEE Canadian Conference on Electrical and Computer Engineering, 2012

