John Schulman

According to our database1, John Schulman authored at least 38 papers between 2011 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2018
Model-Based Reinforcement Learning via Meta-Policy Optimization.
CoRR, 2018

Gotta Learn Fast: A New Benchmark for Generalization in RL.
CoRR, 2018

On First-Order Meta-Learning Algorithms.
CoRR, 2018

Model-Based Reinforcement Learning via Meta-Policy Optimization.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017
Meta Learning Shared Hierarchies.
CoRR, 2017

Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations.
CoRR, 2017

Proximal Policy Optimization Algorithms.
CoRR, 2017

Equivalence Between Policy Gradients and Soft Q-Learning.
CoRR, 2017

Teacher-Student Curriculum Learning.
CoRR, 2017

UCB and InfoGain Exploration via $\boldsymbol{Q}$-Ensembles.
CoRR, 2017

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning.
CoRR, 2016

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks.
CoRR, 2016

RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning.
CoRR, 2016

Benchmarking Deep Reinforcement Learning for Continuous Control.
CoRR, 2016

Variational Lossy Autoencoder.
CoRR, 2016

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets.
CoRR, 2016

OpenAI Gym.
CoRR, 2016

Concrete Problems in AI Safety.
CoRR, 2016

Theano: A Python framework for fast computation of mathematical expressions.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2016

VIME: Variational Information Maximizing Exploration.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Benchmarking Deep Reinforcement Learning for Continuous Control.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation.
CoRR, 2015

Trust Region Policy Optimization.
CoRR, 2015

Gradient Estimation Using Stochastic Computation Graphs.
CoRR, 2015

Gradient Estimation Using Stochastic Computation Graphs.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Trust Region Policy Optimization.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
Motion planning with sequential convex optimization and convex collision checking.
I. J. Robotics Res., 2014

Scaling up Gaussian Belief Space Planning Through Covariance-Free Trajectory Optimization and Automatic Differentiation.
Proceedings of the Algorithmic Foundations of Robotics XI, 2014

Gaussian belief space planning with discontinuities in sensing domains.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Planning locally optimal, curvature-constrained trajectories in 3D using sequential convex optimization.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

2013
Finding Locally Optimal, Collision-Free Trajectories with Sequential Convex Optimization.
Proceedings of the Robotics: Science and Systems IX, Technische Universität Berlin, Berlin, Germany, June 24, 2013

Learning from Demonstrations Through the Use of Non-rigid Registration.
Proceedings of the Robotics Research, 2013

A case study of trajectory transfer through non-rigid registration for a simplified suturing scenario.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Sigma hulls for Gaussian belief space planning for imprecise articulated robots amid obstacles.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Tracking deformable objects with point clouds.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

2011
Grasping and Fixturing as Submodular Coverage Problems.
Proceedings of the Robotics Research, 2011


  Loading...