Andrew G. Barto

Proceedings of the 2010 IEEE 9th International Conference on Development and Learning, 2010

2009

Skill Discovery in Continuous Reinforcement Learning Domains using Skill Chaining.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Efficient Skill Learning using Abstraction Selection.

Proceedings of the IJCAI 2009, 2009

2008

Skill Characterization Based on Betweenness.

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Hierarchical Representations of Behavior for Efficient Creative Search.

Christopher M. Vigorito

Proceedings of the Creative Intelligent Systems, 2008

2007

Temporal difference learning.

Scholarpedia, 2007

Adaptive Control of Duty Cycling in Energy-Harvesting Wireless Sensor Networks.

Christopher M. Vigorito

Deepak Ganesan

Proceedings of the Fourth Annual IEEE Communications Society Conference on Sensor, 2007

Active Learning of Dynamic Bayesian Networks in Markov Decision Processes.

Proceedings of the Abstraction, 2007

Deictic Option Schemas.

Vimal Mathew

Proceedings of the IJCAI 2007, 2007

Building Portable Options: Skill Transfer in Reinforcement Learning.

Proceedings of the IJCAI 2007, 2007

Repairing Disengagement With Non-Invasive Interventions.

Proceedings of the Artificial Intelligence in Education, 2007

2006

Learning at the level of synergies for a robot weightlifter.

Richard E. A. Van Emmerik

Robotics Auton. Syst., 2006

Causal Graph Based Decomposition of Factored MDPs.

J. Mach. Learn. Res., 2006

An Adaptive Robot Motivational System.

Proceedings of the From Animals to Animats 9, 2006

Improving Intelligent Tutoring Systems: Using Expectation Maximization to Learn Student Skill Levels.

Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006

An intrinsic reward mechanism for efficient exploration.

Proceedings of the Machine Learning, 2006

Autonomous shaping: knowledge transfer in reinforcement learning.

Proceedings of the Machine Learning, 2006

Decision Tree Methods for Finding Reusable MDP Homomorphisms.

Alicia P. Wolfe

Proceedings of the Proceedings, 2006

2005

Learning Skills in Reinforcement Learning Using Relative Novelty.

Proceedings of the Abstraction, 2005

Identifying useful subgoals in reinforcement learning by local graph partitioning.

Alicia P. Wolfe

Proceedings of the Machine Learning, 2005

A causal approach to hierarchical decomposition of factored MDPs.

Proceedings of the Machine Learning, 2005

2004

Intrinsically Motivated Reinforcement Learning.

Satinder Singh

Nuttapong Chentanez

Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Using relative novelty to identify useful temporal abstractions in reinforcement learning.

Proceedings of the Machine Learning, 2004

Reinforcement learning with supervision by a stable controller.

Proceedings of the 2004 American Control Conference, 2004

2003

Recent Advances in Hierarchical Reinforcement Learning.

Sridhar Mahadevan

Discret. Event Dyn. Syst., 2003

SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes.

Proceedings of the IJCAI-03, 2003

Relativized Options: Choosing the Right Transformation.

Proceedings of the Machine Learning, 2003

2002

Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts.

Amy McGovern

J. Eliot B. Moss

Mach. Learn., 2002

Lyapunov Design for Safe Reinforcement Learning.

J. Mach. Learn. Res., 2002

The emergence of movement units through learning with noisy efferent signals and delayed sensory feedback.

Michael Kositsky

Neurocomputing, 2002

Model Minimization in Hierarchical Reinforcement Learning.

Proceedings of the Abstraction, 2002

PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning.

Marc Pickett

Proceedings of the Machine Learning, 2002

2001

The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay.

Michael Kositsky

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Robot Weightlifting By Direct Policy Search.

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Heuristic Search in Infinite State Spaces Guided by Lyapunov Analysis.

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Lyapunov-Constrained Action Sets for Reinforcement Learning.

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density.

Amy McGovern

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

2000

Automated State Abstraction for Options using the U-Tree Algorithm.

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Combining Reinforcement Learning with a Local Control Algorithm.

Jette Randløv

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Machine Learning for Subproblem Selection.

Robert Moll

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

1999

A Cerebellar Model of Timing and Prediction in the Control of Reaching.

Neural Comput., 1999

1998

Reinforcement Learning: An Introduction.

Richard S. Sutton

IEEE Trans. Neural Networks, 1998

Elevator Group Control Using Multiple Reinforcement Learning Agents.

Robert H. Crites

Mach. Learn., 1998

Learning Instance-Independent Value Functions to Enhance Local Search.

Proceedings of the Advances in Neural Information Processing Systems 11, NIPS Conference, Denver, Colorado, USA, November 30, 1998

Reinforcement learning - an introduction.

Richard S. Sutton

Adaptive computation and machine learning, MIT Press, ISBN: 978-0-262-19398-6, 1998

1997

Automated Aircraft Recovery via Reinforcement Learning: Initial Experiments.

Jeffrey F. Monaco

David G. Ward

Proceedings of the Advances in Neural Information Processing Systems 10, 1997

Cerebellar learning for control of a two-link arm in muscle space.

Proceedings of the 1997 IEEE International Conference on Robotics and Automation, 1997

A model of cerebellar learning for control of arm movements using muscle synergies.

Proceedings of the 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97, 1997

1996

Linear Least-Squares Algorithms for Temporal Difference Learning.

Steven J. Bradtke

Mach. Learn., 1996

Text-Based Information Retrieval Using Exponentiated Gradient Descent.

Ron Papka

James P. Callan

Proceedings of the Advances in Neural Information Processing Systems 9, 1996

Reinforcement Learning for Mixed Open-loop and Closed-loop Control.

Eric A. Hansen

Shlomo Zilberstein

Proceedings of the Advances in Neural Information Processing Systems 9, 1996

Local Bandit Approximation for Optimal Learning Problems.

Michael O. Duff

Proceedings of the Advances in Neural Information Processing Systems 9, 1996

1995

Learning to Act Using Real-Time Dynamic Programming.

Steven J. Bradtke

Satinder P. Singh

Artif. Intell., 1995

Improving Elevator Performance Using Reinforcement Learning.

Robert H. Crites

Proceedings of the Advances in Neural Information Processing Systems 8, 1995

A Predictive Switching Model of Cerebellar Movement Control.

James C. Houk

Proceedings of the Advances in Neural Information Processing Systems 8, 1995

1994

An Actor/Critic Algorithm that is Equivalent to Q-Learning.

Robert H. Crites

Proceedings of the Advances in Neural Information Processing Systems 7, 1994

Learning Admittance Mappings for Force-Guided Assembly.

Vijaykumar Gullapalli

Roderic A. Grupen

Proceedings of the 1994 International Conference on Robotics and Automation, 1994

1993

Task Decomposition Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks.

Robert A. Jacobs

Michael I. Jordan

Proceedings of the Machine Learning: From Theory to Applications, 1993

Robust Reinforcement Learning in Motion Planning.

Satinder Singh

Roderic A. Grupen

Christopher I. Connolly

Proceedings of the Advances in Neural Information Processing Systems 6, 1993

Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms.

Vijaykumar Gullapalli

Proceedings of the Advances in Neural Information Processing Systems 6, 1993

Monte Carlo Matrix Inversion and Reinforcement Learning.

Michael O. Duff

Proceedings of the Advances in Neural Information Processing Systems 6, 1993

1992

Learning reactive admittance control.

Vijaykumar Gullapalli

Roderic A. Grupen

Proceedings of the 1992 IEEE International Conference on Robotics and Automation, 1992

1991

Task Decomposition Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks.

Robert A. Jacobs

Michael I. Jordan

Cogn. Sci., 1991

Linear systems analysis of the relationship between firing of deep cerebellar neurons and the classically conditioned nictitating membrane response in rabbits.

N. E. Berthier

J. W. Moore

Biol. Cybern., 1991

A Cortico-Cerebellar Model that Learns to Generate Distributed Motor Commands to Control a Kinematic Arm.

N. E. Berthier

Satinder P. Singh