Leslie Pack Kaelbling

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Holonomic planar motion from non-holonomic driving mechanisms: the front-point method.

[BibT_eX]

[DOI]

Selim Temizer

Proceedings of the Mobile Robots XVI, Boston, 2001

Reinforcement learning for robot control.

[BibT_eX]

[DOI]

William D. Smart

Proceedings of the Mobile Robots XVI, Boston, 2001

Approaches to macro decompositions of large Markov decision process planning problems.

[BibT_eX]

[DOI]

Terran Lane

Proceedings of the Mobile Robots XVI, Boston, 2001

2000

Learning to Cooperate via Policy Search.

[BibT_eX]

[DOI]

Leonid Peshkin

Kee-Eung Kim

Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Adaptive Importance Sampling for Estimation in Structured Domains.

[BibT_eX]

[DOI]

Luis E. Ortiz

Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Practical Reinforcement Learning in Continuous Spaces.

[BibT_eX]

William D. Smart

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

State-based Classification of Finger Gestures from Electromyographic Signals.

[BibT_eX]

Peter Ju

Yoram Singer

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Sampling Methods for Action Selection in Influence Diagrams.

[BibT_eX]

[DOI]

Luis E. Ortiz

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

1999

Accelerating EM: An Empirical Study.

[BibT_eX]

[DOI]

Luis E. Ortiz

Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Learning Finite-State Controllers for Partially Observable Environments.

[BibT_eX]

[DOI]

Leonid Peshkin

Kee-Eung Kim

Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Solving POMDPs by Searching the Space of Finite Policies.

[BibT_eX]

[DOI]

Kee-Eung Kim

Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs.

[BibT_eX]

[DOI]

Andrew W. Moore

Leemon C. Baird III

Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Learning Policies with External Memory.

[BibT_eX]

Leonid Peshkin

Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

1998

Planning and Acting in Partially Observable Stochastic Domains.

[BibT_eX]

[DOI]

Artif. Intell., 1998

Ecological Robotics.

[BibT_eX]

[DOI]

Andrew P. Duchon

William H. Warren

Adapt. Behav., 1998

Hierarchical Solution of Markov Decision Processes using Macro-actions.

[BibT_eX]

[DOI]

Milos Hauskrecht

Craig Boutilier

Proceedings of the UAI '98: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, 1998

Heading in the Right Direction.

[BibT_eX]

Hagit Shatkay

Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998

A Framework for Reinforcement Learning on Real Robots.

[BibT_eX]

[DOI]

William D. Smart

Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

Solving Very Large Weakly Coupled Markov Decision Processes.

[BibT_eX]

[DOI]

Craig Boutilier

Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1997

Learning Topological Maps with Weak Local Odometric Information.

[BibT_eX]

[DOI]

Hagit Shatkay

Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

1996

Introduction.

[BibT_eX]

[DOI]

Mach. Learn., 1996

Reinforcement Learning: A Survey.

[BibT_eX]

[DOI]

Andrew W. Moore

J. Artif. Intell. Res., 1996

The National Science Foundation Workshop on Reinforcement Learning.

[BibT_eX]

[DOI]

Sridhar Mahadevan

AI Mag., 1996

On reinforcement learning for robots.

[BibT_eX]

[DOI]

Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS 1996, 1996

Acting under uncertainty: discrete Bayesian models for mobile-robot navigation.

[BibT_eX]

[DOI]

James Kurien

Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS 1996, 1996

1995

A Situated View of Representation and Control.

[BibT_eX]

[DOI]

Stanley J. Rosenschein

Artif. Intell., 1995

Planning under Time Constraints in Stochastic Domains.

[BibT_eX]

[DOI]

Jak Kirman

Ann E. Nicholson

Artif. Intell., 1995

Learning Dynamics: System Identification for Perceptually Challenged Agents.

[BibT_eX]

[DOI]

Kenneth Basye

Artif. Intell., 1995

On the Complexity of Solving Markov Decision Problems.

[BibT_eX]

[DOI]

Proceedings of the UAI '95: Proceedings of the Eleventh Annual Conference on Uncertainty in Artificial Intelligence, 1995

Partially Observable Markov Decision Processes for Artificial Intelligence.

[BibT_eX]

[DOI]

Proceedings of the KI-95: Advances in Artificial Intelligence, 1995

Learning Policies for Partially Observable Environments: Scaling Up.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 1995

1994

Associative Reinforcement Learning: A Generate and Test Algorithm.

[BibT_eX]

[DOI]

Mach. Learn., 1994

Associative Reinforcement Learning: Functions in k-DNF.

[BibT_eX]

[DOI]

Mach. Learn., 1994

Learning and intelligent Agents.

[BibT_eX]

Proceedings of the Eleventh European Conference on Artificial Intelligence, 1994

Acting Optimally in Partially Observable Stochastic Domains.

[BibT_eX]

[DOI]

Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31, 1994

1993

Deliberation Scheduling for Time-Critical Sequential Decision Making.

[BibT_eX]

[DOI]

Jak Kirman

Ann E. Nicholson

Proceedings of the UAI '93: Proceedings of the Ninth Annual Conference on Uncertainty in Artificial Intelligence, 1993

Learning to Achieve Goals.

[BibT_eX]

Proceedings of the 13th International Joint Conference on Artificial Intelligence. Chambéry, France, August 28, 1993

Hierarchical Learning in Stochastic Domains: Preliminary Results.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 1993

Planning With Deadlines in Stochastic Domains.

[BibT_eX]

[DOI]

Jak Kirman

Ann E. Nicholson

Proceedings of the 11th National Conference on Artificial Intelligence. Washington, 1993

Learning in embedded systems.

[BibT_eX]

MIT Press, ISBN: 978-0-262-11174-4, 1993

1992

Inferring Finite Automata with Stochastic Output Functions and an Application to Map Learning.

[BibT_eX]

[DOI]

Evangelos Kokkevis

Oded Maron

Proceedings of the 10th National Conference on Artificial Intelligence, 1992

1991

A Situated-Automata Approach to the Design of Embedded Agents.

[BibT_eX]

[DOI]

SIGART Bull., 1991

Foundations of learning in autonomous agents.

[BibT_eX]

[DOI]

Robotics Auton. Syst., 1991

Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons.

[BibT_eX]

[DOI]

David Chapman

Proceedings of the 12th International Joint Conference on Artificial Intelligence. Sydney, 1991

1990

Learning in embedded systems.

[BibT_eX]

[DOI]

PhD thesis, 1990

Action and planning in embedded agents.

[BibT_eX]

[DOI]

Stanley J. Rosenschein

Robotics Auton. Syst., 1990

Learning Functions in k-DNF from Reinforcement.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 1990

1989

Intelligent Robots in the Real World.

[BibT_eX]

Proceedings of the Information Processing 89, Proceedings of the IFIP 11th World Computer Congress, San Francisco, USA, August 28, 1989

A Formal Framework for Learning in Embedded Systems.

[BibT_eX]

Proceedings of the Sixth International Workshop on Machine Learning (ML 1989), 1989

1988

Artificial Intelligence and Robotics.

[BibT_eX]

[DOI]

Proceedings of the COMPCON'88, Digest of Papers, Thirty-Third IEEE Computer Society International Conference, San Francisco, California, USA, February 29, 1988

Goals as Parallel Program Specifications.

[BibT_eX]

[DOI]

Proceedings of the 7th National Conference on Artificial Intelligence, 1988

1986

The Synthesis of Digital Machines With Provable Epistemic Properties.

[BibT_eX]

Stanley J. Rosenschein