Ronald Parr

Shlomo Zilberstein

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Complexity of Computing Optimal Stackelberg Strategies in Security Resource Allocation Games.

Dmytro Korzhyk

Vincent Conitzer

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009

Multi-Step Multi-Sensor Hider-Seeker Games.

Erik Halvorson

Vincent Conitzer

Proceedings of IJCAI 2009, 2009

Kernelized value function approximation for reinforcement learning.

Gavin Taylor

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008

Planning Aims for a Network of Horizontal and Overhead Sensors.

Erik Halvorson

Algorithmic Foundation of Robotics VIII, 2008

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning.

Christopher Painter-Wakefield

Lihong Li

Gavin Taylor

Michael L. Littman

Proceedings of the 25th International Conference on Machine Learning (ICML 2008), 2008

2007

Nonmyopic Multiaspect Sensing With Partially Observable Markov Decision Processes.

Shihao Ji

Lawrence Carin

IEEE Trans. Signal Process., 2007

Analyzing feature generation for value-function approximation.

Christopher Painter-Wakefield

Lihong Li

Michael L. Littman

Proceedings of the 24th International Conference on Machine Learning (ICML 2007), 2007

Point-Based Policy Iteration.

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

Efficient Selection of Disambiguating Actions for Stereo Vision.

Monika Schaeffer

Proceedings of UAI '06, 2006

2005

Hierarchical Linear/Constant Time SLAM Using Particle Filters for Dense Maps.

Advances in Neural Information Processing Systems 18, 2005

2004

DP-SLAM 2.0.

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Learning probabilistic motion models for mobile robots.

Proceedings of the 21st International Conference on Machine Learning (ICML 2004), 2004

2003

Least-Squares Policy Iteration.

J. Mach. Learn. Res., 2003

Efficient Solution Algorithms for Factored MDPs.

J. Artif. Intell. Res., 2003

Approximate Policy Iteration using Large-Margin Classifiers.

Proceedings of IJCAI-03, 2003

DP-SLAM: Fast, Robust Simultaneous Localization and Mapping Without Predetermined Landmarks.

Proceedings of IJCAI-03, 2003

Reinforcement Learning as Classification: Leveraging Modern Classifiers.

Proceedings of the 20th International Conference on Machine Learning (ICML 2003), 2003

2002

XPathLearner: An On-line Self-Tuning Markov Histogram for XML Path Selectivity Estimation.

Proceedings of the 28th International Conference on Very Large Data Bases, 2002

Value Function Approximation in Zero-Sum Markov Games.

Proceedings of UAI '02, 2002

Least-Squares Methods in Reinforcement Learning for Control.

Michael L. Littman

Methods and Applications of Artificial Intelligence, 2002

Learning in Zero-Sum Team Markov Games Using Factored Value Functions.

Advances in Neural Information Processing Systems 15, 2002

Coordinated Reinforcement Learning.

Carlos Guestrin

Proceedings of the 19th International Conference on Machine Learning (ICML 2002), 2002

2001

Inference in Hybrid Networks: Theoretical Limits and Practical Algorithms.

Uri Lerner

UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

Model-Free Least-Squares Policy Iteration.

Advances in Neural Information Processing Systems 14, 2001

Multiagent Planning with Factored MDPs.

Carlos Guestrin

Advances in Neural Information Processing Systems 14, 2001

Max-norm Projections for Factored MDPs.

Carlos Guestrin

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

2000

Policy Iteration for Factored MDPs.

UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Bayesian Fault Detection and Diagnosis in Dynamic Systems.

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

Making Rational Decisions Using Adaptive Utility Elicitation.

Urszula Chajewska

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

1999

Reinforcement Learning Using Approximate Belief States.

Andres C. Rodriguez