# Ronald Parr

According to our database, Ronald Parr authored at least 57 papers between 1993 and 2016.

## Bibliography

2016

Improving PAC Exploration Using the Median Of Means.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Efficient PAC-Optimal Exploration in Concurrent, Continuous State MDPs with Delayed Updates.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Distance Minimization for Reward Learning from Scored Trajectories.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014

Unsupervised discovery of object classes with a mobile robot.

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

2013

Sample Complexity and Performance Bounds for Non-Parametric Approximate Linear Programming.

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

PAC Optimal Exploration in Continuous Space Markov Decision Processes.

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012

Computing Stackelberg strategies in stochastic games.

SIGecom Exchanges, 2012

Value Function Approximation in Noisy Environments Using Locally Smoothed Regularized Approximate Linear Programs.

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Object disappearance for object discovery.

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Greedy Algorithms for Sparse Reinforcement Learning.

Proceedings of the 29th International Conference on Machine Learning, 2012

Computing Optimal Strategies to Commit to in Stochastic Games.

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

Security Games with Multiple Attacker Resources.

Proceedings of the IJCAI 2011, 2011

Textured occupancy grids for monocular localization without features.

Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Generalized Value Functions for Large Action Sets.

Proceedings of the 28th International Conference on Machine Learning, 2011

Solving Stackelberg games with uncertain observability.

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Non-Parametric Approximate Linear Programming for MDPs.

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010

Counting Objects with a Combination of Horizontal and Overhead Sensors.

I. J. Robotics Res., 2010

Linear Complementarity for Regularized Policy Evaluation and Improvement.

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes.

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Complexity of Computing Optimal Stackelberg Strategies in Security Resource Allocation Games.

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009

Multi-Step Multi-Sensor Hider-Seeker Games.

Proceedings of the IJCAI 2009, 2009

Kernelized value function approximation for reinforcement learning.

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008

Planning Aims for a Network of Horizontal and Overhead Sensors.

Proceedings of the Algorithmic Foundation of Robotics VIII, 2008

Planning Aims for a Network of Horizontal and Overhead Sensors.

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning.

Proceedings of the 25th International Conference on Machine Learning, 2008

2007

Nonmyopic Multiaspect Sensing With Partially Observable Markov Decision Processes.

IEEE Trans. Signal Processing, 2007

Analyzing feature generation for value-function approximation.

Proceedings of the 24th International Conference on Machine Learning, 2007

Point-Based Policy Iteration.

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

Efficient Selection of Disambiguating Actions for Stereo Vision.

Proceedings of the UAI '06, 2006

2005

Hierarchical Linear/Constant Time SLAM Using Particle Filters for Dense Maps.

Proceedings of the Advances in Neural Information Processing Systems 18, 2005

2004

DP-SLAM 2.0.

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Learning probabilistic motion models for mobile robots.

Proceedings of the 21st International Conference on Machine Learning, 2004

2003

Least-Squares Policy Iteration.

Journal of Machine Learning Research, 2003

Efficient Solution Algorithms for Factored MDPs.

J. Artif. Intell. Res., 2003

Approximate Policy Iteration using Large-Margin Classifiers.

Proceedings of the IJCAI-03, 2003

DP-SLAM: Fast, Robust Simultaneous Localization and Mapping Without Predetermined Landmarks.

Proceedings of the IJCAI-03, 2003

Reinforcement Learning as Classification: Leveraging Modern Classifiers.

Proceedings of the 20th International Conference on Machine Learning, 2003

2002

XPathLearner: An On-line Self-Tuning Markov Histogram for XML Path Selectivity Estimation.

Proceedings of the VLDB 2002, 2002

Value Function Approximation in Zero-Sum Markov Games.

Proceedings of the UAI '02, 2002

Least-Squares Methods in Reinforcement Learning for Control.

Proceedings of the Methods and Applications of Artificial Intelligence, 2002

Learning in Zero-Sum Team Markov Games Using Factored Value Functions.

Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Coordinated Reinforcement Learning.

Proceedings of the 19th International Conference on Machine Learning, 2002

2001

Inference in Hybrid Networks: Theoretical Limits and Practical Algorithms.

Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

Model-Free Least-Squares Policy Iteration.

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic], 2001

Multiagent Planning with Factored MDPs.

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic], 2001

Max-norm Projections for Factored MDPs.

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

2000

Policy Iteration for Factored MDPs.

Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Bayesian Fault Detection and Diagnosis in Dynamic Systems.

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

Making Rational Decisions Using Adaptive Utility Elicitation.

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

1999

Reinforcement Learning Using Approximate Belief States.

Proceedings of the Advances in Neural Information Processing Systems 12 [NIPS Conference, Denver, Colorado, USA, November 29, 1999]

Policy Search via Density Estimation.

Proceedings of the Advances in Neural Information Processing Systems 12 [NIPS Conference, Denver, Colorado, USA, November 29, 1999]

Computing Factored Value Functions for Policies in Structured MDPs.

Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

1998

Flexible Decomposition Algorithms for Weakly Coupled Markov Decision Problems.

Proceedings of the UAI '98: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, 1998

1997

Reinforcement Learning with Hierarchies of Machines.

Proceedings of the Advances in Neural Information Processing Systems 10, 1997

Generalized Prioritized Sweeping.

Proceedings of the Advances in Neural Information Processing Systems 10, 1997

1995

Approximating Optimal Policies for Partially Observable Stochastic Domains.

Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

1993

Provably Bounded Optimal Agents.

Proceedings of the 13th International Joint Conference on Artificial Intelligence. Chambéry, France, August 28, 1993