Least-squares policy iteration MG Lagoudakis, R Parr The Journal of Machine Learning Research 4, 1107-1149, 2003 | 1726 | 2003 |
Reinforcement learning with hierarchies of machines R Parr, S Russell Advances in neural information processing systems 10, 1997 | 1125 | 1997 |
Efficient solution algorithms for factored MDPs C Guestrin, D Koller, R Parr, S Venkataraman Journal of Artificial Intelligence Research 19, 399-468, 2003 | 696 | 2003 |
Multiagent planning with factored MDPs C Guestrin, D Koller, R Parr Advances in neural information processing systems 14, 2001 | 659 | 2001 |
Coordinated reinforcement learning C Guestrin, M Lagoudakis, R Parr ICML 2, 227-234, 2002 | 543 | 2002 |
DP-SLAM: Fast, robust simultaneous localization and mapping without predetermined landmarks A Eliazar, R Parr IJCAI 3, 1135-1142, 2003 | 510 | 2003 |
Making rational decisions using adaptive utility elicitation U Chajewska, D Koller, R Parr Aaai/Iaai, 363-369, 2000 | 413 | 2000 |
Hierarchical control and learning for Markov decision processes RE Parr University of California, Berkeley, 1998 | 405 | 1998 |
Bayesian fault detection and diagnosis in dynamic systems U Lerner, R Parr, D Koller, G Biswas Aaai/iaai, 531-537, 2000 | 362 | 2000 |
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning R Parr, L Li, G Taylor, C Painter-Wakefield, ML Littman Proceedings of the 25th international conference on Machine learning, 752-759, 2008 | 268 | 2008 |
Approximating optimal policies for partially observable stochastic domains R Parr, S Russell IJCAI 95, 1088-1094, 1995 | 251 | 1995 |
DP-SLAM 2.0 AI Eliazar, R Parr IEEE International Conference on Robotics and Automation, 2004. Proceedings …, 2004 | 247 | 2004 |
Complexity of computing optimal stackelberg strategies in security resource allocation games D Korzhyk, V Conitzer, R Parr Proceedings of the AAAI Conference on Artificial Intelligence 24 (1), 805-810, 2010 | 243 | 2010 |
Policy iteration for factored MDPs D Koller, R Parr arXiv preprint arXiv:1301.3869, 2013 | 221 | 2013 |
Reinforcement learning as classification: Leveraging modern classifiers MG Lagoudakis, R Parr Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003 | 221 | 2003 |
Computing factored value functions for policies in structured MDPs D Koller, R Parr IJCAI 99, 1332-1339, 1999 | 217 | 1999 |
Analyzing feature generation for value-function approximation R Parr, C Painter-Wakefield, L Li, M Littman Proceedings of the 24th international conference on Machine learning, 737-744, 2007 | 191 | 2007 |
Max-norm projections for factored MDPs C Guestrin, D Koller, R Parr IJCAI 1, 673-682, 2001 | 163 | 2001 |
Inference in hybrid networks: Theoretical limits and practical algorithms U Lerner, R Parr arXiv preprint arXiv:1301.2288, 2013 | 149 | 2013 |
Flexible decomposition algorithms for weakly coupled Markov decision problems R Parr arXiv preprint arXiv:1301.7405, 2013 | 145 | 2013 |