Ronald Parr
Research Interests
Methods for solving large stochastic planning
problems (MDPs) using approximate dynamic programming techniques
(reinforcement learning, value function approximation). I also work in
game theory, sensing, and robotics.
Education
- Ph.D. in Computer Science, University of California, Berkeley,
1998. Thesis title: Hierarchical Control and Learning in
Markov Decision Processes (Advisor: Stuart Russell).
- A.B. in Philosophy cum laude, Princeton University, 1990.
Awards and Honors
- Elected AAAI Fellow, 2023
- AAAI Oustanding Paper Honorable Mention, 2013
- IJCAI-JAIR Best Paper Award, 2007
- DARPA CSSG participant, 2006
- NSF CAREER award, 2006
- Alfred P. Sloan Fellow, 2003
Employment History
- Professor, Department of Computer Science, Duke University, 2017-
- Professor and Department Chair, Department of Computer Science, Duke University, 2014-2017.
- Professor, Department of Computer Science, Duke University, 2013-2014.
- Associate Professor (with tenure), Department of Computer Science, Duke University, 2007-present.
- Assistant Professor, Department of Computer Science, Duke University, 2000-2007.
-
Postdoctoral Research Associate, Robotics Laboratory, Stanford University, 1998-2000.
Worked with Daphne Koller on research, student advising and research
grant-related responsibilities.
- Research Assistant/Independent Research, Department of Electrical
Engineering and Computer Science, U.C. Berkeley, 1991-1998.
Advisor: Stuart Russell.
- Graduate Student Instructor: Formal Languages and Automata Theory,
Department of Electrical Engineering and Computer Science, U.C. Berkeley,
1990-1991.
Professional Service
- N(eur)IPS reviewer 1999, 2001, 2002, 2003, 2004, 2005, 2008, 2010-2013,
2015, 2017, 2018, (area chair) 2019, 2020, 2021, 2022
- NSF Panelist 2016, 2018, 2019, 2021, 2022
- ICML Program Committee 2000, 2001, 2003-2006, 2008, 2012, 2013 (Area Chair) 2019-2021
- AAAI Program Committee 1998, 2000, 2002, 2004, 2005, 2008 (Senior), 2014 (senior), 2017, (area chair) 2019-2021
- Journal of Machine Learning Research (JMLR) Action Editor 2009-2012
- UAI 2000-2001 (Program Committee), 2007 (Program Co-Chair), 2008 (General Chair), 2002-2006, 2012-2013 (Area Chair)
- IJCAI 2007 (Program Committee), 2005, 2010 (Senior Program Committee)
- ISAIM Program Committee 2008, 2010
- Machine Learning (MLJ) Editorial Board 2006-2009
- RSS Area Chair 2006
- IJCAI Reviewer 1999, 2001, 2003
- AI & Math Program Committee 2004, 2006
- Symposium on Artificial Intelligence and Mathematics Program Committee
2003, 2005
- GAFOS program committee 2004
- ICAPS Program Committee 2003
- Journal of AI Research (JAIR) editorial board member 2000-2002
- NASA Intelligent Systems Program reviewer 2000
- Organizer of NIPS 98 workshop on Abstraction and Hierarchy in
Reinforcement Learning
Department Service
- Senior AI Recruiting committing 2022
- Executive committee: 2007-
- Department chair: 2014-2017
- Graduate admissions committee: 2001-2002, 2007, 2012-2013 (Chair), 2017-2021 (chair)
- Communications committee: 2004-2011, 2013 (Chair)
- Lab (Infrastructure) committee: 2004-2012, 2013 (gap in 2012-2013
academic year), 2020 (chair)
- Faculty search committee: 2001-2003, 2005, 2006, 2008, 2010-2011
(Chair), 2017, 2018, 2021
- Strategic planning committee: 2002
University Service
- Curriculum Committee 2022-2023
- DST Seed Grant reviewer 2022
- QI Search Committee 2016-2017
Classes Taught
- CompSci 2/570 - Artificial Intelligence (graduate) (Fall 2001, 2002,
2004, 2006, 2010, 2012, 2013, 2015, 2016), 2018-2020, 2022
- CompSCi 1/2/370 - Artificial Intelligence (undergraduate) (Spring
2002-2004, 2006, 2010-2012, 2018, 2021, 2022
- CompSci 590.2 - Reinforcement Learning (Fall 2017, 2021)
- CompSci 271 - Machine Learning (Fall 2003, 2005, 2007, 2009, 2011)
- CompSci 1/296 - Robotics (mixed undergraduate and graduate) (Spring 2007)
- CompSci 271 - Numeric Artificial Inteligence (Spring 2001)
- CompSci 370 - Planning Under Uncertainty (Fall 2000)
Presentations
- Invited speaker at RLDM 2017 workshop.
- Invited speaker at NIPS 2013 workshop on Advances in Machine Learning for Sensorimotor Control.
- Invited seminar at Wake Forest University, 2013.
- MIT LIDS Seminar, 2012.
- U.T. Austin Forum for Artificial Intelligence, 2011.
- Google Distinguished Lecturer, CMU, 2010.
- ARTSI alliance Faculty Workshop presenter, 2010.
- Presenter, 2008 ICML.
- SAMSI Workshop Tutorial on Markov Decision Processes and Reinforcement
Learning, 2007.
- Duke MURI Workshop on Adaptive Multi-Sensor Sensing and Waveform
Scheduling, 2006.
- NSF Approximate Dynamic Programming Workshop, technical talk and
tutorial, 2006.
- Invited talk at ICML Hierarchical Reinforcement Learning Workshop 2005
- Seminars at Princeton, Univ. of Massachusetts, Middlebury, Stony Brook
2005.
- Invited panelist on AFOSR funded workshop on Decision Making in
Adversarial Domains
- Seminars at Univ. of Alberta, Carnegie Mellon and NASA Ames 2004.
- Invited participant in GAFOS workshop 2004.
- Daghstuhl 2003 invited tutorial and research talk
- LICS 2003 Workshop on Probability and AI invited talk
- Daghstuhl 2001 invited tutorial and research talk
- Poster Spotlight, 2001 NIPS.
- Duke ISDS Colloquium 2001.
- Presenter, 2000 UAI.
- Presenter, 1999 IJCAI.
- Invited talks at NASA and SRI, 1999.
- Plenary Presentation, 1998 UAI conference.
- Presenter, 1998 SARA Symposium.
- Full Oral presentation, 1997 NIPS conference.
- Presenter, 1997 ICML Workshop on Reinforcement Learning.
- Presenter, 1997 Joint Brazil, US Workshop on Intelligent Robotic
Agents.
- Presenter, 1996 AAAI Fall Symposium.
- Presenter, 1996 NSF Reinforcement Learning Workshop.
- Presenter, 1995 IJCAI.
- Invited participant, 1995 Joint AI/OR Workshop.
- Guest lectured for Computer Science and Cognitive Science classes
at Berkeley and Stanford.
Journal Papers
-
Counting Objects with a Combination of Horizontal and Overhead
Sensors, Erik Halvorson and Ronald Parr, The International
Journal of Robotics Research, June 2010, 29(7), pp. 840-854.
-
Non-Myopic Multi-Aspect Sensing with
Partially Observable Markov Decision Processes, Shihao Ji, Ronald Parr,
and Lawrence Carin, IEEE
Transactions on Signal Processing, June 2007
Volume 55, Issue: 6, Part 1, 2007, pp. 2720-2730.
-
Least-Squares Policy Iteration, Michail Lagoudakis and Ronald
Parr, Journal of Machine Learning Research (JMLR), Vol. 4,
2003, pp. 1107-1149.
-
Efficient Solution Algorithms for Factored MDPs,
Carlos Guestrin, Daphne Koller,
Ronald Parr and Shobha Venkataraman, Journal of Artificial Intelligence
Research (JAIR), Vol. 19, 2003, pp. 399-468. [Recipient of IJCAI-JAIR
best paper awards for 2007.]
Books or Edited Volumes
-
Proceedings of the 23rd Conference on Uncertainty in Artificial
Intelligence, Ronald Parr and Linda van der Gaag, eds., AUAI Press, 2007.
Highly Refereed Papers (also includes WAFR, ICRA, and IROS)
- On the Existence of Simpler Machine Learning Models
ACM Conference on Fairness, Accountability, and Transparency (ACM
FAccT), 2022, Lesia Semenova, Cynthia Rudin, and Ronald Parr. [25%
acceptance rate]
- Policy
Caches with Successor Features, Mark Nemecek and Ronald Parr,
Proceedings of the 38th International Conference on Machine Learning
(ICML-21). [21% acceptance rate]
-
Deep Radial-Basis Value Functions for Continuous Control, Kavosh Asadi, Neev Parikh, Ronald Parr, George Konidaris, and Michael Littman, Proceedings of the Thirty Fifth AAAI Conference on Artificial Intelligence (AAAI-21). [21% acceptance rate]
-
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective, Zhao Song, Ronald Parr, and Lawrence Carin, Proceedings of the Thirty-sixth International Conference on Machine Learning (ICML 2019). [23% acceptance rate]
-
Improving PAC Exploration Using the Median Of Means, Jason Pazis, Ronald Parr, and Jonathan How, Neural Information Processing Systems 2016 (NIPS 2016). [23% acceptance rate]
-
Linear Feature Encoding for Reinforcement Learning, Zhao Song, Ronald Parr, Xuejun Liao, and Lawrence Carin, Neural Information Processing Systems 2016 (NIPS 2016). [23% acceptance rate]
-
Efficient PAC-optimal Exploration in Concurrent, Continuous State MDPs with Delayed Updates, Jason Pazis and Ronald Parr, Proceedings of the Thirtieth AAAI Conference (AAAI 2016). [26% acceptance rate]
-
Distance Minimization for Reward Learning from Scored Trajectories, Benjamin Burchfiel, Carlo Tomasi, and Ronald Parr, Proceedings of the Thirtieth AAAI Conference (AAAI 2016). [26% acceptance rate]
-
Unsupervised Discovery of Object Classes with
a Mobile Robot, Julian Mason, Bhaskara Marthi, and Ronald Parr,
International Conference on Robotics and Automation (ICRA 2014). [48% acceptance rate]
-
Sample Complexity and Performance Bounds for
Non-parametric Approximate Linear Programming, Jason
Pazis and Ronald Parr, Proceedings of the Twenty Seventh
Association for Advancement of Artificial Intelligence
Conference (AAAI 2013). [29% acceptance rate]
-
PAC Optimal Exploration in Continuous Space
Markov Decision Processes, Jason
Pazis and Ronald Parr, Proceedings of the Twenty Seventh
Association for Advancement of Artificial Intelligence
Conference (AAAI 2013). Outstanding Paper Honorable
Mention [29% acceptance rate]
-
Object Disappearance for Object Discovery, Julian
Mason, Bhaskara Marthi, and Ronald Parr, IEEE/RSJ
International Conference on Intelligent Robots and Systems
(IROS-2012). [45% acceptance rate]
-
Value Function Approximation in Noisy Environments
Using Locally Smoothed Regularized Approximate Linear Programs, Gavin
Taylor and Ronald Parr, Proceedings of the Twenty-Eighth
International Conference on Uncertainty in Artificial
Intelligence (UAI-2012). [31% acceptance rate, selected
for oral presentation; only 25% of accepted papers are selected
for oral presentation]
-
Greedy Algorithms for Sparse Reinforcement
Learning, Christopher Painter-Wakefield and Ronald
Parr, Proceedings of the Twenty-Ningth International
Conference on Machine Learning (ICML-2012), pp. 1391-1398.
[27% acceptance rate]
-
Computing Optimal Strategies to Commit to in
Stochastic Games, Joshua Letchford, Liam MacDermed,
Vincent Conitzer, Ronald Parr, and Charles Isbell,
Proceedings of the Twenty-Sixth AAAI Conference on
Artificial Intelligence (AAAI-2012). [26% acceptance rate]
-
Generalized Value Functions for Large Action Sets, Jason Pazis
and Ronald Parr, Proceedings of the Twenty-Eighth International
Conference on Machine Learning (ICML-2011), pp. 1185-1192. [26% acceptance rate]
-
Non-parametric Approximate Linear Programming for MDPs, Jason
Pazis and Ronald Parr, Proceedings of the Twenty-Fifth AAAI conference
on Artificial Intelligence (AAAI-2011). [25% acceptance rate]
-
Security Games with Multiple Attacker Resources, Dmytro
Korzhyk, Vincent Conitzer, Ronald Parr, Proceedings of the Twenty-second International
Joint Conference on Artificial Intelligence (IJCAI-2011), pp 273-279. [19% acceptance
rate]
-
Textured
Occupancy Grids for Monocular Localization Without Features,
Julian Mason, Susanna Ricco, Ronald Parr, 2011 IEEE International
Conference on Robotics and Automation (ICRA 2011), pp. 5800-5806. [49% acceptance rate]
- Solving Stackelberg Games with Uncertain
Observability, Dmytro Korzhyk, Vincent Conitzer, Ronald Parr, The
Tenth International Conference on Autonomous Agents and Multiagent
Systems (AAMAS 2011), pp. 1013-1020. [22% acceptance rate]
- Linear Complementarity for Regularized Policy Evaluation and
Improvement, Jeff Johns, Christopher Painter-Wakefield, Ronald Parr,
Advances in Neural Information Processing Systems 23, 2010, pp 1009-101. [24% acceptance rate; less than 2% of papers were
selected for an oral presentation, of which this was one]
- Feature Selection Using Regularization in Approximate
Linear Programs for Markov Decision Processes, Marek Petrik, Gavin
Taylor, Ronald Parr, and Shlomo Zilberstein, International
Conference on Machine Learning (ICML-2010), pp. 871-878. [26% acceptance rate]
- Complexity of Computing Optimal Stackelberg
Strategies in Security Resource Allocation Games, Dmytro Korzhyk,
Vincent Conitzer, and Ronald Parr, Proceedings of the
24th National Conference on Artificial Intelligence (AAAI-10) [27%
acceptance rate].
- Kernelized Value Function Approximation for
Reinforcement Learning, Gavin Taylor and Ronald Parr, International
Conference on Machine Learning (ICML-2009), pp. 1017-1024. [27% acceptance rate].
-
Multi-step Multi-sensor Hider-seeker Games, Erik Halvorson, Vincent
Conitzer, and Ronald Parr, Proceedings of the Twenty-First International
Joint Conference on Artificial Intelligence (IJCAI-2009), pp. 159-166. [26% acceptance
rate]
-
Planning Aims for a Network of Horizontal and Overhead
Sensors, Erik Halvorson and Ronald Parr, Workshop on the Algorithmic
Foundations of Robotics 2008 (WAFR-2008), pp. 19-34. [65% acceptance rate]
-
An Analysis of Linear Models, Linear Value-Function
Approximation, and Feature Selection for Reinforcement Learning, Ronald
Parr, Lihong Li, Gavin Taylor, Christopher Painter-Wakefield, and Michael
L. Littman, International Conference on Machine Learning (ICML-2008), pp. 752-759.
[27% acceptance rate] Please note this addendum: http://www.cs.duke.edu/~parr/icml08-addendum.html
-
Point-Based Policy Iteration, Shihao Ji,
Ronald Parr, Hui Li, Xuejun Liao, and Lawrence Carin, Proceedings of the
Twenty-Second National Conference on Artificial Intelligence (AAAI 2007),
pp 1243-1249.
[27% acceptance rate]
-
Analyzing Feature Generation for Value-Function
Approximation, Ronald Parr, Christopher Painter-Wakefield, Lihong Li,
and Michael Littman, International Conference on Machine Learning
(ICML-2007), pp. 737-744. [29% acceptance rate]
- Efficient Selection of Disambiguating Actions for Stereo
Vision, Monika Schaeffer and Ronald Parr, Proceedings of the
Twenty Second Conference on Uncertainty in Artificial Intelligence
(UAI-2006), pp. 418-427. [plenary presentation; 32% acceptance
overall; 12% of submitted papers were given plenary presentations]
-
Hierarchical Linear/Constant Time SLAM using Particle Filters for Dense
Maps, Austin I. Eliazar and Ronald Parr, Advances in
Neural Information Processing Systems
(NIPS-19) 2005, pp. 339-346. [27% acceptance rate]
-
Learning Probabilistic Motion Models for Mobile
Robots, Austin I. Eliazar and Ronald Parr, Proceedings of the Twenty First
International Conference on Machine Learning
(ICML-2004), pp. 249-256. [32% acceptance rate]
-
DP-SLAM 2.0, Austin Eliazar and Ronald Parr,
IEEE 2004 International Conference on Robotics and Automation (ICRA 2004),
pp. 1314-1320. [58% acceptance rate]
-
Reinforcement Learning as Classification: Leveraging
Modern Classifiers, Michail Lagoudakis and Ronald Parr,
Proceedings of the Twentieth International Conference on Machine Learning
(ICML-2003). [32% acceptance rate]
-
DP-SLAM: Fast, Robust Simultaneous Localization
and Mapping without Predetermined Landmarks, Austin Eliazar and Ronald
Parr, Proceedings of the Eighteenth International Joint Conference on
Artificial Intelligence (IJCAI 03). [21% acceptance rate]
-
Learning in Zero-Sum Team Markov Games using Factored Value
Functions Michail Lagoudakis and Ronald Parr, Advances in
Neural Information Processing Systems
(NIPS-15) 2002. [30% acceptance rate]
-
Value Function Approximation in Zero-Sum Markov
Games Michail Lagoudakis and Ronald Parr, Proceedings of the 18th
Conference on Uncertainty in Artificial Intelligence (UAI 2002),
pp. 283-292, August 2002. [34% acceptance rate]
-
Coordinated Reinforcement Learning, Carlos
Guestrin, Michail Lagoudakis,and Ronald Parr. Nineteenth International
Conference on Machine Learning (ICML-2002), pp. 227-234, July
2002. [33% acceptance rate]
-
XPathLearner: An On-Line Self-Tuning Markov Histogram for XML Path
Selectivity Estimation, Lipyeow Lim, Min Wang, Sriram Padmanabhan,
Jeffrey Scott Vitter, Ronald Parr. Twenty Eighth International Conference
on Very Large Databases. (VLDB 2002), pp. 442-453, August 2002. [16%
acceptance rate]
-
Multiagent Planning with Factored MDPs, Carlos Guestrin, Daphne Koller
and Ronald Parr, Advances in Neural Information Processing
Systems 2001 (NIPS-14), pp. 1532-1530, December 2001. [Accepted for a full oral
presentation. Less than 4% of submitted papers got oral presentations this
year.]
-
Model-Free Least-Squares Policy Iteration Michail Lagoudakis, Ronald
Parr, Advances in Neural Information Processing Systems
2001 (NIPS-14), pp. 1547-1554, August 2001. [Accepted for a poster spotlight.
Approximately 10% of submitted papers got poster spotlights this year.]
-
Inference in Hybrid Networks: Theoretical Limits and Practical
Algorithms, Uri Lerner, Ronald Parr, Uncertainty in Artificial
Intelligence, Proceedings of the Seventeenth Conference (UAI 2001),
pp. 310-318, August 2000. [Joint winner of best student paper award
(student first author).]
-
Max-norm Projections for Factored MDPs, Carlos
Guestrin, Daphne Koller and Ronald Parr, Proceedings of the Seventeenth
International Joint Conference on Artificial Intelligence (IJCAI 2001),
pp. 673-680, August 2001. [25% acceptance rate]
-
Policy Iteration for Factored MDPs, Daphne Koller and Ronald Parr,
Uncertainty in Artificial Intelligence, Proceedings of the Sixteenth
Conference (UAI 2000), pp. 326-334, June 2000. [45% acceptance rate]
- Bayesian Fault Detection and Diagnosis in Dynamic Systems,
Uri Lerner, Ronald Parr, Daphne Koller and Gautam Biswas, Proceedings of
the Twelfth National Conference on Artificial Intelligence (AAAI
2000), pp. 531-537, July 2000. [33% acceptance rate]
- Making Rational Decisions Using Adaptive Utility Elicitation,
Urszula Chajewska, Daphne Koller, Ronald Parr, Proceedings of
the Twelfth National Conference on Artificial Intelligence (AAAI
2000), pp. 363-369, July 2000.
[33% acceptance rate]
- Policy Search via Density Estimation, Andrew Y. Ng, Ronald Parr,
Daphne Koller, Neural Information Processing Systems 1999 (NIPS 99),
pp. 1022-1028, December 1999.
[Accepted for a poster spotlight. 10% of submitted papers received
poster spotlights.]
- Reinforcement Learning using Approximate Belief States,
Andrés Rodgríguez, Ronald Parr, Daphne Koller, Neural
Information Processing Systems 1999 (NIPS 99), pp. 1036-1042, December
1999. [32% acceptance rate]
-
Computing Factored Value Functions for Policies in Structured MDPs,
Daphne Koller, Ronald Parr, Proceedings of the Sixteenth International
Joint Conference on Artificial Intelligence (IJCAI 1999), pp. 1332-1339,
July 1999. [26% acceptance rate]
-
Flexible Decomposition Algorithms for Weakly Coupled Markov Decision
Problems, Ronald Parr, Proceedings of the Fourteenth Conference on
Uncertainty in Artificial Intelligence (UAI-98), pp. 422-430, July 1998. [45%
acceptance rate]
-
Reinforcement Learning with Hierarchies of Machines, Ronald Parr,
Stuart Russell, Neural Information Processing Systems 1998 (NIPS 98),
pp. 1043-1049, December 1998.
[Accepted for a full oral presentation. 4.5% of submitted papers
were given an oral presentation.]
-
Generalized Prioritized Sweeping, David Andre, Nir Friedman, Ronald
Parr, Neural Information Processing Systems 1998 (NIPS 98), pp. 1001-1007,
December 1998. [31% acceptance rate]
-
Approximating Optimal Policies for Partially Observable Stochastic
Domains, Ronald Parr, Stuart Russell,
Proceedings of the Fourteenth
International Joint
Conference on Artificial Intelligence (IJCAI-95), pp. 1088-1094, August
1995. [22% acceptance rate]
- Provably Bounded Optimal Agents,
Stuart Russell, Devika Subramanian, Ronald Parr, in Proceedings
of the Thirteenth International Joint Conference on Artificial
Intelligence (IJCAI-93), pp. 338-344, August, 1993. [25% acceptance rate]
Other Papers
- Fitted Q-Learning for Relational Domains, Srijita Das, Sriraam Natarajan, Kaushik Roy, Ronald Parr and Kristian
Kersting, 17th International Conference on Knowledge Representation and Reasoning 2020 (KR2020) poster. Full paper available on arXiv.
- L1 Regularized Linear Temporal Difference
Learning, Duke CS Technical Report TR-2012-01
Christopher Painter-Wakefield and Ronald Parr, 2012.
-
Planning Aims for a Network of Horizontal and Overhead Sensors, Erik
Halvorson and Ronald Parr, International Symposium on Artificial
Intelligence and Mathematics 2008 (ISAIM 2008).
-
Counting Objects with a Combination of Horizontal and Overhead
Sensors, Erik Halvorson and Ronald Parr, Duke CS Technical Report
CS-2007-04. (Longer version of the above ISAIM 2008 symposium paper.)
-
Efficient Selection of Disambiguating Actions for Stereo Vision,
Monika Schaeffer and Ronald Parr, NIPS 2005 Workshop on Value of
Information in Inference, Learning and Decision Making.
-
Least-Squares Methods in Reinforcement Learning for Control, Michail
Lagoudakis, Ronald Parr and Michael L. Littman. Second
Hellenic Conference on Artificial Intelligence (SETN-02).
-
Model-Free Least-Squares Policy Iteration, Michail Lagoudakis and
Ronald Parr, Duke University Technical Report. (Longer version of above
NIPS 2001 paper.)
-
Coordinated Reinforcement Learning, Carlos Guestrin, Michail
Lagoudakis,and Ronald Parr. Proceedings of the 2002
AAAI Spring Symposium Series: Collaborative Learning Agents
-
Selecting the Right Algorithm Michail Lagoudakis, Michael L. Littman and
Ronald Parr, Proceedings of the 2001 AAAI Fall Symposium Series: Using
Uncertainty within Computation, Cape Cod, MA, November 2001.
-
Solving Factored POMDPs with Linear Value Functions, Carlos Guestrin,
Daphne Koller and Ronald Parr, In the IJCAI-01 workshop on Planning under
Uncertainty and Incomplete Information.
-
Max-norm Projections for Factored
MDPs, Carlos Guestrin, Daphne Koller, and Ronald Parr, AAAI Spring
Symposium, Stanford, California, March 2001.
- Adaptive Utility Elicitation using Value of Information,
Chajewska, U., M. Kuppermann, R. Parr and D. Koller. (Abstract),
Presented at the 22nd Annual Meeting of the Society for Medical Decision
Making, 2000).
-
Hierarchical Control and Learning for Markov Decision Processes,
Ph.D. Dissertation, 1998, University of California, Berkeley.
-
A Unifying Framework for Temporal Abstraction in Stochastic Processes,
Ronald Parr, Symposium on Abstraction Reformulation and Approximation,
1998 (SARA-98).
-
Feasibility Study of Fully Automated Vehicles Using Decision-theoretic
Control, Jeffrey Forbes, Nikunj Oza, Ronald Parr, Stuart Russell,
California PATH Research Report, UCB-ITS-PRR-97-18.
-
Policy Based Clustering in Markov Decision Problems, AAAI-96 Fall
Symposium on Learning Complex Behaviors in Adaptive Intelligent Systems.
Funding
- ARO: NeuroSymbolic Hierarchical Reinforcement Learning, Ronald Parr
and Prasad Tadepalli (Oregon State), Submitted 12/20 (funded 10/22), Parr portion: $274K.
-
NSF: Feature Encoding for Reinforcement Learning, $500,000, with Larry Carin, Parr portion approximately $250,000.
-
NSF EAGER: Reasoning and Acting in Large, Structured, Uncertain Domains, $200,000, with Sriraam Natarajan, Parr portion $100,000.
-
DARPA MTO: Foundations of Sequential Learning, $281,000, with Cynthia Rudin and Kamesh Munagala, 2016, Parr portion approximately $70,000.
-
NSF NRI: Expert-Apprentice Collaboration, $750,000, with Carlo Tomasi (PI), 2012-2017.
Parr portion $375,000.
-
NSF RI: Non-parametric Approximate Dynamic Programming for Continuous
Domains, $450,000, 2012-2017.
-
NSF EAGER: Learning in Continuous and High Dimensional Action
Spaces, $150,000, 2011-2013.
-
NSF IGERT: Training Program in Wireless Intelligent Sensor Networks
(WISeNet), $3,000,000 (subject to continuing approval), with Silvia
Ferrari, Gabriel G. Katul, John D. Albertson, and Pankaj K. Agarwal. Parr
portion is difficult to assess since this is a training grant and student
participation in the grant is not decided a priori, 2011-2016.
-
CRA CI Fellow Award, $140,000, funding for postdoc Jeff
Johns, (Parr portion depends upon interpretation. The funding
process is initiated by and attached to the postdoc, but it is
also associated with a particular mentor. For bureaucratic reasons,
the funding is associated completely with Parr within Duke because
postdocs cannot serve as PI's.) 2009-2011.
-
ARO: Efficient Algorithms for Computing Stackelberg Strategies in
Security Games, $180,000, Joint with Vincent Conitzer. (This
primarily funds one of Conitzer's students, so the Parr portion is
neglible.) 2009-2011.
-
U.S. Department of Education GAANN grant
, $384,000, (Joint with
multiple investigators in the department.), Parr portion: $0, (The
Department of Education has determined that my students are too wealthy to
receive funding from this grant, so the funds have been allocated to
other students in the department.) 2007.
-
NSF RI: Feature Discovery and Benchmarks for Exportable Reinforcement
Learning, $450,000, (Joint with Michael Littman from Rutgers
University. Duke/Parr portion $225,000), 2007.
-
DARPA CSSG year 2 funding, $500,000, 2007-2009.
-
NSF CAREER: Observing to Plan, Planning to Observe, $440,000, 2006-2012.
-
DARPA CSSG grant, $80,000, 2006.
-
SAIC gift in support of robotics research, $80,000, 2006.
- SAIC gift of 3 research robots, approx. value $50,000, 2006.
-
Computing a Semantic View of a Scene for Surveillance from Stereo and
Discreet LIDAR, with Carlo Tomasi and Industrial Partner IAI, DARPA
SBIR Phase I. Duke portion: $22,000, Parr portion $11,000, 2005-2006.
(Duke refused to sign the contract for this grant.)
-
A Core Experimental Facility for Computer Vision and Artificial
Intelligence, NSF CRI grant (joint with Carlo Tomasi), $300,000, 2005-2007.
-
SAIC gift in support of robotics research, $80,000, 2005.
-
SAIC gift for graduate student support, $20,000, 2004.
-
Sloan Research Fellowship, Alfred P. Sloan Foundation, $40,000,
2003-2004, extended 1 year.
-
Prediction and Planning: Bridging the Gap, National Science
Foundation, $291,677. Official start date: 2002-2005.
Postdocs Supervised
- Jeff Johns (2009-2010, now at FireEye)
Students Advised
- Jason Pazis (Ph.D., Fall 2015, now at Amazon)
- Mac Mason (Ph.D., Summer 2013, now at Third Wave Automation)
- Christopher Painter-Wakefield (Ph.D., in Spring 2013, Teaching Associate
Professor Colorado School of Mines)
- Monika Schaeffer (Master's in Spring 2011)
- Gavin Taylor (Ph.D. in Spring 2011, Associate Professor at
the United States Naval Academy)
- Erik Halvorson (Master's in Summer 2008)
- Neeti Wagle (Master's in Spring 2008)
- Austin Eliazar (Ph.D. in Fall 2005, now at HealthVerity)
- Michail Lagoudakis (Ph.D. in Spring 2003, now associate professor, Technical
University of Crete, Greece)
Memberships
AAAI
Citizenship
Natural born U.S. citizen.
Contact Information
Department of Computer Science
Duke University
LSRC / Box 90129
Durham, NC 27708
(919)660-6537
http://www.cs.duke.edu/~parr