Sequential Monte Carlo in probabilistic planning reachability heuristics

Daniel Bryce, Subbarao Kambhampati, David E. Smith

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Scopus citations

Abstract

The current best conformant probabilistic planners encode the problem as a bounded length CSP or SAT problem. While these approaches can find optimal solutions for given plan lengths, they often do not scale for large problems or plan lengths. As has been shown in classical planning, heuristic search outperforms CSP/SAT techniques (especially when a plan length is not given a priori). The problem with applying heuristic search in probabilistic planning is that effective heuristics are as yet lacking. In this work, we apply heuristic search to conformant probabilistic planning by adapting planning graph heuristics developed for non-deterministic planning. We evaluate a straight-forward application of these planning graph techniques, which amounts to exactly computing the distribution over reachable relaxed planning graph layers. Computing these distributions is costly, so we apply Sequential Monte Carlo to approximate them. We demonstrate on several domains how our approach enables our planner to far out-scale existing (optimal) probabilistic planners and still find reasonable quality solutions.

Original languageEnglish (US)
Title of host publicationICAPS 2006 - Proceedings, Sixteenth International Conference on Automated Planning and Scheduling
Pages233-242
Number of pages10
Volume2006
StatePublished - 2006
EventICAPS 2006 - 16th International Conference on Automated Planning and Scheduling - Cumbria, United Kingdom
Duration: Jun 6 2006Jun 10 2006

Other

OtherICAPS 2006 - 16th International Conference on Automated Planning and Scheduling
Country/TerritoryUnited Kingdom
CityCumbria
Period6/6/066/10/06

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Sequential Monte Carlo in probabilistic planning reachability heuristics'. Together they form a unique fingerprint.

Cite this