Occupation Measure Heuristics for Probabilistic Planning

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    29 Citations (Scopus)

    Abstract

    For the past 25 years, heuristic search has been used to solve domain-independent probabilistic planning problems, but with heuristics that determinise the problem and ignore precious probabilistic information. To remedy this situation, we explore the use of occupation measures, which represent the expected number of times a given action will be executed in a given state of a policy. By relaxing the well-known linear program that computes them, we derive occupation measure heuristics the first admissible heuristics for stochastic shortest path problems (SSPs) taking probabilities into account. We show that these heuristics can also be obtained by extending recent operator-counting heuristic formulations used in deterministic planning. Since the heuristics are formulated as linear programs over occupation measures, they can easily be extended to more complex probabilistic planning models, such as constrained SSPs (C-SSPs). Moreover, their formulation can be tightly integrated into i-dual, a recent LP-based heuristic search algorithm for (constrained) SSPs, resulting in a novel probabilistic planning approach in which policy update and heuristic computation work in unison. Our experiments in several domains demonstrate the benefits of these new heuristics and approach.
    Original languageEnglish
    Title of host publicationProceedings of the 27th International Conference on Automated Planning and Scheduling, ICAPS 2017
    Place of PublicationAAAI Press
    PublisherAAAI Press
    Pages306-315pp
    ISBN (Print)9781577357896
    Publication statusPublished - 2017
    Event27th International Conference on Automated Planning and Scheduling (ICAPS 2017) - Pittsburgh, United States
    Duration: 1 Jan 2017 → …

    Conference

    Conference27th International Conference on Automated Planning and Scheduling (ICAPS 2017)
    Country/TerritoryUnited States
    Period1/01/17 → …
    OtherJune 18–23, 2017

    Fingerprint

    Dive into the research topics of 'Occupation Measure Heuristics for Probabilistic Planning'. Together they form a unique fingerprint.

    Cite this