Guiding search with generalized policies for probabilistic planning

William Shen, Felipe Trevizan, Sam Toyer, Sylvie Thiébaux, Lexing Xie

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    10 Citations (Scopus)

    Abstract

    We examine techniques for combining generalized policies with search algorithms to exploit the strengths and overcome the weaknesses of each when solving probabilistic planning problems. The Action Schema Network (ASNet) is a recent contribution to planning that uses deep learning and neural networks to learn generalized policies for probabilistic planning problems. ASNets are well suited to problems where local knowledge of the environment can be exploited to improve performance, but may fail to generalize to problems they were not trained on. Monte-Carlo Tree Search (MCTS) is a forward-chaining state space search algorithm for optimal decision making which performs simulations to incrementally build a search tree and estimate the values of each state. Although MCTS can achieve state-of-the-art results when paired with domain-specific knowledge, without this knowledge, MCTS requires a large number of simulations in order to obtain reliable state-value estimates. By combining AS-Nets with MCTS, we are able to improve the capability of an ASNet to generalize beyond the distribution of problems it was trained on, as well as enhance the navigation of the search space by MCTS.

    Original languageEnglish
    Title of host publicationProceedings of the 12th International Symposium on Combinatorial Search, SoCS 2019
    EditorsPavel Surynek, William Yeoh
    PublisherAAAI Press
    Pages97-105
    Number of pages9
    ISBN (Electronic)9781577358084
    Publication statusPublished - 2019
    Event12th International Symposium on Combinatorial Search, SoCS 2019 - Napa, United States
    Duration: 16 Jul 201917 Jul 2019

    Publication series

    NameProceedings of the 12th International Symposium on Combinatorial Search, SoCS 2019

    Conference

    Conference12th International Symposium on Combinatorial Search, SoCS 2019
    Country/TerritoryUnited States
    CityNapa
    Period16/07/1917/07/19

    Fingerprint

    Dive into the research topics of 'Guiding search with generalized policies for probabilistic planning'. Together they form a unique fingerprint.

    Cite this