Action schema networks: Generalised policies with deep learning

Sam Toyer, Felipe Trevizan, Sylvie Thiébaux, Lexing Xie

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    60 Citations (Scopus)

    Abstract

    In this paper, we introduce the Action Schema Network (ASNet): a neural network architecture for learning generalised policies for probabilistic planning problems. By mimicking the relational structure of planning problems, ASNets are able to adopt a weight sharing scheme which allows the network to be applied to any problem from a given planning domain. This allows the cost of training the network to be amortised over all problems in that domain. Further, we propose a training method which balances exploration and supervised training on small problems to produce a policy which remains robust when evaluated on larger problems. In experiments, we show that ASNet's learning capability allows it to significantly outperform traditional non-learning planners in several challenging domains.

    Original languageEnglish
    Title of host publication32nd AAAI Conference on Artificial Intelligence, AAAI 2018
    PublisherAAAI Press
    Pages6294-6301
    Number of pages8
    ISBN (Electronic)9781577358008
    Publication statusPublished - 2018
    Event32nd AAAI Conference on Artificial Intelligence, AAAI 2018 - New Orleans, United States
    Duration: 2 Feb 20187 Feb 2018

    Publication series

    Name32nd AAAI Conference on Artificial Intelligence, AAAI 2018

    Conference

    Conference32nd AAAI Conference on Artificial Intelligence, AAAI 2018
    Country/TerritoryUnited States
    CityNew Orleans
    Period2/02/187/02/18

    Fingerprint

    Dive into the research topics of 'Action schema networks: Generalised policies with deep learning'. Together they form a unique fingerprint.

    Cite this