TY - GEN
T1 - Action schema networks
T2 - 32nd AAAI Conference on Artificial Intelligence, AAAI 2018
AU - Toyer, Sam
AU - Trevizan, Felipe
AU - Thiébaux, Sylvie
AU - Xie, Lexing
N1 - Publisher Copyright: Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2018
Y1 - 2018
N2 - In this paper, we introduce the Action Schema Network (ASNet): a neural network architecture for learning generalised policies for probabilistic planning problems. By mimicking the relational structure of planning problems, ASNets are able to adopt a weight sharing scheme which allows the network to be applied to any problem from a given planning domain. This allows the cost of training the network to be amortised over all problems in that domain. Further, we propose a training method which balances exploration and supervised training on small problems to produce a policy which remains robust when evaluated on larger problems. In experiments, we show that ASNet's learning capability allows it to significantly outperform traditional non-learning planners in several challenging domains.
UR - http://www.scopus.com/inward/record.url?scp=85054987121&partnerID=8YFLogxK
M3 - Conference contribution
T3 - 32nd AAAI Conference on Artificial Intelligence, AAAI 2018
SP - 6294
EP - 6301
BT - 32nd AAAI Conference on Artificial Intelligence, AAAI 2018
PB - AAAI Press
Y2 - 2 February 2018 through 7 February 2018
ER -