Analytic Decision analysis via symbolic dynamic programming for parameterized hybrid MDPs

Shamin Kinathil, Harold Soh, Scott Sanner

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Decision analysis w.r.t. unknown parameters is a critical task in decision-making under uncertainty. For example, we may need to (i) perform inverse learning of the cost parameters of a multi-objective reward based on observed agent behavior; (ii) perform sensitivity analyses of policies to various parameter settings; or (iii) analyze and optimize policy performance as a function of policy parameters. When such problems have mixed discrete and continuous state and/or action spaces, this leads to parameterized hybrid MDPs (PHMDPs) that are often approximately solved via discretization, sampling, and/or local gradient methods (when optimization is involved). In this paper we combine two recent advances that allow for the first exact solution and optimization of PHMDPs. We first show how each of the aforementioned use cases can be formalized as PHMDPs, which can then be solved via an extension of symbolic dynamic programming (SDP) even when the solution is piecewise nonlinear. Secondly, we can leverage recent advances in non-convex solvers that require symbolic forms of the objective function for non-convex global optimization in (i), (ii), and (iii) using SDP to derive symbolic solutions for each PHMDP formalization. We demonstrate the efficacy and scalability of our optimal analytical framework on nonlinear examples of each of the aforementioned use cases.

Original languageEnglish
Title of host publicationProceedings of the 27th International Conference on Automated Planning and Scheduling, ICAPS 2017
EditorsLaura Barbulescu, Stephen F. Smith, Mausam, Jeremy D. Frank
PublisherAAAI Press
Pages181-185
Number of pages5
ISBN (Electronic)9781577357896
Publication statusPublished - 2017
Externally publishedYes
Event27th International Conference on Automated Planning and Scheduling, ICAPS 2017 - Pittsburgh, United States
Duration: 18 Jun 201723 Jun 2017

Publication series

NameProceedings International Conference on Automated Planning and Scheduling, ICAPS
ISSN (Print)2334-0835
ISSN (Electronic)2334-0843

Conference

Conference27th International Conference on Automated Planning and Scheduling, ICAPS 2017
Country/TerritoryUnited States
CityPittsburgh
Period18/06/1723/06/17

Fingerprint

Dive into the research topics of 'Analytic Decision analysis via symbolic dynamic programming for parameterized hybrid MDPs'. Together they form a unique fingerprint.

Cite this