Real-time symbolic dynamic programming for hybrid MDPs

Luis G.R. Vianna, Leliane N. De Barros, Scott Sanner

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    6 Citations (Scopus)

    Abstract

    Recent advances in Symbolic Dynamic Programming (SDP) combined with the extended algebraic decision diagram (XADD) have provided exact solutions for expressive subclasses of finite-horizon Hybrid Markov Decision Processes (HMDPs) with mixed continuous and discrete state and action parameters. Unfortunately, SDP suffers from two major drawbacks: (1) it solves for all states and can be intractable for many problems that inherently have large optimal XADD value function representations; and (2) it cannot maintain compact (pruned) XADD representations for domains with nonlinear dynamics and reward due to the need for nonlinear constraint checking. In this work, we simultaneously address both of these problems by introducing real-time SDP (RTSDP). RTSDP addresses (1) by focusing the solution and value representation only on regions reachable from a set of initial states and RTSDP addresses (2) by using visited states as witnesses of reachable regions to assist in pruning irrelevant or unreachable (nonlinear) regions of the value function. To this end, RTSDP enjoys provable convergence over the set of initial states and substantial space and time savings over SDP as we demonstrate in a variety of hybrid domains ranging from inventory to reservoir to traffic control.

    Original languageEnglish
    Title of host publicationProceedings of the 29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015
    PublisherAI Access Foundation
    Pages3402-3408
    Number of pages7
    ISBN (Electronic)9781577357032
    Publication statusPublished - 1 Jun 2015
    Event29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015 - Austin, United States
    Duration: 25 Jan 201530 Jan 2015

    Publication series

    NameProceedings of the National Conference on Artificial Intelligence
    Volume5

    Conference

    Conference29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015
    Country/TerritoryUnited States
    CityAustin
    Period25/01/1530/01/15

    Fingerprint

    Dive into the research topics of 'Real-time symbolic dynamic programming for hybrid MDPs'. Together they form a unique fingerprint.

    Cite this