Continuous real time dynamic programming for discrete and continuous state MDPs

Luis Gustavo Rocha Vianna, Scott Sanner, Leliane Nunes De Barros

    Research output: Chapter in Book/Report/Conference proceedingConference Paperpeer-review

    2 Citations (Scopus)

    Abstract

    Applications of automated planning under uncertainty are often modelled as a discrete and continuous state Markov Decision Process (DC-MDP). Symbolic Dynamic Programming is the existent exact solution for DC-MDPs that uses the eXtended Algebraic Decision Diagrams (XADDs) to symbolically represent the state value function and that computes a complete state-space policy (which is very costly and limits solution to problems with small size and depth). Real-Time Dynamic Programming (RTDP) is an efficient solution method for discrete state MDPs that provides a partial solution for a known initial state. In this paper we combine the RTDP solution with XADD symbolic representation and computation of the value function to propose the Continuous Real Time Dynamic Programming (CRTDP) algorithm. This novel planner uses heuristic search and symbolic generalisation to efficiently update the value function by regions. We show that using the initial state information greatly reduces the number of regions in the value function, therefore allowing CRTDP to solve DC-MDPs more efficiently than standard symbolic dynamic programming both in time and space required for the solution.

    Original languageEnglish
    Title of host publicationProceedings - 2014 Brazilian Conference on Intelligent Systems, BRACIS 2014
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages134-139
    Number of pages6
    ISBN (Electronic)9781479956180
    DOIs
    Publication statusPublished - 12 Dec 2014
    Event3rd Brazilian Conference on Intelligent Systems, BRACIS 2014 - Sao Carlos, Sao Paulo, Brazil
    Duration: 19 Oct 201423 Oct 2014

    Publication series

    NameProceedings - 2014 Brazilian Conference on Intelligent Systems, BRACIS 2014

    Conference

    Conference3rd Brazilian Conference on Intelligent Systems, BRACIS 2014
    Country/TerritoryBrazil
    CitySao Carlos, Sao Paulo
    Period19/10/1423/10/14

    Fingerprint

    Dive into the research topics of 'Continuous real time dynamic programming for discrete and continuous state MDPs'. Together they form a unique fingerprint.

    Cite this