Robust optimization for hybrid MDPs with state-dependent noise

Zahra Zaman, Scott Sanner, Karina Valdivia Delgado, Leliane Nunes De Barros

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    3 Citations (Scopus)

    Abstract

    Recent advances in solutions to Hybrid MDPs with discrete and continuous state and action spaces have significantly extended the class of MDPs for which exact solutions can be derived, albeit at the expense of a restricted transition noise model. In this paper, we work around limitations of previous solutions by adopting a robust optimization approach in which Nature is allowed to adversarially determine transition noise within pre-specified confidence intervals. This allows one to derive an optimal policy with an arbitrary (user-specified) level of success probability and significantly extends the class of transition noise models for which Hybrid MDPs can be solved. This work also significantly extends results for the related "chance- constrained" approach in stochastic hybrid control to accommodate state-dependent noise. We demonstrate our approach working on a variety of hybrid MDPs taken from AI planning, operations research, and control theory, noting that this is the first time robust solutions with strong guarantees over all states have been automatically derived for such problems.

    Original languageEnglish
    Title of host publicationIJCAI 2013 - Proceedings of the 23rd International Joint Conference on Artificial Intelligence
    Pages2437-2443
    Number of pages7
    Publication statusPublished - 2013
    Event23rd International Joint Conference on Artificial Intelligence, IJCAI 2013 - Beijing, China
    Duration: 3 Aug 20139 Aug 2013

    Publication series

    NameIJCAI International Joint Conference on Artificial Intelligence
    ISSN (Print)1045-0823

    Conference

    Conference23rd International Joint Conference on Artificial Intelligence, IJCAI 2013
    Country/TerritoryChina
    CityBeijing
    Period3/08/139/08/13

    Fingerprint

    Dive into the research topics of 'Robust optimization for hybrid MDPs with state-dependent noise'. Together they form a unique fingerprint.

    Cite this