Loss-calibrated Monte Carlo action selection

Ehsan Abbasnejad, Justin Domke, Scott Sanner

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    6 Citations (Scopus)

    Abstract

    Bayesian decision-theory underpins robust decisionmaking in applications ranging from plant control to robotics where hedging action selection against state uncertainty is critical for minimizing low probability but potentially catastrophic outcomes (e.g, uncontrollable plant conditions or robots falling into stairwells). Unfortunately, belief state distributions in such settings are often complex and/or high dimensional, thus prohibiting the efficient application of analytical techniques for expected utility computation when real-time control is required. This leaves Monte Carlo evaluation as one of the few viable (and hence frequently used) techniques for online action selection. However, loss-insensitive Monte Carlo methods may require large numbers of samples to identify optimal actions with high certainty since they may sample from high probability regions that do not disambiguate action utilities.In this paper we remedy this problem by deriving an optimal proposal distribution for a loss-calibrated Monte Carlo importance sampler that bounds the regret of using an estimated optimal action. Empirically, we show that using our loss-calibrated Monte Carlo method yields high-accuracy optimal action selections in a fraction of the number of samples required by conventional loss-insensitive samplers.

    Original languageEnglish
    Title of host publicationProceedings of the 29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015
    PublisherAI Access Foundation
    Pages3447-3453
    Number of pages7
    ISBN (Electronic)9781577357032
    Publication statusPublished - 1 Jun 2015
    Event29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015 - Austin, United States
    Duration: 25 Jan 201530 Jan 2015

    Publication series

    NameProceedings of the National Conference on Artificial Intelligence
    Volume5

    Conference

    Conference29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015
    Country/TerritoryUnited States
    CityAustin
    Period25/01/1530/01/15

    Fingerprint

    Dive into the research topics of 'Loss-calibrated Monte Carlo action selection'. Together they form a unique fingerprint.

    Cite this