Skip to main navigation Skip to search Skip to main content

An On-Line Planner for POMDPs with Large Discrete Action Space: A Quantile-Based Approach

Erli Wang, Hanna Kurniawati, Dirk P. Kroese

Research output: Chapter in Book/Report/Conference proceedingConference Paperpeer-review

9 Citations (Scopus)

Abstract

Making principled decisions in the presence of uncertainty
is often facilitated by Partially Observable Markov Decision Processes (POMDPs). Despite tremendous advances
in POMDP solvers, finding good policies with large action
spaces remains difficult. To alleviate this difficulty, this paper presents an on-line approximate solver, called QuantileBased Action Selector (QBASE). It uses quantile-statistics to
adaptively evaluate a small subset of the action space without
sacrificing the quality of the generated decision strategies by
much. Experiments on four different robotics tasks with up
to 10,000 actions indicate that QBASE can generate substantially better strategies than a state-of-the-art method.
Original languageEnglish
Title of host publicationProceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling, ICAPS 2018, Delft, The Netherlands, June 24-29, 2018
EditorsMathijs de Weerdt, Sven Koenig, Gabriele Röger, Matthijs T. J. Spaan
PublisherAAAI Press
Pages273-277
Number of pages5
Publication statusPublished - 2018
Externally publishedYes

Fingerprint

Dive into the research topics of 'An On-Line Planner for POMDPs with Large Discrete Action Space: A Quantile-Based Approach'. Together they form a unique fingerprint.

Cite this