Abstract
Making principled decisions in the presence of uncertainty
is often facilitated by Partially Observable Markov Decision Processes (POMDPs). Despite tremendous advances
in POMDP solvers, finding good policies with large action
spaces remains difficult. To alleviate this difficulty, this paper presents an on-line approximate solver, called QuantileBased Action Selector (QBASE). It uses quantile-statistics to
adaptively evaluate a small subset of the action space without
sacrificing the quality of the generated decision strategies by
much. Experiments on four different robotics tasks with up
to 10,000 actions indicate that QBASE can generate substantially better strategies than a state-of-the-art method.
is often facilitated by Partially Observable Markov Decision Processes (POMDPs). Despite tremendous advances
in POMDP solvers, finding good policies with large action
spaces remains difficult. To alleviate this difficulty, this paper presents an on-line approximate solver, called QuantileBased Action Selector (QBASE). It uses quantile-statistics to
adaptively evaluate a small subset of the action space without
sacrificing the quality of the generated decision strategies by
much. Experiments on four different robotics tasks with up
to 10,000 actions indicate that QBASE can generate substantially better strategies than a state-of-the-art method.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling, ICAPS 2018, Delft, The Netherlands, June 24-29, 2018 |
| Editors | Mathijs de Weerdt, Sven Koenig, Gabriele Röger, Matthijs T. J. Spaan |
| Publisher | AAAI Press |
| Pages | 273-277 |
| Number of pages | 5 |
| Publication status | Published - 2018 |
| Externally published | Yes |
Fingerprint
Dive into the research topics of 'An On-Line Planner for POMDPs with Large Discrete Action Space: A Quantile-Based Approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver