Skip to main navigation Skip to search Skip to main content

An online and approximate solver for POMDPs with continuous action space

Konstantin M. Seiler, Hanna Kurniawati, Surya P. N. Singh

Research output: Chapter in Book/Report/Conference proceedingConference Paperpeer-review

54 Citations (Scopus)

Abstract

For agile, accurate autonomous robotics, it is desirable to plan motion in the presence of uncertainty. The Partially Observable Markov Decision Process (POMDP) provides a principled framework for this. Despite the tremendous advances of POMDP-based planning, most can only solve problems with a small and discrete set of actions. This paper presents General Pattern Search in Adaptive Belief Tree (GPS-ABT), an approximate and online POMDP solver for problems with continuous action spaces. Generalized Pattern Search (GPS) is used as a search strategy for action selection. Under certain conditions, GPS-ABT converges to the optimal solution in probability. Results on a box pushing and an extended Tag benchmark problem are promising.
Original languageEnglish
Title of host publicationIEEE International Conference on Robotics and Automation, ICRA 2015, Seattle, WA, USA, 26-30 May, 2015
PublisherIEEE
Pages2290-2297
Number of pages8
DOIs
Publication statusPublished - 2015
Externally publishedYes

Fingerprint

Dive into the research topics of 'An online and approximate solver for POMDPs with continuous action space'. Together they form a unique fingerprint.

Cite this