Point-based policy transformation: Adapting policy to changing pomdp models

Hanna Kurniawati, Nicholas M. Patrikalakis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Motion planning under uncertainty that can efficiently take into account changes in the environment is critical for robots to operate reliably in our living spaces. Partially Observable Markov Decision Process (POMDP) provides a systematic and general framework for motion planning under uncertainty. Point-based POMDP has advanced POMDP planning tremendously over the past few years, enabling POMDP planning to be practical for many simple to moderately difficult robotics problems. However, when environmental changes alter the POMDP model, most existing POMDP planners recompute the solution from scratch, often wasting significant computational resources that have been spent for solving the original problem. In this paper, we propose a novel algorithm, called Point-Based Policy Transformation (PBPT), that solves the altered POMDP problem by transforming the solution of the original problem to accommodate changes in the problem. PBPT uses the point-based POMDP approach. It transforms the original solution by modifying the set of sampled beliefs that represents the belief space B, and then uses this new set of sampled beliefs to revise the original solution. Preliminary results indicate that PBPT generates a good policy for the altered POMDP model in a matter of minutes, while recomputing the policy using the fastest offline POMDP planner today fails to find a policy with similar quality after two hours of planning time, even when the policy for the original problem is reused as an initial policy.

Original languageEnglish
Title of host publicationSpringer Tracts in Advanced Robotics
EditorsEmilio Frazzoli, Nicholas Roy, Tomas Lozano-Perez, Daniela Rus
PublisherSpringer Verlag Italia
Pages493-509
Number of pages17
ISBN (Print)9783642362781
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event10th International Workshop on the Algorithmic Foundations of Robotics, WAFR 2012 - Cambridge, United States
Duration: 13 Jun 201215 Jun 2012

Publication series

NameSpringer Tracts in Advanced Robotics
Volume86
ISSN (Print)1610-7438
ISSN (Electronic)1610-742X

Conference

Conference10th International Workshop on the Algorithmic Foundations of Robotics, WAFR 2012
Country/TerritoryUnited States
CityCambridge
Period13/06/1215/06/12

Fingerprint

Dive into the research topics of 'Point-based policy transformation: Adapting policy to changing pomdp models'. Together they form a unique fingerprint.

Cite this