TY - GEN
T1 - Metric state space reinforcement learning for a vision-capable mobile robot
AU - Zhumatiy, Viktor
AU - Gomez, Faustino
AU - Hutter, Marcus
AU - Schmidhuber, Jürgen
PY - 2006
Y1 - 2006
N2 - We address the problem of autonomously learning controllers for vision-capable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for general metrics over state-action trajectories. We demonstrate the feasibility of our approach by successfully running our algorithm on a real mobile robot. The algorithm is novel and unique in that it (a) explores the environment and learns directly on a mobile robot without using a hand-made computer model as an intermediate step, (b) does not require manual discretization of the sensor input space, (c) works in piecewise continuous perceptual spaces, and (d) copes with partial observability. Together this allows learning from much less experience compared to previous methods.
AB - We address the problem of autonomously learning controllers for vision-capable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for general metrics over state-action trajectories. We demonstrate the feasibility of our approach by successfully running our algorithm on a real mobile robot. The algorithm is novel and unique in that it (a) explores the environment and learns directly on a mobile robot without using a hand-made computer model as an intermediate step, (b) does not require manual discretization of the sensor input space, (c) works in piecewise continuous perceptual spaces, and (d) copes with partial observability. Together this allows learning from much less experience compared to previous methods.
KW - Mobile robots
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=84871868155&partnerID=8YFLogxK
M3 - Conference contribution
SN - 1586035959
SN - 9781586035952
T3 - Intelligent Autonomous Systems 9, IAS 2006
SP - 272
EP - 281
BT - Intelligent Autonomous Systems 9, IAS 2006
T2 - 9th International Conference on Intelligent Autonomous Systems, IAS 2006
Y2 - 7 March 2006 through 9 March 2006
ER -