Abstract
Reinforcement learning systems improve behaviour based on scalar rewards from a critic. In this work vision based behaviours, servoing and wandering, are learned through a Q-learning method which handles continuous states and actions. There is no requirement for camera calibration, an actuator model, or a knowledgeable teacher. Learning through observing the actions of other behaviours improves learning speed. Experiments were performed on a mobile robot using a real-time vision system.
Original language | English |
---|---|
Pages | 403-409 |
Number of pages | 7 |
Publication status | Published - 2000 |
Event | 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems - Takamatsu, Japan Duration: 31 Oct 2000 → 5 Nov 2000 |
Conference
Conference | 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems |
---|---|
Country/Territory | Japan |
City | Takamatsu |
Period | 31/10/00 → 5/11/00 |