Learning implicit models during target pursuit

Chris Gaskett*, Peter Brown, Gordon Cheng, Alexander Zelinsky

*Corresponding author for this work

    Research output: Contribution to journal › Conference article › peer-review

    2 Citations (Scopus)

    Abstract

    Smooth control of an active vision head's verge-axis joint is performed through continuous state and action reinforcement learning. The system learns to perform visual servoing based on rewards given relative to tracking performance. The learned controller compensates for the velocity of the target and performs lag-free pursuit of a swinging target. By comparing controllers exposed to different environments, we show that the controller predicts the motion of the target by forming an implicit model of the target's motion. Experimental results are presented that demonstrate the advantages and disadvantages of implicit modelling.
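
    The abstract describes continuous state and action reinforcement learning driven by a tracking-performance reward. The sketch below illustrates that general idea only; it is not the authors' algorithm. It assumes a simple CACLA-style linear actor-critic, a sinusoidally swinging target, a velocity-commanded verge joint, and a reward equal to the negative squared tracking error. All constants, features, and dynamics are assumptions made for the example.

```python
"""Illustrative sketch only: continuous-state, continuous-action RL for
pursuit of a swinging target with a velocity-commanded verge joint.
This is NOT the algorithm used in the paper; the learner here is a
simple CACLA-style linear actor-critic, and every constant, feature,
and dynamic model below is an assumption made for the example."""
import numpy as np

rng = np.random.default_rng(0)
dt = 0.02                     # control period [s] (assumed)
omega = 2.0 * np.pi * 0.5     # swing frequency of the target [rad/s] (assumed)

def features(err, err_vel):
    """State features: tracking error, its rate of change, and a bias."""
    return np.array([err, err_vel, 1.0])

actor_w = np.zeros(3)         # linear actor: features -> joint velocity command
critic_w = np.zeros(3)        # linear critic: features -> state value
alpha_a, alpha_c, gamma, sigma = 0.01, 0.02, 0.95, 0.2

joint, prev_err = 0.0, 0.0
for step in range(20000):
    t = step * dt
    target = 0.5 * np.sin(omega * t)            # swinging target position [rad]
    err = target - joint
    err_vel = (err - prev_err) / dt
    phi = features(err, err_vel)

    # Continuous action: actor mean plus Gaussian exploration noise.
    action = float(np.clip(actor_w @ phi + rng.normal(0.0, sigma), -2.0, 2.0))

    # Apply the joint velocity command; reward penalises tracking error.
    joint += action * dt
    next_err = 0.5 * np.sin(omega * (t + dt)) - joint
    next_phi = features(next_err, (next_err - err) / dt)
    reward = -next_err ** 2

    # TD(0) critic update; CACLA-style actor update: move the actor toward
    # the explored action only when that action improved the predicted value.
    td = reward + gamma * (critic_w @ next_phi) - (critic_w @ phi)
    critic_w += alpha_c * td * phi
    if td > 0.0:
        actor_w += alpha_a * (action - actor_w @ phi) * phi

    prev_err = err

print("final absolute tracking error:", abs(err))
```

    Because the assumed state includes the error rate, such a learner can in principle anticipate the target's motion rather than merely react to position error, which loosely parallels the implicit modelling behaviour the abstract describes.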

    Original language: English
    Pages (from-to): 4122-4129
    Number of pages: 8
    Journal: Proceedings - IEEE International Conference on Robotics and Automation
    Volume: 3
    Publication status: Published - 2003
    Event: 2003 IEEE International Conference on Robotics and Automation - Taipei, Taiwan
    Duration: 14 Sept 2003 – 19 Sept 2003
