Robust visual vocabulary tracking using hierarchical model fusion

Behzad Bozorgtabar, Roland Goecke

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    In this paper, we propose a new visual tracking approach based on the Hierarchical Model Fusion framework, which fuses two different trackers to cope with different tracking problems. We use an Incremental Multiple Principal Component Analysis tracker as our main model as well as an image patch tracker as our auxiliary model. Firstly, we randomly sample image patches within the target region obtained by the main model in the training frames for constructing a visual vocabulary using Histogram of Oriented Gradient features. Secondly, we use a supervised learning algorithm based on a Gaussian Mixture Model, which not only operates on supervised information to improve the discriminative power of the clusters, but also increases the purity of the clusters. Then, auxiliary models are initialised by obtaining confidence scores of image patches based on the similarity between candidates and codewords. In addition, an updating procedure and a result refinement scheme are included in the proposed tracking approach. Experiments on challenging video sequences demonstrate the robustness of the proposed approach to handling occlusion, pose variation and rotation.

    Original languageEnglish
    Title of host publication2013 International Conference on Digital Image Computing
    Subtitle of host publicationTechniques and Applications, DICTA 2013
    DOIs
    Publication statusPublished - 2013
    Event2013 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2013 - Hobart, TAS, Australia
    Duration: 26 Nov 201328 Nov 2013

    Publication series

    Name2013 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2013

    Conference

    Conference2013 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2013
    Country/TerritoryAustralia
    CityHobart, TAS
    Period26/11/1328/11/13

    Fingerprint

    Dive into the research topics of 'Robust visual vocabulary tracking using hierarchical model fusion'. Together they form a unique fingerprint.

    Cite this