Robust online visual tracking with a single convolutional neural network

Hanxi Li*, Yi Li, Fatih Porikli

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    69 Citations (Scopus)

    Abstract

    Deep neural networks, albeit their great success on feature learning in various computer vision tasks, are usually considered as impractical for online visual tracking because they require very long training time and a large number of training samples. In this work, we present an efficient and very robust online tracking algorithm using a single Convolutional Neural Network (CNN) for learning effective feature representations of the target object over time. Our contributions are multifold: First, we introduce a novel truncated structural loss function that maintains as many training samples as possible and reduces the risk of tracking error accumulation, thus drift, by accommodating the uncertainty of the model output. Second, we enhance the ordinary Stochastic Gradient Descent approach in CNN training with a temporal selection mechanism, which generates positive and negative samples within different time periods. Finally, we propose to update the CNN model in a “lazy” style to speed-up the training stage, where the network is updated only when a significant appearance change occurs on the object, without sacrificing tracking accuracy. The CNN tracker outperforms all compared state-ofthe- art methods in our extensive evaluations that involve 18 well-known benchmark video sequences.

    Original languageEnglish
    Title of host publicationComputer Vision - ACCV 2014 - 12th Asian Conference on Computer Vision, Revised Selected Papers
    EditorsDaniel Cremers, Hideo Saito, Ian Reid, Ming-Hsuan Yang
    PublisherSpringer Verlag
    Pages194-209
    Number of pages16
    ISBN (Electronic)9783319168135
    DOIs
    Publication statusPublished - 2015
    Event12th Asian Conference on Computer Vision, ACCV 2014 - Singapore, Singapore
    Duration: 1 Nov 20145 Nov 2014

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume9007
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference12th Asian Conference on Computer Vision, ACCV 2014
    Country/TerritorySingapore
    CitySingapore
    Period1/11/145/11/14

    Fingerprint

    Dive into the research topics of 'Robust online visual tracking with a single convolutional neural network'. Together they form a unique fingerprint.

    Cite this