Dynamic Image Networks for Action Recognition

Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    504 Citations (Scopus)

    Abstract

    We introduce the concept of dynamic image, a novel compact representation of videos useful for video analysis especially when convolutional neural networks (CNNs) are used. The dynamic image is based on the rank pooling concept and is obtained through the parameters of a ranking machine that encodes the temporal evolution of the frames of the video. Dynamic images are obtained by directly applying rank pooling on the raw image pixels of a video producing a single RGB image per video. This idea is simple but powerful as it enables the use of existing CNN models directly on video data with fine-tuning. We present an efficient and effective approximate rank pooling operator, speeding it up orders of magnitude compared to rank pooling. Our new approximate rank pooling CNN layer allows us to generalize dynamic images to dynamic feature maps and we demonstrate the power of our new representations on standard benchmarks in action recognition achieving state-of-the-art performance.

    Original languageEnglish
    Title of host publicationProceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
    PublisherIEEE Computer Society
    Pages3034-3042
    Number of pages9
    ISBN (Electronic)9781467388504
    DOIs
    Publication statusPublished - 9 Dec 2016
    Event29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 - Las Vegas, United States
    Duration: 26 Jun 20161 Jul 2016

    Publication series

    NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    Volume2016-December
    ISSN (Print)1063-6919

    Conference

    Conference29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
    Country/TerritoryUnited States
    CityLas Vegas
    Period26/06/161/07/16

    Fingerprint

    Dive into the research topics of 'Dynamic Image Networks for Action Recognition'. Together they form a unique fingerprint.

    Cite this