Ordered pooling of optical flow sequences for action recognition

Jue Wang, Anoop Cherian, Fatih Porikli

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    28 Citations (Scopus)

    Abstract

    Training of Convolutional Neural Networks (CNNs) on long video sequences is computationally expensive due to the substantial memory requirements and the massive number of parameters that deep architectures demand. Early fusion of video frames is thus a standard technique, in which several consecutive frames are first agglomerated into a compact representation, and then fed into the CNN as an input sample. For this purpose, a summarization approach that represents a set of consecutive RGB frames by a single dynamic image to capture pixel dynamics is proposed recently. In this paper, we introduce a novel ordered representation of consecutive optical flow frames as an alternative and argue that this representation captures the action dynamics more efficiently than RGB frames. We provide intuitions on why such a representation is better for action recognition. We validate our claims on standard benchmark datasets and demonstrate that using summaries of flow images lead to significant improvements over RGB frames while achieving accuracy comparable to the stateof-The-Art on UCF101 and HMDB datasets.

    Original languageEnglish
    Title of host publicationProceedings - 2017 IEEE Winter Conference on Applications of Computer Vision, WACV 2017
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages168-176
    Number of pages9
    ISBN (Electronic)9781509048229
    DOIs
    Publication statusPublished - 11 May 2017
    Event17th IEEE Winter Conference on Applications of Computer Vision, WACV 2017 - Santa Rosa, United States
    Duration: 24 Mar 201731 Mar 2017

    Publication series

    NameProceedings - 2017 IEEE Winter Conference on Applications of Computer Vision, WACV 2017

    Conference

    Conference17th IEEE Winter Conference on Applications of Computer Vision, WACV 2017
    Country/TerritoryUnited States
    CitySanta Rosa
    Period24/03/1731/03/17

    Fingerprint

    Dive into the research topics of 'Ordered pooling of optical flow sequences for action recognition'. Together they form a unique fingerprint.

    Cite this