Channel Recurrent Attention Networks for Video Pedestrian Retrieval

Pengfei Fang*, Pan Ji, Jieming Zhou, Lars Petersson, Mehrtash Harandi

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Full attention, which generates an attention value per element of the input feature maps, has been successfully demonstrated to be beneficial in visual tasks. In this work, we propose a fully attentional network, termed channel recurrent attention network, for the task of video pedestrian retrieval. The main attention unit, channel recurrent attention, identifies attention maps at the frame level by jointly leveraging spatial and channel patterns via a recurrent neural network. This channel recurrent attention is designed to build a global receptive field by recurrently receiving and learning the spatial vectors. Then, a set aggregation cell is employed to generate a compact video representation. Empirical experimental results demonstrate the superior performance of the proposed deep network, outperforming current state-of-the-art results across standard video person retrieval benchmarks, and a thorough ablation study shows the effectiveness of the proposed units.

    Original languageEnglish
    Title of host publicationComputer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers
    EditorsHiroshi Ishikawa, Cheng-Lin Liu, Tomas Pajdla, Jianbo Shi
    PublisherSpringer Science and Business Media Deutschland GmbH
    Pages427-443
    Number of pages17
    ISBN (Print)9783030695439
    DOIs
    Publication statusPublished - 2021
    Event15th Asian Conference on Computer Vision, ACCV 2020 - Virtual, Online
    Duration: 30 Nov 20204 Dec 2020

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume12627 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference15th Asian Conference on Computer Vision, ACCV 2020
    CityVirtual, Online
    Period30/11/204/12/20

    Fingerprint

    Dive into the research topics of 'Channel Recurrent Attention Networks for Video Pedestrian Retrieval'. Together they form a unique fingerprint.

    Cite this