Neural aggregation network for video face recognition

Jiaolong Yang, Peiran Ren, Dongqing Zhang, Dong Chen, Fang Wen, Hongdong Li, Gang Hua

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

    269 Citations (Scopus)

    Abstract

    This paper presents a Neural Aggregation Network (NAN) for video face recognition. The network takes a face video or face image set of a person with a variable number of face images as its input, and produces a compact, fixed-dimension feature representation for recognition. The whole network is composed of two modules. The feature embedding module is a deep Convolutional Neural Network (CNN) which maps each face image to a feature vector. The aggregation module consists of two attention blocks which adaptively aggregate the feature vectors to form a single feature inside the convex hull spanned by them. Due to the attention mechanism, the aggregation is invariant to the image order. Our NAN is trained with a standard classification or verification loss without any extra supervision signal, and we found that it automatically learns to advocate high-quality face images while repelling low-quality ones such as blurred, occluded and improperly exposed faces. Experiments on the IJB-A, YouTube Face, and Celebrity-1000 video face recognition benchmarks show that it consistently outperforms naive aggregation methods and achieves state-of-the-art accuracy.
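The aggregation step described in the abstract can be illustrated with a minimal sketch of a single attention block. The abstract does not give the scoring rule or how the two blocks are connected, so the dot-product scoring against a query vector, the function name `attention_aggregate`, and the toy feature values below are all assumptions made for illustration, not the authors' implementation. The sketch only shows why softmax weights yield a point inside the convex hull of the inputs and why the result does not depend on image order.

```python
import math


def attention_aggregate(features, query):
    """Aggregate a variable-length list of feature vectors into one vector.

    Hypothetical helper: scores each feature by a dot product with `query`
    (an assumed scoring rule), then combines features with softmax weights.
    """
    # One scalar score per feature vector.
    scores = [sum(q * f for q, f in zip(query, feat)) for feat in features]

    # Numerically stable softmax: weights are positive and sum to 1,
    # so the output is a convex combination of the inputs.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Weighted sum over the set; summation makes the result order-invariant.
    dim = len(features[0])
    aggregated = [
        sum(w * feat[d] for w, feat in zip(weights, features))
        for d in range(dim)
    ]
    return aggregated, weights


# Toy stand-ins for per-image CNN features (made-up values).
features = [[1.0, 0.0, 2.0], [0.0, 1.0, 0.5], [3.0, -1.0, 0.0]]
query = [0.5, -0.2, 0.1]

aggregated, weights = attention_aggregate(features, query)
shuffled, _ = attention_aggregate(features[::-1], query)
```

Here `aggregated` has the same dimension as each input feature regardless of how many images are supplied, and `shuffled` matches it up to floating-point rounding. In a trained network the query would be learned, which is how higher weights could come to be assigned to higher-quality images.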

    Original language: English
    Title of host publication: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
    Publisher: Institute of Electrical and Electronics Engineers Inc.
    Pages: 5216-5225
    Number of pages: 10
    ISBN (Electronic): 9781538604571
    Publication status: Published - 6 Nov 2017
    Event: 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 - Honolulu, United States
    Duration: 21 Jul 2017 to 26 Jul 2017

    Publication series

    Name: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
    Volume: 2017-January

    Conference

    Conference: 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
    Country/Territory: United States
    City: Honolulu
    Period: 21/07/17 to 26/07/17
