Set augmented triplet loss for video person re-identification

Pengfei Fang, Pan Ji, Lars Petersson, Mehrtash Harandi

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    10 Citations (Scopus)

    Abstract

    Modern video person re-identification (re-ID) machines are often trained using a metric learning approach, supervised by a triplet loss. The triplet loss used in video re-ID is usually based on so-called clip features, each aggregated from a few frame features. In this paper, we propose to model the video clip as a set and instead study the distance between sets in the corresponding triplet loss. In contrast to the distance between clip representations, the distance between clip sets considers the pair-wise similarity of each element (i.e., frame representation) between two sets. This allows the network to directly optimize the feature representation at a frame level. Apart from the commonly-used set distance metrics (e.g., ordinary distance and Hausdorff distance), we further propose a hybrid distance metric, tailored for the set-aware triplet loss. Also, we propose a hard positive set construction strategy using the learned class prototypes in a batch. Our proposed method achieves state-of-the-art results across several standard benchmarks, demonstrating the advantages of the proposed method.

    Original languageEnglish
    Title of host publicationProceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages464-473
    Number of pages10
    ISBN (Electronic)9780738142661
    DOIs
    Publication statusPublished - Jan 2021
    Event2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 - Virtual, Online, United States
    Duration: 5 Jan 20219 Jan 2021

    Publication series

    NameProceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021

    Conference

    Conference2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021
    Country/TerritoryUnited States
    CityVirtual, Online
    Period5/01/219/01/21

    Fingerprint

    Dive into the research topics of 'Set augmented triplet loss for video person re-identification'. Together they form a unique fingerprint.

    Cite this