More about VLAD: A leap from Euclidean to Riemannian manifolds

Masoud Faraki, Mehrtash T. Harandi, Fatih Porikli

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    41 Citations (Scopus)

    Abstract

    This paper takes a step forward in image and video coding by extending the well-known Vector of Locally Aggregated Descriptors (VLAD) onto an extensive space of curved Riemannian manifolds. We provide a comprehensive mathematical framework that formulates the aggregation problem of such manifold data into an elegant solution. In particular, we consider structured descriptors from visual data, namely Region Covariance Descriptors and linear subspaces that reside on the manifold of Symmetric Positive Definite matrices and the Grassmannian manifolds, respectively. Through rigorous experimental validation, we demonstrate the superior performance of this novel Riemannian VLAD descriptor on several visual classification tasks including video-based face recognition, dynamic scene recognition, and head pose classification.

    Original languageEnglish
    Title of host publicationIEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
    PublisherIEEE Computer Society
    Pages4951-4960
    Number of pages10
    ISBN (Electronic)9781467369640
    DOIs
    Publication statusPublished - 14 Oct 2015
    EventIEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 - Boston, United States
    Duration: 7 Jun 201512 Jun 2015

    Publication series

    NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    Volume07-12-June-2015
    ISSN (Print)1063-6919

    Conference

    ConferenceIEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
    Country/TerritoryUnited States
    CityBoston
    Period7/06/1512/06/15

    Fingerprint

    Dive into the research topics of 'More about VLAD: A leap from Euclidean to Riemannian manifolds'. Together they form a unique fingerprint.

    Cite this