When VLAD met Hilbert

Mehrtash Harandi, Mathieu Salzmann, Fatih Porikli

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

    Abstract

    In many challenging visual recognition tasks where training data is limited, Vectors of Locally Aggregated Descriptors (VLAD) have emerged as powerful image/video representations that compete with or outperform state-of-the-art approaches. In this paper, we address two fundamental limitations of VLAD: its requirement for the local descriptors to have vector form and its restriction to linear classifiers due to its high dimensionality. To this end, we introduce a kernelized version of VLAD. This not only lets us inherently exploit more sophisticated classification schemes, but also enables us to efficiently aggregate nonvector descriptors (e.g., manifold-valued data) in the VLAD framework. Furthermore, we propose an approximate formulation that allows us to accelerate the coding process while still benefiting from the properties of kernel VLAD. Our experiments demonstrate the effectiveness of our approach at handling manifold-valued data, such as covariance descriptors, on several classification tasks. On several standard benchmark datasets, our results also show the benefits of our nonlinear VLAD descriptors over the linear ones in Euclidean space.
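For orientation, the standard (linear) VLAD encoding that the abstract takes as its starting point can be sketched as below. This is a minimal illustration of the well-known baseline only, not the kernelized formulation proposed in the paper; the function name `vlad_encode`, the hard nearest-codeword assignment, and the global L2 normalization are assumptions chosen for the example.

```python
import math


def vlad_encode(descriptors, codebook):
    """Baseline VLAD sketch: aggregate residuals to the nearest codeword.

    descriptors: list of d-dimensional local descriptors (lists of floats)
    codebook:    list of k d-dimensional codewords (e.g. k-means centers)
    Returns a k*d vector, L2-normalized when it is non-zero.
    """
    k = len(codebook)
    d = len(codebook[0])
    residual_sums = [[0.0] * d for _ in range(k)]

    for x in descriptors:
        # Hard-assign the descriptor to its nearest codeword
        # (squared Euclidean distance).
        nearest = min(
            range(k),
            key=lambda j: sum((xi - ci) ** 2 for xi, ci in zip(x, codebook[j])),
        )
        # Accumulate the residual x - c for that codeword.
        for i in range(d):
            residual_sums[nearest][i] += x[i] - codebook[nearest][i]

    # Concatenate the per-codeword residual sums and L2-normalize.
    flat = [v for block in residual_sums for v in block]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


if __name__ == "__main__":
    codebook = [[0.0, 0.0], [10.0, 10.0]]
    descriptors = [[1.0, 0.0], [0.0, 2.0], [9.0, 10.0]]
    print(vlad_encode(descriptors, codebook))
```

The sketch makes both limitations named in the abstract visible: the residual `x - c` only exists when descriptors are vectors, and the output has `k * d` entries, which grows quickly with the codebook size.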

    Original language: English
    Title of host publication: Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
    Publisher: IEEE Computer Society
    Pages: 5185-5194
    Number of pages: 10
    ISBN (Electronic): 9781467388504
    Publication status: Published - 9 Dec 2016
    Event: 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 - Las Vegas, United States
    Duration: 26 Jun 2016 - 1 Jul 2016

    Publication series

    Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    Volume: 2016-December
    ISSN (Print): 1063-6919

    Conference

    Conference: 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
    Country/Territory: United States
    City: Las Vegas
    Period: 26/06/16 - 1/07/16
