Siamese networks: The tale of two manifolds

Soumava Roy, Mehrtash Harandi, Richard Nock, Richard Hartley

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    40 Citations (Scopus)

    Abstract

    Siamese networks are non-linear deep models that have found their ways into a broad set of problems in learning theory, thanks to their embedding capabilities. In this paper, we study Siamese networks from a new perspective and question the validity of their training procedure. We show that in the majority of cases, the objective of a Siamese network is endowed with an invariance property. Neglecting the invariance property leads to a hindrance in training the Siamese networks. To alleviate this issue, we propose two Riemannian structures and generalize a well-established accelerated stochastic gradient descent method to take into account the proposed Riemannian structures. Our empirical evaluations suggest that by making use of the Riemannian geometry, we achieve state-of-the-art results against several algorithms for the challenging problem of fine-grained image classification.

    Original languageEnglish
    Title of host publicationProceedings - 2019 International Conference on Computer Vision, ICCV 2019
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages3046-3055
    Number of pages10
    ISBN (Electronic)9781728148038
    DOIs
    Publication statusPublished - Oct 2019
    Event17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 - Seoul, Korea, Republic of
    Duration: 27 Oct 20192 Nov 2019

    Publication series

    NameProceedings of the IEEE International Conference on Computer Vision
    Volume2019-October
    ISSN (Print)1550-5499

    Conference

    Conference17th IEEE/CVF International Conference on Computer Vision, ICCV 2019
    Country/TerritoryKorea, Republic of
    CitySeoul
    Period27/10/192/11/19

    Fingerprint

    Dive into the research topics of 'Siamese networks: The tale of two manifolds'. Together they form a unique fingerprint.

    Cite this