Lending orientation to neural networks for cross-view geo-localization

Liu Liu, Hongdong Li

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    155 Citations (Scopus)

    Abstract

    This paper studies image-based geo-localization (IBL) problem using ground-to-aerial cross-view matching. The goal is to predict the spatial location of a ground-level query image by matching it to a large geotagged aerial image database (e.g., satellite imagery). This is a challenging task due to the drastic differences in their viewpoints and visual appearances. Existing deep learning methods for this problem have been focused on maximizing feature similarity between spatially close-by image pairs, while minimizing other images pairs which are far apart. They do so by deep feature embedding based on visual appearance in those ground-and-aerial images. However, in everyday life, humans commonly use orientation information as an important cue for the task of spatial localization. Inspired by this insight, this paper proposes a novel method which endows deep neural networks with the 'commonsense' of orientation. Given a ground-level spherical panoramic image as query input (and a large georeferenced satellite image database), we design a Siamese network which explicitly encodes the orientation (i.e., spherical directions) of each pixel of the images. Our method significantly boosts the discriminative power of the learned deep features, leading to a much higher recall and precision outperforming all previous methods. Our network is also more compact using only 1/5th number of parameters than a previously best-performing network. To evaluate the generalization of our method, we also created a large-scale cross-view localization benchmark containing 100K geotagged ground-aerial pairs covering a city. Our codes and datasets are available at https://github.com/Liumouliu/OriCNN.

    Original languageEnglish
    Title of host publicationProceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
    PublisherIEEE Computer Society
    Pages5617-5626
    Number of pages10
    ISBN (Electronic)9781728132938
    DOIs
    Publication statusPublished - Jun 2019
    Event32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 - Long Beach, United States
    Duration: 16 Jun 201920 Jun 2019

    Publication series

    NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    Volume2019-June
    ISSN (Print)1063-6919

    Conference

    Conference32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
    Country/TerritoryUnited States
    CityLong Beach
    Period16/06/1920/06/19

    Fingerprint

    Dive into the research topics of 'Lending orientation to neural networks for cross-view geo-localization'. Together they form a unique fingerprint.

    Cite this