Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts

Shafin Rahman*, Salman Khan, Fatih Porikli

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    67 Citations (Scopus)

    Abstract

    Current Zero-Shot Learning (ZSL) approaches are restricted to recognition of a single dominant unseen object category in a test image. We hypothesize that this setting is ill-suited for real-world applications where unseen objects appear only as a part of a complex scene, warranting both ‘recognition’ and ‘localization’ of an unseen category. To address this limitation, we introduce a new ‘Zero-Shot Detection’ (ZSD) problem setting, which aims at simultaneously recognizing and locating object instances belonging to novel categories without any training examples. We also propose a new experimental protocol for ZSD based on the highly challenging ILSVRC dataset, adhering to practical issues, e.g., the rarity of unseen objects. To the best of our knowledge, this is the first end-to-end deep network for ZSD that jointly models the interplay between visual and semantic domain information. To overcome the noise in the automatically derived semantic descriptions, we utilize the concept of meta-classes to design an original loss function that achieves synergy between max-margin class separation and semantic space clustering. Furthermore, we present a baseline approach extended from recognition to ZSD setting. Our extensive experiments show significant performance boost over the baseline on the imperative yet difficult ZSD problem.

    Original languageEnglish
    Title of host publicationComputer Vision – ACCV 2018 - 14th Asian Conference on Computer Vision, Revised Selected Papers
    EditorsKonrad Schindler, Hongdong Li, Greg Mori, C.V. Jawahar
    PublisherSpringer Verlag
    Pages547-563
    Number of pages17
    ISBN (Print)9783030208868
    DOIs
    Publication statusPublished - 2019
    Event14th Asian Conference on Computer Vision, ACCV 2018 - Perth, Australia
    Duration: 2 Dec 20186 Dec 2018

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume11361 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference14th Asian Conference on Computer Vision, ACCV 2018
    Country/TerritoryAustralia
    CityPerth
    Period2/12/186/12/18

    Fingerprint

    Dive into the research topics of 'Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts'. Together they form a unique fingerprint.

    Cite this