Zero-Shot Object Detection: Joint Recognition and Localization of Novel Concepts

Shafin Rahman*, Salman H. Khan, Fatih Porikli

*Corresponding author for this work

    Research output: Contribution to journal › Article › peer-review

    53 Citations (Scopus)

    Abstract

    Zero-shot learning (ZSL) identifies unseen objects for which no training images are available. Conventional ZSL approaches are restricted to a recognition setting where each test image is categorized into one of several unseen object classes. We posit that this setting is ill-suited for real-world applications where unseen objects appear only as a part of a complete scene, warranting both ‘recognition’ and ‘localization’ of the unseen category. To address this limitation, we introduce a new ‘Zero-Shot Detection’ (ZSD) problem setting, which aims at simultaneously recognizing and locating object instances belonging to novel categories, without any training samples. We introduce an integrated solution to the ZSD problem that jointly models the complex interplay between visual and semantic domain information. Ours is an end-to-end trainable deep network for ZSD that effectively overcomes the noise in the unsupervised semantic descriptions. To this end, we utilize the concept of meta-classes to design an original loss function that achieves synergy between max-margin class separation and semantic domain clustering. In order to set a benchmark for ZSD, we propose an experimental protocol for the large-scale ILSVRC dataset that adheres to practical challenges, e.g., rare classes are more likely to be the unseen ones. Furthermore, we present a baseline approach extended from conventional recognition to the ZSD setting. Our extensive experiments show a significant boost in performance (in terms of mAP and Recall) on the imperative yet difficult ZSD problem on the ImageNet detection, MSCOCO and FashionZSD datasets.
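
    The abstract describes scoring visual features against semantic class descriptions and combining max-margin class separation with meta-class clustering. The sketch below illustrates that general idea on toy data; it is not the paper's formulation. The linear projection `W`, the two-dimensional word vectors, the meta-class assignment, the specific hinge form of both loss terms, and all function names are assumptions made only for illustration.

```python
from itertools import product


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def class_scores(W, x, class_vectors):
    """Project visual feature x into the semantic space with the linear map W,
    then score it against each class word vector by inner product."""
    z = [dot(row, x) for row in W]
    return {name: dot(z, vec) for name, vec in class_vectors.items()}


def max_margin_loss(scores, true_class, margin=1.0):
    """Hinge penalty whenever another class scores within `margin` of the true class."""
    s_true = scores[true_class]
    return sum(
        max(0.0, margin - s_true + s)
        for name, s in scores.items()
        if name != true_class
    )


def meta_class_loss(scores, true_class, meta_of, margin=1.0):
    """Assumed clustering term: classes sharing the true class's meta-class should
    outscore classes from other meta-classes by `margin` (mean over all pairs)."""
    target = meta_of[true_class]
    same = [s for name, s in scores.items() if meta_of[name] == target]
    other = [s for name, s in scores.items() if meta_of[name] != target]
    if not other:
        return 0.0
    pairs = list(product(same, other))
    return sum(max(0.0, margin - a + b) for a, b in pairs) / len(pairs)


def predict_unseen(W, x, unseen_vectors):
    """Zero-shot prediction: pick the unseen class whose word vector scores highest."""
    scores = class_scores(W, x, unseen_vectors)
    return max(scores, key=scores.get)


# Toy data (hypothetical): identity projection and 2-D "word vectors".
W = [[1.0, 0.0], [0.0, 1.0]]
x = [1.0, 0.0]  # visual feature of one candidate box
WORD_VECTORS = {
    "cat": [1.0, 0.0], "tiger": [0.9, 0.1],
    "car": [0.0, 1.0], "truck": [0.1, 0.9],
}
META_OF = {"cat": "animal", "tiger": "animal", "car": "vehicle", "truck": "vehicle"}
UNSEEN = {name: WORD_VECTORS[name] for name in ("tiger", "truck")}

if __name__ == "__main__":
    scores = class_scores(W, x, WORD_VECTORS)
    total = max_margin_loss(scores, "cat") + meta_class_loss(scores, "cat", META_OF)
    print("combined toy loss:", total)
    print("predicted unseen class:", predict_unseen(W, x, UNSEEN))
```

    In this toy setup the clustering term pulls the scores of semantically related classes together, which is one plausible way noisy word vectors could be regularized; a detector would apply such scoring per region proposal alongside box regression.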

    Original language: English
    Pages (from-to): 2979-2999
    Number of pages: 21
    Journal: International Journal of Computer Vision
    Volume: 128
    Issue number: 12
    Publication status: Published - 1 Dec 2020