Optimization of robust loss functions for weakly-labeled image taxonomies: An ImageNet case study

Julian J. McAuley*, Arnau Ramisa, Tibério S. Caetano

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    1 Citation (Scopus)

    Abstract

    The recently proposed ImageNet dataset consists of several million images, each annotated with a single object category. However, these annotations may be imperfect, in the sense that many images contain multiple objects belonging to the label vocabulary. In other words, we have a multi-label problem but the annotations include only a single label (and not necessarily the most prominent). Such a setting motivates the use of a robust evaluation measure, which allows for a limited number of labels to be predicted and, as long as one of the predicted labels is correct, the overall prediction should be considered correct. This is indeed the type of evaluation measure used to assess algorithm performance in a recent competition on ImageNet data. Optimizing such types of performance measures presents several hurdles even with existing structured output learning methods. Indeed, many of the current state-of-the-art methods optimize the prediction of only a single output label, ignoring this 'structure' altogether. In this paper, we show how to directly optimize continuous surrogates of such performance measures using structured output learning techniques with latent variables. We use the output of existing binary classifiers as input features in a new learning stage which optimizes the structured loss corresponding to the robust performance measure. We present empirical evidence that this allows us to 'boost' the performance of existing binary classifiers which are the state-of-the-art for the task of object classification in ImageNet.

    Original languageEnglish
    Title of host publicationEnergy Minimazation Methods in Computer Vision and Pattern Recognition - 8th International Conference, EMMCVPR 2011, Proceedings
    Pages355-368
    Number of pages14
    DOIs
    Publication statusPublished - 2011
    Event8th International Conference on Energy Minimization Methods in Computer Vision and Pattern Recognition, EMMCVPR 2011 - St. Petersburg, Russian Federation
    Duration: 25 Jul 201127 Jul 2011

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume6819 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference8th International Conference on Energy Minimization Methods in Computer Vision and Pattern Recognition, EMMCVPR 2011
    Country/TerritoryRussian Federation
    CitySt. Petersburg
    Period25/07/1127/07/11

    Fingerprint

    Dive into the research topics of 'Optimization of robust loss functions for weakly-labeled image taxonomies: An ImageNet case study'. Together they form a unique fingerprint.

    Cite this