Indoor scene parsing with instance segmentation, semantic labeling and support relationship inference

Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    29 Citations (Scopus)

    Abstract

    Over the years, indoor scene parsing has attracted a growing interest in the computer vision community. Existing methods have typically focused on diverse subtasks of this challenging problem. In particular, while some of them aim at segmenting the image into regions, such as object or surface instances, others aim at inferring the semantic labels of given regions, or their support relationships. These different tasks are typically treated as separate ones. However, they bear strong connections: good regions should respect the semantic labels; support can only be defined for meaningful regions; support relationships strongly depend on semantics. In this paper, we therefore introduce an approach to jointly segment the instances and infer their semantic labels and support relationships from a single input image. By exploiting a hierarchical segmentation, we formulate our problem as that of jointly finding the regions in the hierarchy that correspond to instances and estimating their class labels and pairwise support relationships. We express this via a Markov Random Field, which allows us to further encode links between the different types of variables. Inference in this model can be done exactly via integer linear programming, and we learn its parameters in a structural SVM framework. Our experiments on NYUv2 demonstrate the benefits of reasoning jointly about all these subtasks of indoor scene parsing.

    Original languageEnglish
    Title of host publicationProceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages6269-6275
    Number of pages7
    ISBN (Electronic)9781538604571
    DOIs
    Publication statusPublished - 6 Nov 2017
    Event30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 - Honolulu, United States
    Duration: 21 Jul 201726 Jul 2017

    Publication series

    NameProceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
    Volume2017-January

    Conference

    Conference30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
    Country/TerritoryUnited States
    CityHonolulu
    Period21/07/1726/07/17

    Fingerprint

    Dive into the research topics of 'Indoor scene parsing with instance segmentation, semantic labeling and support relationship inference'. Together they form a unique fingerprint.

    Cite this