3D Guided Weakly Supervised Semantic Segmentation

Weixuan Sun*, Jing Zhang, Nick Barnes

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Citations (Scopus)

    Abstract

    Pixel-wise clean annotation is necessary for fully-supervised semantic segmentation, which is laborious and expensive to obtain. In this paper, we propose a weakly supervised 2D semantic segmentation model by incorporating sparse bounding box labels with available 3D information, which is much easier to obtain with advanced sensors. We introduce a 2D-3D inference module to generate accurate pixel-wise segment proposal masks. Guided by 3D information, we first generate a point cloud of objects and calculate a per class objectness probability score for each point using projected bounding-boxes. Then we project the point cloud with objectness probabilities back to the 2D images followed by a refinement step to obtain segment proposals, which are treated as pseudo labels to train a semantic segmentation network. Our method works in a recursive manner to gradually refine the above-mentioned segment proposals. We conducted extensive experimental results on the 2D-3D-S dataset where we manually labeled a subset of images with bounding boxes. We show that the proposed method can generate accurate segment proposals when bounding box labels are available on only a small subset of training images. Performance comparison with recent state-of-the-art methods further illustrates the effectiveness of our method.

    Original languageEnglish
    Title of host publicationComputer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers
    EditorsHiroshi Ishikawa, Cheng-Lin Liu, Tomas Pajdla, Jianbo Shi
    PublisherSpringer Science and Business Media Deutschland GmbH
    Pages585-602
    Number of pages18
    ISBN (Print)9783030695248
    DOIs
    Publication statusPublished - 2021
    Event15th Asian Conference on Computer Vision, ACCV 2020 - Virtual, Online
    Duration: 30 Nov 20204 Dec 2020

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume12622 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference15th Asian Conference on Computer Vision, ACCV 2020
    CityVirtual, Online
    Period30/11/204/12/20

    Fingerprint

    Dive into the research topics of '3D Guided Weakly Supervised Semantic Segmentation'. Together they form a unique fingerprint.

    Cite this