2D-3D semantic segmentation using cardinality as higher-order loss

Shahin Rahmatollahi Namin, Jose M. Alvarez, Lars Petersson

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Multi-modal scene analysis is a growing field of importance as additional sensors, such as 3D LIDAR, is becoming a common complement to image capturing systems. However, while additional sensory data potentially can make the analysis more accurate, it also comes with a host of associated issues. For example, inconsistencies in the data between sensors resulting from, e.g., misalignment, moving objects, or parallax effects, can severely affect the performance. Additionally, real-world scenes tend to have an inherent imbalance in the number of items of each class which typically suppresses the performance of infrequent classes. In this paper, we address those two issues specifically by a) using a cardinality loss function designed to target inconsistencies at training time, and b) devising an average per class loss function addressing the imbalance issue.

    Original languageEnglish
    Title of host publication2016 23rd International Conference on Pattern Recognition, ICPR 2016
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages3775-3780
    Number of pages6
    ISBN (Electronic)9781509048472
    DOIs
    Publication statusPublished - 1 Jan 2016
    Event23rd International Conference on Pattern Recognition, ICPR 2016 - Cancun, Mexico
    Duration: 4 Dec 20168 Dec 2016

    Publication series

    NameProceedings - International Conference on Pattern Recognition
    Volume0
    ISSN (Print)1051-4651

    Conference

    Conference23rd International Conference on Pattern Recognition, ICPR 2016
    Country/TerritoryMexico
    CityCancun
    Period4/12/168/12/16

    Fingerprint

    Dive into the research topics of '2D-3D semantic segmentation using cardinality as higher-order loss'. Together they form a unique fingerprint.

    Cite this