Joint semantic and geometric segmentation of videos with a stage model

Buyu Liu, Xuming He, Stephen Gould

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Citations (Scopus)

    Abstract

    We address the problem of geometric and semantic consistent video segmentation for outdoor scenes. With no assumption on camera movement, we jointly model the semantic-geometric class of spatio-temporal regions (supervoxels) and geometric scene layout in each frame. Our main contribution is to propose a stage scene model to efficiently capture the dependency between the semantic and geometric labels. We build a unified CRF model on supervoxel labels and stage parameters, and design an alternating inference algorithm to minimize the resulting energy function. We also extend smoothing based on hierarchical image segmentation to spatio-temporal setting and show it achieves better performance than a pairwise random field model. Our method is evaluated on the CamVid dataset and achieves state-of-the-art per-pixel as well as per-class accuracy in predicting both semantic and geometric labels.

    Original languageEnglish
    Title of host publication2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
    PublisherIEEE Computer Society
    Pages737-744
    Number of pages8
    ISBN (Print)9781479949854
    DOIs
    Publication statusPublished - 2014
    Event2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014 - Steamboat Springs, CO, United States
    Duration: 24 Mar 201426 Mar 2014

    Publication series

    Name2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014

    Conference

    Conference2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
    Country/TerritoryUnited States
    CitySteamboat Springs, CO
    Period24/03/1426/03/14

    Fingerprint

    Dive into the research topics of 'Joint semantic and geometric segmentation of videos with a stage model'. Together they form a unique fingerprint.

    Cite this