Learning dynamic hierarchical models for anytime scene labeling

Buyu Liu*, Xuming He

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    7 Citations (Scopus)

    Abstract

    With increasing demand for efficient image and video analysis, test-time cost of scene parsing becomes critical for many large-scale or time-sensitive vision applications. We propose a dynamic hierarchical model for anytime scene labeling that allows us to achieve flexible tradeoffs between efficiency and accuracy in pixel-level prediction. In particular, our approach incorporates the cost of feature computation and model inference, and optimizes the model performance for any given test-time budget by learning a sequence of image-adaptive hierarchical models. We formulate this anytime representation learning as a Markov Decision Process with a discrete-continuous state-action space. A high-quality policy of feature and model selection is learned based on an approximate policy iteration method with action proposal mechanism. We demonstrate the advantages of our dynamic non-myopic anytime scene parsing on three semantic segmentation datasets, which achieves 90% of the state-of-the-art performances by using 15% of their overall costs.

    Original languageEnglish
    Title of host publicationComputer Vision - 14th European Conference, ECCV 2016, Proceedings
    EditorsBastian Leibe, Jiri Matas, Nicu Sebe, Max Welling
    PublisherSpringer Verlag
    Pages650-666
    Number of pages17
    ISBN (Print)9783319464657
    DOIs
    Publication statusPublished - 2016
    Event14th European Conference on Computer Vision, ECCV 2016 - Amsterdam, Netherlands
    Duration: 8 Oct 201616 Oct 2016

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume9910 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference14th European Conference on Computer Vision, ECCV 2016
    Country/TerritoryNetherlands
    CityAmsterdam
    Period8/10/1616/10/16

    Fingerprint

    Dive into the research topics of 'Learning dynamic hierarchical models for anytime scene labeling'. Together they form a unique fingerprint.

    Cite this