Learning Hough forest with depth-encoded context for object detection

Tao Wang*, Xuming He, Nick Barnes

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    3 Citations (Scopus)

    Abstract

    In this paper, we propose a novel extension to the Class-specific Hough Forest (CHF) framework for object detection and localization. Our approach utilizes depth information during training to build a more discriminative codebook which simultaneously encodes features from the object and the surrounding context. In particular, we augment the CHF with contextual image patches, and design a series of depth-aware uncertainty measures for the binary tests used in CHF training. The new splitting criteria integrates relative physical scales of image patches, 3D offset uncertainty of votes, and 3D-distance modulated voting confidence. We show that the extended CHF is capable of learning better context models and building high- quality codebooks in appearance. As the model relies on depth information only in training, our system can be applied to object localization in 2D images. We demonstrate the efficacy of our method by experiments on two challenging RGB-D object datasets, and we empirically show our method achieves significant improvement over the state of the art with a more robust Hough voting scheme.

    Original languageEnglish
    Title of host publication2012 International Conference on Digital Image Computing Techniques and Applications, DICTA 2012
    DOIs
    Publication statusPublished - 2012
    Event2012 14th International Conference on Digital Image Computing Techniques and Applications, DICTA 2012 - Fremantle, WA, Australia
    Duration: 3 Dec 20125 Dec 2012

    Publication series

    Name2012 International Conference on Digital Image Computing Techniques and Applications, DICTA 2012

    Conference

    Conference2012 14th International Conference on Digital Image Computing Techniques and Applications, DICTA 2012
    Country/TerritoryAustralia
    CityFremantle, WA
    Period3/12/125/12/12

    Fingerprint

    Dive into the research topics of 'Learning Hough forest with depth-encoded context for object detection'. Together they form a unique fingerprint.

    Cite this