Learning spatial transforms for refining object segment proposals

Haoyang Zhang, Xuming He, Fatih Porikli

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

We address the problem of object segment proposal generation, which is a critical step in many instance-level semantic segmentation and scene understanding pipelines. In contrast to prior works that predict binary segment masks from images, we take an alternative refinement approach to improve the quality of a given segment candidate pool. In particular, we propose an efficient deep network that learns 2D spatial transforms to warp an initial object mask towards nearby object region. We formulate this segment refinement task as a regression problem and design a novel feature pooling strategy in our deep network to predict an affine transformation for each object mask. We evaluate our method extensively on two challenging public benchmarks and apply our refinement network to three different initial segment proposal settings. Our results show sizable improvements in average recall across all the settings, achieving the state-of-The-Art performances.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE Winter Conference on Applications of Computer Vision, WACV 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages37-46
Number of pages10
ISBN (Electronic)9781509048229
DOIs
Publication statusPublished - 11 May 2017
Externally publishedYes
Event17th IEEE Winter Conference on Applications of Computer Vision, WACV 2017 - Santa Rosa, United States
Duration: 24 Mar 201731 Mar 2017

Publication series

NameProceedings - 2017 IEEE Winter Conference on Applications of Computer Vision, WACV 2017

Conference

Conference17th IEEE Winter Conference on Applications of Computer Vision, WACV 2017
Country/TerritoryUnited States
CitySanta Rosa
Period24/03/1731/03/17

Fingerprint

Dive into the research topics of 'Learning spatial transforms for refining object segment proposals'. Together they form a unique fingerprint.

Cite this