Large-scale semantic co-labeling of image sets

Jose M. Alvarez, Mathieu Salzmann, Nick Barnes

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Citations (Scopus)

Abstract

As evidenced by video segmentation and cosegmentation approaches, exploiting multiple images is key to the success of visual scene understanding. With the availability of increasingly large sets of images, there is a clear need for methods that can efficiently analyze the similarities and structure across huge numbers of image pixels. Furthermore, to make effective use of this data, these similarities should not just be considered locally between neighboring pixels, but between all pairs of pixels across all images. In this paper, we tackle this challenging scenario by introducing a semantic co-labeling approach that performs efficient inference in a fully-connected CRF defined over the pixels, or superpixels, of an image set. Our experimental evaluation demonstrates that our approach yields improved accuracy while coming at no additional computation cost compared to performing segmentation sequentially on individual images. Furthermore, our formulation lets us perform inference over ten thousand images in a matter of seconds.

Original languageEnglish
Title of host publication2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
PublisherIEEE Computer Society
Pages501-508
Number of pages8
ISBN (Print)9781479949854
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014 - Steamboat Springs, CO, United States
Duration: 24 Mar 201426 Mar 2014

Publication series

Name2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014

Conference

Conference2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
Country/TerritoryUnited States
CitySteamboat Springs, CO
Period24/03/1426/03/14

Fingerprint

Dive into the research topics of 'Large-scale semantic co-labeling of image sets'. Together they form a unique fingerprint.

Cite this