TY - GEN
T1 - Large-scale semantic co-labeling of image sets
AU - Alvarez, Jose M.
AU - Salzmann, Mathieu
AU - Barnes, Nick
PY - 2014
Y1 - 2014
N2 - As evidenced by video segmentation and cosegmentation approaches, exploiting multiple images is key to the success of visual scene understanding. With the availability of increasingly large sets of images, there is a clear need for methods that can efficiently analyze the similarities and structure across huge numbers of image pixels. Furthermore, to make effective use of this data, these similarities should not just be considered locally between neighboring pixels, but between all pairs of pixels across all images. In this paper, we tackle this challenging scenario by introducing a semantic co-labeling approach that performs efficient inference in a fully-connected CRF defined over the pixels, or superpixels, of an image set. Our experimental evaluation demonstrates that our approach yields improved accuracy while coming at no additional computation cost compared to performing segmentation sequentially on individual images. Furthermore, our formulation lets us perform inference over ten thousand images in a matter of seconds.
AB - As evidenced by video segmentation and cosegmentation approaches, exploiting multiple images is key to the success of visual scene understanding. With the availability of increasingly large sets of images, there is a clear need for methods that can efficiently analyze the similarities and structure across huge numbers of image pixels. Furthermore, to make effective use of this data, these similarities should not just be considered locally between neighboring pixels, but between all pairs of pixels across all images. In this paper, we tackle this challenging scenario by introducing a semantic co-labeling approach that performs efficient inference in a fully-connected CRF defined over the pixels, or superpixels, of an image set. Our experimental evaluation demonstrates that our approach yields improved accuracy while coming at no additional computation cost compared to performing segmentation sequentially on individual images. Furthermore, our formulation lets us perform inference over ten thousand images in a matter of seconds.
UR - http://www.scopus.com/inward/record.url?scp=84904705247&partnerID=8YFLogxK
U2 - 10.1109/WACV.2014.6836060
DO - 10.1109/WACV.2014.6836060
M3 - Conference contribution
SN - 9781479949854
T3 - 2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
SP - 501
EP - 508
BT - 2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
PB - IEEE Computer Society
T2 - 2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
Y2 - 24 March 2014 through 26 March 2014
ER -