TY - GEN
T1 - Towards unsupervised semantic segmentation of street scenes from motion cues
AU - Sokeh, Hajar Sadeghi
AU - Gould, Stephen
PY - 2012
Y1 - 2012
N2 - Motion provides a rich source of information about the world. It can be used as an important cue to analyse the behaviour of objects in a scene and consequently identify interesting locations within it. In this paper, given an unannotated video sequence of a dynamic scene from fixed viewpoint, we first present a set of useful motion features that can be efficiently extracted at each pixel by optical flow. Using these features, we then develop an algorithm that can extract motion topic models and identify semantically significant regions and landmarks in a complex scene from a short video sequence. For example, by watching a street scene our algorithm can extract meaningful regions such as roads and important landmarks such as parking spots. Our method is robust to complicating factors such as shadows and occlusions.
AB - Motion provides a rich source of information about the world. It can be used as an important cue to analyse the behaviour of objects in a scene and consequently identify interesting locations within it. In this paper, given an unannotated video sequence of a dynamic scene from fixed viewpoint, we first present a set of useful motion features that can be efficiently extracted at each pixel by optical flow. Using these features, we then develop an algorithm that can extract motion topic models and identify semantically significant regions and landmarks in a complex scene from a short video sequence. For example, by watching a street scene our algorithm can extract meaningful regions such as roads and important landmarks such as parking spots. Our method is robust to complicating factors such as shadows and occlusions.
KW - motion cues
KW - scene understanding
KW - semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=84873382374&partnerID=8YFLogxK
U2 - 10.1145/2425836.2425884
DO - 10.1145/2425836.2425884
M3 - Conference contribution
SN - 9781450314732
T3 - ACM International Conference Proceeding Series
SP - 232
EP - 237
BT - Proceedings of IVCNZ 2012 - The 27th Image and Vision Computing New Zealand Conference
T2 - 27th Image and Vision Computing New Zealand Conference, IVCNZ 2012
Y2 - 26 November 2012 through 28 November 2012
ER -