TY - JOUR
T1 - Large-scale vehicle detection, indexing, and search in urban surveillance videos
AU - Feris, Rogerio Schmidt
AU - Siddiquie, Behjat
AU - Petterson, James
AU - Zhai, Yun
AU - Datta, Ankur
AU - Brown, Lisa M.
AU - Pankanti, Sharath
PY - 2012/2
Y1 - 2012/2
N2 - We present a novel approach for visual detection and attribute-based search of vehicles in crowded surveillance scenes. Large-scale processing is addressed along two dimensions: 1) large-scale indexing, where hundreds of billions of events need to be archived per month to enable effective search and 2) learning vehicle detectors with large-scale feature selection, using a feature pool containing millions of feature descriptors. Our method for vehicle detection also explicitly models occlusions and multiple vehicle types (e.g., buses, trucks, SUVs, cars), while requiring very few manual labeling. It runs quite efficiently at an average of 66 Hz on a conventional laptop computer. Once a vehicle is detected and tracked over the video, fine-grained attributes are extracted and ingested into a database to allow future search queries such as Show me all blue trucks larger than 7 ft. length traveling at high speed northbound last Saturday, from 2 pm to 5 pm. We perform a comprehensive quantitative analysis to validate our approach, showing its usefulness in realistic urban surveillance settings.
AB - We present a novel approach for visual detection and attribute-based search of vehicles in crowded surveillance scenes. Large-scale processing is addressed along two dimensions: 1) large-scale indexing, where hundreds of billions of events need to be archived per month to enable effective search and 2) learning vehicle detectors with large-scale feature selection, using a feature pool containing millions of feature descriptors. Our method for vehicle detection also explicitly models occlusions and multiple vehicle types (e.g., buses, trucks, SUVs, cars), while requiring very few manual labeling. It runs quite efficiently at an average of 66 Hz on a conventional laptop computer. Once a vehicle is detected and tracked over the video, fine-grained attributes are extracted and ingested into a database to allow future search queries such as Show me all blue trucks larger than 7 ft. length traveling at high speed northbound last Saturday, from 2 pm to 5 pm. We perform a comprehensive quantitative analysis to validate our approach, showing its usefulness in realistic urban surveillance settings.
KW - Large-scale learning
KW - large-scale video collections
KW - vehicle search
KW - video surveillance
UR - http://www.scopus.com/inward/record.url?scp=84856193864&partnerID=8YFLogxK
U2 - 10.1109/TMM.2011.2170666
DO - 10.1109/TMM.2011.2170666
M3 - Article
SN - 1520-9210
VL - 14
SP - 28
EP - 42
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 1
M1 - 6035786
ER -