TY - JOUR
T1 - Beyond feature integration
T2 - a coarse-to-fine framework for cascade correlation tracking
AU - Li, Dongdong
AU - Wen, Gongjian
AU - Kuai, Yangliu
AU - Porikli, Fatih
N1 - Publisher Copyright: © 2019, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2019/4/15
Y1 - 2019/4/15
N2 - Discriminative correlation filters (DCF) have achieved enormous popularity in the tracking community. Recently, the performance advancement in DCF-based trackers is predominantly driven by the use of convolutional features. In pursuit of extreme tracking performance, state-of-the-art trackers (e.g., cascade correlation tracking [1] and HCF [2]) equip DCF with hierarchical convolutional features to capture both semantics and spatial details of the target appearance. While such methods have been shown to work well, multiple feature integration results in high model complexity which significantly increases the over-fitting risk and computational burden. In this paper, we present a coarse-to-fine framework for cascade correlation tracking (CCT). Instead of integrating hierarchical features, this framework decomposes a complicated tracker into two low-complexity modules, a coarse tracker C and a refined tracker R, working in a coarse-to-fine manner. The coarse tracker C employs low-resolution semantic convolutional features extracted from a large search area to cope with large target displacement and appearance change between adjacent frames. By contrast, the refined tracker R employs high-resolution handcraft features extracted from a small search area to further refine the coarse location of C. Our CCT tracker enjoys the strong discriminative power of C and the high efficiency of R. Experiments on the OTB2013 and TC128 benchmarks show that CCT performs favorably against state-of-the-art trackers.
AB - Discriminative correlation filters (DCF) have achieved enormous popularity in the tracking community. Recently, the performance advancement in DCF-based trackers is predominantly driven by the use of convolutional features. In pursuit of extreme tracking performance, state-of-the-art trackers (e.g., cascade correlation tracking [1] and HCF [2]) equip DCF with hierarchical convolutional features to capture both semantics and spatial details of the target appearance. While such methods have been shown to work well, multiple feature integration results in high model complexity which significantly increases the over-fitting risk and computational burden. In this paper, we present a coarse-to-fine framework for cascade correlation tracking (CCT). Instead of integrating hierarchical features, this framework decomposes a complicated tracker into two low-complexity modules, a coarse tracker C and a refined tracker R, working in a coarse-to-fine manner. The coarse tracker C employs low-resolution semantic convolutional features extracted from a large search area to cope with large target displacement and appearance change between adjacent frames. By contrast, the refined tracker R employs high-resolution handcraft features extracted from a small search area to further refine the coarse location of C. Our CCT tracker enjoys the strong discriminative power of C and the high efficiency of R. Experiments on the OTB2013 and TC128 benchmarks show that CCT performs favorably against state-of-the-art trackers.
KW - Coarse to fine
KW - Correlation filter
KW - Visual tracking
UR - http://www.scopus.com/inward/record.url?scp=85061584888&partnerID=8YFLogxK
U2 - 10.1007/s00138-019-01009-9
DO - 10.1007/s00138-019-01009-9
M3 - Article
SN - 0932-8092
VL - 30
SP - 519
EP - 528
JO - Machine Vision and Applications
JF - Machine Vision and Applications
IS - 3
ER -