TY - JOUR
T1 - Quadruplet Network with One-Shot Learning for Fast Visual Object Tracking
AU - Dong, Xingping
AU - Shen, Jianbing
AU - Wu, Dongming
AU - Guo, Kan
AU - Jin, Xiaogang
AU - Porikli, Fatih
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2019/7
Y1 - 2019/7
N2 - In the same vein of discriminative one-shot learning, Siamese networks allow recognizing an object from a single exemplar with the same class label. However, they do not take advantage of the underlying structure of the data and the relationship among the multitude of samples as they only rely on the pairs of instances for training. In this paper, we propose a new quadruplet deep network to examine the potential connections among the training instances, aiming to achieve a more powerful representation. We design a shared network with four branches that receive a multi-tuple of instances as inputs and are connected by a novel loss function consisting of pair loss and triplet loss. According to the similarity metric, we select the most similar and the most dissimilar instances as the positive and negative inputs of triplet loss from each multi-tuple. We show that this scheme improves the training performance. Furthermore, we introduce a new weight layer to automatically select suitable combination weights, which will avoid the conflict between triplet and pair loss leading to worse performance. We evaluate our quadruplet framework by model-free tracking-by-detection of objects from a single initial exemplar in several visual object tracking benchmarks. Our extensive experimental analysis demonstrates that our tracker achieves superior performance with a real-time processing speed of 78 frames/s. Our source code is available.
AB - In the same vein of discriminative one-shot learning, Siamese networks allow recognizing an object from a single exemplar with the same class label. However, they do not take advantage of the underlying structure of the data and the relationship among the multitude of samples as they only rely on the pairs of instances for training. In this paper, we propose a new quadruplet deep network to examine the potential connections among the training instances, aiming to achieve a more powerful representation. We design a shared network with four branches that receive a multi-tuple of instances as inputs and are connected by a novel loss function consisting of pair loss and triplet loss. According to the similarity metric, we select the most similar and the most dissimilar instances as the positive and negative inputs of triplet loss from each multi-tuple. We show that this scheme improves the training performance. Furthermore, we introduce a new weight layer to automatically select suitable combination weights, which will avoid the conflict between triplet and pair loss leading to worse performance. We evaluate our quadruplet framework by model-free tracking-by-detection of objects from a single initial exemplar in several visual object tracking benchmarks. Our extensive experimental analysis demonstrates that our tracker achieves superior performance with a real-time processing speed of 78 frames/s. Our source code is available.
KW - Quadruplet deep network
KW - Siamese deep network
KW - visual object tracking
UR - http://www.scopus.com/inward/record.url?scp=85061524356&partnerID=8YFLogxK
U2 - 10.1109/TIP.2019.2898567
DO - 10.1109/TIP.2019.2898567
M3 - Article
SN - 1057-7149
VL - 28
SP - 3516
EP - 3527
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 7
M1 - 8638788
ER -