TY - JOUR
T1 - Pose-Invariant Embedding for Deep Person Re-Identification
AU - Zheng, Liang
AU - Huang, Yujia
AU - Lu, Huchuan
AU - Yang, Yi
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2019/9
Y1 - 2019/9
N2 - Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With poor alignment, the feature learning and matching process might be largely compromised. To address this problem, this paper introduces pose-invariant embedding (PIE) as a pedestrian descriptor. First, in order to align pedestrians to a standard pose, the PoseBox structure is introduced, which is generated through pose estimation followed by affine transformations. Second, to reduce the impact of pose estimation errors and information loss during the PoseBox construction, we design a PoseBox fusion (PBF) CNN architecture that takes the original image, the PoseBox, and the pose estimation confidence as input. The proposed PIE descriptor is thus defined as the fully connected layer of the PBF network for the retrieval task. Experiments are conducted on the Market-1501, CUHK03-NP, and DukeMTMC-reID datasets. We show that PoseBox alone yields decent re-ID accuracy and that when integrated in the PBF network, the learned PIE descriptor produces competitive performance compared with state-of-the-art approaches.
AB - Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With poor alignment, the feature learning and matching process might be largely compromised. To address this problem, this paper introduces pose-invariant embedding (PIE) as a pedestrian descriptor. First, in order to align pedestrians to a standard pose, the PoseBox structure is introduced, which is generated through pose estimation followed by affine transformations. Second, to reduce the impact of pose estimation errors and information loss during the PoseBox construction, we design a PoseBox fusion (PBF) CNN architecture that takes the original image, the PoseBox, and the pose estimation confidence as input. The proposed PIE descriptor is thus defined as the fully connected layer of the PBF network for the retrieval task. Experiments are conducted on the Market-1501, CUHK03-NP, and DukeMTMC-reID datasets. We show that PoseBox alone yields decent re-ID accuracy and that when integrated in the PBF network, the learned PIE descriptor produces competitive performance compared with state-of-the-art approaches.
KW - Pose invariant embedding
KW - PoseBox
KW - PoseBox fusion network
KW - person re-identification
UR - http://www.scopus.com/inward/record.url?scp=85065572972&partnerID=8YFLogxK
U2 - 10.1109/TIP.2019.2910414
DO - 10.1109/TIP.2019.2910414
M3 - Article
SN - 1057-7149
VL - 28
SP - 4500
EP - 4509
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 9
M1 - 8693885
ER -