More Web Proxy on the site http://driver.im/

Article

Identity-Guided Human Semantic Parsing for Person Re-identification

Authors:

Jinqiao WangAuthors Info & Claims

Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part III

Pages 346 - 363

https://doi.org/10.1007/978-3-030-58580-8_21

Published: 23 August 2020 Publication History

Abstract

Existing alignment-based methods have to employ the pre-trained human parsing models to achieve the pixel-level alignment, and cannot identify the personal belongings (e.g., backpacks and reticule) which are crucial to person re-ID. In this paper, we propose the identity-guided human semantic parsing approach (ISP) to locate both the human body parts and personal belongings at pixel-level for aligned person re-ID only with person identity labels. We design the cascaded clustering on feature maps to generate the pseudo-labels of human parts. Specifically, for the pixels of all images of a person, we first group them to foreground or background and then group the foreground pixels to human parts. The cluster assignments are subsequently used as pseudo-labels of human parts to supervise the part estimation and ISP iteratively learns the feature maps and groups them. Finally, local features of both human body parts and personal belongings are obtained according to the self-learned part estimation, and only features of visible parts are utilized for the retrieval. Extensive experiments on three widely used datasets validate the superiority of ISP over lots of state-of-the-art methods. Our code is available at https://github.com/CASIA-IVA-Lab/ISP-reID.

References

[1]

Alp Güler, R., Neverova, N., Kokkinos, I.: DensePose: dense human pose estimation in the wild. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7297–7306 (2018)

[2]

Caron M, Bojanowski P, Joulin A, and Douze M Ferrari V, Hebert M, Sminchisescu C, and Weiss Y Deep clustering for unsupervised learning of visual features Computer Vision – ECCV 2018 2018 Cham Springer 139-156

Digital Library

[3]

Chen, B., Deng, W., Hu, J.: Mixed high-order attention network for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision (2019)

[4]

Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)

[5]

Ge, Y., Li, Z., Zhao, H., Yin, G., Yi, S., Wang, X., et al.: FD-GAN: pose-guided feature distilling GAN for robust person re-identification. In: Advances in Neural Information Processing Systems, pp. 1222–1233 (2018)

[6]

Guo, J., Yuan, Y., Huang, L., Zhang, C., Yao, J.G., Han, K.: Beyond human parts: dual part-aligned representations for person re-identification. In: The IEEE International Conference on Computer Vision (ICCV), October 2019

[7]

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

[8]

He, L., Liang, J., Li, H., Sun, Z.: Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7073–7082 (2018)

[9]

He, L., Sun, Z., Zhu, Y., Wang, Y.: Recognizing partial biometric patterns. arXiv preprint arXiv:1810.07399 (2018)

[10]

Hermans, A., Beyer, L., Leibe, B.: In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737 (2017)

[11]

Hong, S., Noh, H., Han, B.: Decoupled deep neural network for semi-supervised semantic segmentation. In: Advances in Neural Information Processing Systems, pp. 1495–1503 (2015)

[12]

Hong, S., Yeo, D., Kwak, S., Lee, H., Han, B.: Weakly supervised semantic segmentation using web-crawled videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7322–7330 (2017)

[13]

Hou, R., Ma, B., Chang, H., Gu, X., Shan, S., Chen, X.: Interaction-and-aggregation network for person re-identification. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019

[14]

Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)

[15]

Huang, H., Li, D., Zhang, Z., Chen, X., Huang, K.: Adversarially occluded samples for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5098–5107 (2018)

[16]

Huang, Z., Wang, X., Wang, J., Liu, W., Wang, J.: Weakly-supervised semantic segmentation network with deep seeded region growing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7014–7023 (2018)

[17]

Jaderberg, M., Simonyan, K., Zisserman, A., et al.: Spatial transformer networks. In: Advances in Neural Information Processing Systems, pp. 2017–2025 (2015)

[18]

Kalayeh, M.M., Basaran, E., Gökmen, M., Kamasak, M.E., Shah, M.: Human semantic parsing for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1062–1071 (2018)

[19]

Lee, J., Kim, E., Lee, S., Lee, J., Yoon, S.: FickleNet: weakly and semi-supervised semantic image segmentation using stochastic inference. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5267–5276 (2019)

[20]

Li, D., Chen, X., Zhang, Z., Huang, K.: Learning deep context-aware features over body and latent parts for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 384–393 (2017)

[21]

Li, P., Xu, Y., Wei, Y., Yang, Y.: Self-correction for human parsing. arXiv preprint arXiv:1910.09777 (2019)

[22]

Li, W., Zhao, R., Xiao, T., Wang, X.: DeepReId: deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 152–159 (2014)

[23]

Li, W., Zhu, X., Gong, S.: Harmonious attention network for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2285–2294 (2018)

[24]

Liang X, Gong K, Shen X, and Lin L Look into person: joint body parsing & pose estimation network and a new benchmark IEEE Trans. Pattern Anal. Mach. Intell. 2018 41 4 871-885

Digital Library

[25]

Lin, Y., Dong, X., Zheng, L., Yan, Y., Yang, Y.: A bottom-up clustering approach to unsupervised person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8738–8745 (2019)

[26]

Liu, J., Ni, B., Yan, Y., Zhou, P., Cheng, S., Hu, J.: Pose transferrable person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4099–4108 (2018)

[27]

Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (2019)

[28]

Miao, J., Wu, Y., Liu, P., Ding, Y., Yang, Y.: Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 542–551 (2019)

[29]

Saquib Sarfraz, M., Schumann, A., Eberle, A., Stiefelhagen, R.: A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 420–429 (2018)

[30]

Si, J., Zhang, H., Li, C.G., Kuen, J., Kong, X., Kot, A.C., Wang, G.: Dual attention matching network for context-aware feature sequence based person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5363–5372 (2018)

[31]

Song, C., Huang, Y., Ouyang, W., Wang, L.: Mask-guided contrastive attention model for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1179–1188 (2018)

[32]

Souly, N., Spampinato, C., Shah, M.: Semi and weakly supervised semantic segmentation using generative adversarial network. arXiv preprint arXiv:1703.09695 (2017)

[33]

Su, C., Li, J., Zhang, S., Xing, J., Gao, W., Tian, Q.: Pose-driven deep convolutional model for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3960–3969 (2017)

[34]

Suh Y, Wang J, Tang S, Mei T, and Lee KM Ferrari V, Hebert M, Sminchisescu C, and Weiss Y Part-aligned bilinear representations for person re-identification Computer Vision – ECCV 2018 2018 Cham Springer 418-437

Digital Library

[35]

Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: CVPR (2019)

[36]

Sun Y, Zheng L, Yang Y, Tian Q, and Wang S Ferrari V, Hebert M, Sminchisescu C, and Weiss Y Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline) Computer Vision – ECCV 2018 2018 Cham Springer 501-518

Digital Library

[37]

Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)

[38]

Tay, C.P., Roy, S., Yap, K.H.: AANet: attribute attention network for person re-identifications. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019

[39]

Wang C, Zhang Q, Huang C, Liu W, and Wang X Ferrari V, Hebert M, Sminchisescu C, and Weiss Y Mancs: a multi-task attentional network with curriculum sampling for person re-identification Computer Vision – ECCV 2018 2018 Cham Springer 384-400

Digital Library

[40]

Wang, F., Jiang, M., Qian, C., Yang, S., Li, C., Zhang, H., Wang, X., Tang, X.: Residual attention network for image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2017)

[41]

Wang, G., Yuan, Y., Chen, X., Li, J., Zhou, X.: Learning discriminative features with multiple granularities for person re-identification. In: 2018 ACM Multimedia Conference on Multimedia Conference, pp. 274–282. ACM (2018)

[42]

Wang, X., You, S., Li, X., Ma, H.: Weakly-supervised semantic segmentation by iteratively mining common object features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1354–1362 (2018)

[43]

Wang, Y., et al.: Resource aware person re-identification across multiple resolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8042–8051 (2018)

[44]

Wei, L., Zhang, S., Yao, H., Gao, W., Tian, Q.: GLAD: global-local-alignment descriptor for pedestrian retrieval. In: Proceedings of the 25th ACM International Conference on Multimedia, pp. 420–428. ACM (2017)

[45]

Wei, Y., Feng, J., Liang, X., Cheng, M.M., Zhao, Y., Yan, S.: Object region mining with adversarial erasing: a simple classification to semantic segmentation approach. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1568–1576 (2017)

[46]

Woo S, Park J, Lee J-Y, and Kweon IS Ferrari V, Hebert M, Sminchisescu C, and Weiss Y CBAM: convolutional block attention module Computer Vision – ECCV 2018 2018 Cham Springer 3-19

Digital Library

[47]

Xu, J., Zhao, R., Zhu, F., Wang, H., Ouyang, W.: Attention-aware compositional network for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2119–2128 (2018)

[48]

Yang, W., Huang, H., Zhang, Z., Chen, X., Huang, K., Zhang, S.: Towards rich feature discovery with class activation maps augmentation for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1389–1398 (2019)

[49]

Yao H, Zhang S, Hong R, Zhang Y, Xu C, and Tian Q Deep representation learning with part loss for person re-identification IEEE Trans. Image Process. 2019 28 6 2860-2871

[50]

Zhang, X., et al.: AlignedReID: surpassing human-level performance in person re-identification. arXiv preprint arXiv:1711.08184v2 (2018)

[51]

Zhang, Z., Lan, C., Zeng, W., Chen, Z.: Densely semantically aligned person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 667–676 (2019)

[52]

Zhao, L., Li, X., Zhuang, Y., Wang, J.: Deeply-learned part-aligned representations for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3219–3228 (2017)

[53]

Zheng, F., et al.: Pyramidal person re-identification via multi-loss dynamic training. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019

[54]

Zheng, L., Huang, Y., Lu, H., Yang, Y.: Pose invariant embedding for deep person re-identification. IEEE Trans. Image Process. (2019)

[55]

Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., Tian, Q.: Scalable person re-identification: a benchmark. In: Computer Vision, IEEE International Conference on Computer Vision, pp. 1116–1124 (2015)

[56]

Zheng, M., Karanam, S., Wu, Z., Radke, R.J.: Re-identification with consistent attentive siamese networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5735–5744 (2019)

[57]

Zheng, Z., Zheng, L., Yang, Y.: Unlabeled samples generated by GAN improve the person re-identification baseline in vitro. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3754–3762 (2017)

[58]

Zhong, Z., Zheng, L., Cao, D., Li, S.: Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1318–1327 (2017)

[59]

Zhong, Z., Zheng, L., Kang, G., Li, S., Yang, Y.: Random erasing data augmentation. arXiv preprint arXiv:1708.04896 (2017)

[60]

Zhou, Y., Zhu, Y., Ye, Q., Qiu, Q., Jiao, J.: Weakly supervised instance segmentation using class peak response. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3791–3800 (2018)

Cited By

Liu FYe MDu BCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)Cloth-aware Augmentation for Cloth-generalized Person Re-identificationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680956(4053-4062)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3680956
Li YCheng DFang CJiao CWang NGao XCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)Disentangling Identity Features from Interference Factors for Cloth-Changing Person Re-identificationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680823(2252-2261)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3680823
Zhang YYang YKang WZhen J(2024)Multi-scale occlusion suppression network for occluded person re-identificationPattern Recognition Letters10.1016/j.patrec.2024.07.009185:C(66-72)Online publication date: 1-Sep-2024
https://dl.acm.org/doi/10.1016/j.patrec.2024.07.009
Show More Cited By

Recommendations

Uncertainty-guided Robust labels refinement for unsupervised person re-identification
Abstract
Existing state-of-the-art unsupervised person re-identification (Re-ID) methods rely on clustering to generate pseudo labels for training. The reliability of pseudo labels directly affects the performance of these methods. Due to unsatisfactory ...
Multi-Human Parsing Machines
MM '18: Proceedings of the 26th ACM international conference on Multimedia

Human parsing is an important task in human-centric analysis. Despite the remarkable progress in single-human parsing, the more realistic case of multi-human parsing remains challenging in terms of the data and the model. Compared with the considerable ...
Mask-Guided Deformation Adaptive Network for Human Parsing
Due to the challenges of densely compacted body parts, nonrigid clothing items, and severe overlap in crowd scenes, human parsing needs to focus more on multilevel feature representations compared to general scene parsing tasks. Based on this observation, ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings

Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part III

Aug 2020

829 pages

ISBN:978-3-030-58579-2

DOI:10.1007/978-3-030-58580-8

Editors:
Andrea Vedaldi
University of Oxford, Oxford, UK
,
Horst Bischof
Graz University of Technology, Graz, Austria
,
Thomas Brox
University of Freiburg, Freiburg im Breisgau, Germany
,
Jan-Michael Frahm
University of North Carolina at Chapel Hill, Chapel Hill, NC, USA

© Springer Nature Switzerland AG 2020.

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 23 August 2020

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

32
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 10 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Liu FYe MDu BCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)Cloth-aware Augmentation for Cloth-generalized Person Re-identificationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680956(4053-4062)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3680956
Li YCheng DFang CJiao CWang NGao XCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)Disentangling Identity Features from Interference Factors for Cloth-Changing Person Re-identificationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680823(2252-2261)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3680823
Zhang YYang YKang WZhen J(2024)Multi-scale occlusion suppression network for occluded person re-identificationPattern Recognition Letters10.1016/j.patrec.2024.07.009185:C(66-72)Online publication date: 1-Sep-2024
https://dl.acm.org/doi/10.1016/j.patrec.2024.07.009
Pham DNguyen ANguyen H(2024)GAN-based data augmentation and pseudo-label refinement with holistic features for unsupervised domain adaptation person re-identificationKnowledge-Based Systems10.1016/j.knosys.2024.111471288:COnline publication date: 15-Mar-2024
https://dl.acm.org/doi/10.1016/j.knosys.2024.111471
Lv GDing YChen XZheng Y(2024)MP2PMatchJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.104128100:COnline publication date: 17-Jul-2024
https://dl.acm.org/doi/10.1016/j.jvcir.2024.104128
Xie GSu HLuo YWang WYi YZhong S(2024)Person re-identification by utilizing hierarchical spatial relation reasoningImage and Vision Computing10.1016/j.imavis.2024.105111148:COnline publication date: 1-Aug-2024
https://dl.acm.org/doi/10.1016/j.imavis.2024.105111
Pang YZhang HZhu LLiu DLiu L(2024)Self-similarity guided probabilistic embedding matching based on transformer for occluded person re-identificationExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.121504237:PBOnline publication date: 1-Feb-2024
https://dl.acm.org/doi/10.1016/j.eswa.2023.121504
Yang SYang HPu YWang YYou Z(2024)Context-Aware Relative Distinctive Feature Learning for Person Re-identificationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5603-2_17(203-215)Online publication date: 5-Aug-2024
https://dl.acm.org/doi/10.1007/978-981-97-5603-2_17
Sun ZWu GPan QLi YLiu Y(2024)Person Re-identification Method Based on Dual Feature Attention Backbone NetworkAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5597-4_41(484-495)Online publication date: 5-Aug-2024
https://dl.acm.org/doi/10.1007/978-981-97-5597-4_41
Somers VAlahi AVleeschouwer C(2024)Keypoint Promptable Re-IdentificationComputer Vision – ECCV 202410.1007/978-3-031-72986-7_13(216-233)Online publication date: 29-Sep-2024
https://dl.acm.org/doi/10.1007/978-3-031-72986-7_13
Show More Cited By

View Options

View options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents