[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Dual-Stream Guided-Learning via a Priori Optimization for Person Re-identification

Published: 13 January 2022 Publication History

Abstract

The task of person re-identification (re-ID) is to find the same pedestrian across non-overlapping camera views. Generally, the performance of person re-ID can be affected by background clutter. However, existing segmentation algorithms cannot obtain perfect foreground masks to cover the background information clearly. In addition, if the background is completely removed, some discriminative ID-related cues (i.e., backpack or companion) may be lost. In this article, we design a dual-stream network consisting of a Provider Stream (P-Stream) and a Receiver Stream (R-Stream). The R-Stream performs an a priori optimization operation on foreground information. The P-Stream acts as a pusher to guide the R-Stream to concentrate on foreground information and some useful ID-related cues in the background. The proposed dual-stream network can make full use of the a priori optimization and guided-learning strategy to learn encouraging foreground information and some useful ID-related information in the background. Our method achieves Rank-1 accuracy of 95.4% on Market-1501, 89.0% on DukeMTMC-reID, 78.9% on CUHK03 (labeled), and 75.4% on CUHK03 (detected), outperforming state-of-the-art methods.

References

[1]
Rıza Alp Güler, Natalia Neverova, and Iasonas Kokkinos. 2018. DensePose: Dense human pose estimation in the wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, Los Alamitos, CA, 7297–7306.
[2]
Mykhaylo Andriluka, Leonid Pishchulin, Peter Gehler, and Bernt Schiele. 2014. 2D human pose estimation: New benchmark and state of the art analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’14). 3686–3693.
[3]
Binghui Chen, Weihong Deng, and Jiani Hu. 2019. Mixed high-order attention network for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’19). 371–381.
[4]
Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, and Zhangyang Wang. 2019. ABD-Net: Attentive but diverse person re-identification. In Proceedings of the IEEE International Conference on Computer Vision. 8351–8361.
[5]
Yanbei Chen, Xiatian Zhu, and Shaogang Gong. 2017. Person re-identification by deep learning multi-scale representations. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 2590–2600.
[6]
Zuozhuo Dai, Mingqiang Chen, Xiaodong Gu, Siyu Zhu, and Ping Tan. 2018. Batch DropBlock network for person re-identification and beyond. arXiv:1811.07130.
[7]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’09). IEEE, Los Alamitos, CA, 248–255.
[8]
Michela Farenzena, Loris Bazzani, Alessandro Perina, Vittorio Murino, and Marco Cristani. 2010. Person re-identification by symmetry-driven accumulation of local features. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’10). IEEE, Los Alamitos, CA, 2360–2367.
[9]
Pedro F. Felzenszwalb, Ross B. Girshick, David McAllester, and Deva Ramanan. 2009. Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence 32, 9 (2009), 1627–1645.
[10]
Niloofar Gheissari, Thomas B. Sebastian, and Richard Hartley. 2006. Person reidentification using spatiotemporal appearance. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2. IEEE, Los Alamitos, CA, 1528–1535.
[11]
Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. 2017. Accurate, large minibatch SGD: Training ImageNet in 1 hour. arXiv:1706.02677.
[12]
Jianyuan Guo, Yuhui Yuan, Lang Huang, Chao Zhang, Jin-Ge Yao, and Kai Han. 2019. Beyond human parts: Dual part-aligned representations for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’19). 3642–3651.
[13]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 2961–2969.
[14]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 770–778.
[15]
Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv:1703.07737.
[16]
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv:1503.02531.
[17]
Yan Huang, Hao Sheng, Yanwei Zheng, and Zhang Xiong. 2017. DeepDiff: Learning deep difference features on human body parts for person re-identification. Neurocomputing 241 (2017), 191–203.
[18]
Yan Huang, Qiang Wu, JingSong Xu, and Yi Zhong. 2019. SBSGAN: Suppression of inter-domain background shift for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’19). 9527–9536.
[19]
Yan Huang, Jingsong Xu, Qiang Wu, Zhedong Zheng, Zhaoxiang Zhang, and Jian Zhang. 2019. Multi-pseudo regularized label for generated data in person re-identification. IEEE Transactions on Image Processing 28, 3 (2019), 1391–1403.
[20]
Yan Huang, Jingsong Xu, Qiang Wu, Yi Zhong, and Zhaoxiang Zhang. 2020. Beyond scalar neuron: Adopting vector-neuron capsules for long-term person re-identification. IEEE Transactions on Circuits and Systems for Video Technology 30, 10 (2020), 3459–3471.
[21]
Max Jaderberg, Karen Simonyan, Andrew Zisserman, and Koray Kavukcuoglu. 2015. Spatial transformer networks. In Advances in Neural Information Processing Systems. 2017–2025.
[22]
Mahdi M. Kalayeh, Emrah Basaran, Muhittin Gökmen, Mustafa E. Kamasak, and Mubarak Shah. 2018. Human semantic parsing for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 1062–1071.
[23]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980.
[24]
Cuong Vo Le, Quan Nguyen Hong, Trung Tran Quang, and Nghia Doan Trung. 2016. Superpixel-based background removal for accuracy salience person re-identification. In Proceedings of the 2016 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia’16). IEEE, Los Alamitos, CA, 1–4.
[25]
Dangwei Li, Xiaotang Chen, Zhang Zhang, and Kaiqi Huang. 2017. Learning deep context-aware features over body and latent parts for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 384–393.
[26]
Wei Li, Rui Zhao, Tong Xiao, and Xiaogang Wang. 2014. DeepReid: Deep filter pairing neural network for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’14). 152–159.
[27]
Wei Li, Xiatian Zhu, and Shaogang Gong. 2017. Person re-identification by deep joint learning of multi-loss classification. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17). 2194–2200.
[28]
Wei Li, Xiatian Zhu, and Shaogang Gong. 2018. Harmonious attention network for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 2285–2294.
[29]
Xiaodan Liang, Ke Gong, Xiaohui Shen, and Liang Lin. 2018. Look into person: Joint body parsing and pose estimation network and a new benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 4 (2018), 871–885.
[30]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision (ECCV’14). 740–755.
[31]
Hao Liu, Jiashi Feng, Meibin Qi, Jianguo Jiang, and Shuicheng Yan. 2017. End-to-end comparative attention networks for person re-identification. IEEE Transactions on Image Processing 26, 7 (2017), 3492–3506.
[32]
Yiheng Liu, Wengang Zhou, Jianzhuang Liu, Guojun Qi, Qi Tian, and Houqiang Li. 2019. An end-to-end foreground-aware network for person re-identification. arXiv:1910.11547.
[33]
Thuy Binh Nguyen, Van Phu Pham, Thi-Lan Le, and Cuong Vo Le. 2016. Background removal for improving saliency-based person re-identification. In Proceedings of the 2016 8th International Conference on Knowledge and Systems Engineering (KSE’16). IEEE, Los Alamitos, CA, 339–344.
[34]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS’17). 1–4.
[35]
Lei Qi, Jing Huo, Lei Wang, Yinghuan Shi, and Yang Gao. 2018. MaskReid: A mask based deep ranking neural network for person re-identification. arXiv:1804.03864.
[36]
Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, and Xiangyang Xue. 2018. Pose-normalized image generation for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV’18). 650–667.
[37]
Pengyuan Ren and Jianmin Li. 2018. Factorized distillation: Training holistic person re-identification model by distilling an ensemble of partial Reid models. arXiv:1811.08073.
[38]
Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance measures and a data set for multi-target, multi-camera tracking. In Proceedings of the European Conference on Computer Vision (ECCV’16). 17–35.
[39]
M. Saquib Sarfraz, Arne Schumann, Andreas Eberle, and Rainer Stiefelhagen. 2018. A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 420–429.
[40]
Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. 2017. Grad-CAM: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 618–626.
[41]
Yantao Shen, Hongsheng Li, Tong Xiao, Shuai Yi, Dapeng Chen, and Xiaogang Wang. 2018. Deep group-shuffling random walk for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 2265–2274.
[42]
Chunfeng Song, Yan Huang, Wanli Ouyang, and Liang Wang. 2018. Mask-guided contrastive attention model for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 1179–1188.
[43]
Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, and Qi Tian. 2017. Pose-driven deep convolutional model for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 3960–3969.
[44]
Yumin Suh, Jingdong Wang, Siyu Tang, Tao Mei, and Kyoung Mu Lee. 2018. Part-aligned bilinear representations for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV’18). 402–419.
[45]
Yifan Sun, Liang Zheng, Weijian Deng, and Shengjin Wang. 2017. SVDNet for pedestrian retrieval. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 3800–3808.
[46]
Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European Conference on Computer Vision (ECCV’18). 480–496.
[47]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 2818–2826.
[48]
Maoqing Tian, Shuai Yi, Hongsheng Li, Shihua Li, Xuesen Zhang, Jianping Shi, Junjie Yan, and Xiaogang Wang. 2018. Eliminating background-bias for robust person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 5794–5803.
[49]
Cheng Wang, Qian Zhang, Chang Huang, Wenyu Liu, and Xinggang Wang. 2018. Mancs: A multi-task attentional network with curriculum sampling for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV’18). 365–381.
[50]
Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, and Xi Zhou. 2018. Learning discriminative features with multiple granularities for person re-identification. In Proceedings of the 2018 ACM Multimedia Conference. ACM, New York, NY, 274–282.
[51]
Liang Wang, Tieniu Tan, Huazhong Ning, and Weiming Hu. 2003. Silhouette analysis-based gait recognition for human identification. IEEE Transactions on Pattern Analysis and Machine Intelligence 25, 12 (2003), 1505–1518.
[52]
Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, and Qi Tian. 2017. Glad: Global-local-alignment descriptor for pedestrian retrieval. In Proceedings of the 25th ACM International Conference on Multimedia (MM’17). ACM, New York, NY, 420–428.
[53]
Junyi Wu, Lingxiang Yao, Yan Huang, Jingsong Xu, Wu Qiang, and Liqin Huang. 2019. Improving person re-identification performance using body mask via cross-learning strategy. In Proceedings of the 2019 IEEE Visual Communications and Image Processing (VCIP’19). IEEE, Los Alamitos, CA, 1–4.
[54]
Jing Xu, Rui Zhao, Feng Zhu, Huaming Wang, and Wanli Ouyang. 2018. Attention-aware compositional network for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 2119–2128.
[55]
Rui Yu, Zhichao Zhou, Song Bai, and Xiang Bai. 2017. Divide and fuse: A re-ranking approach for person re-identification. arXiv:1708.04169.
[56]
Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, and Zhibo Chen. 2019. Densely semantically aligned person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 667–676.
[57]
Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, and Xiaoou Tang. 2017. Spindle Net: Person re-identification with human body region guided feature decomposition and fusion. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 1077–1085.
[58]
Liang Zheng, Yujia Huang, Huchuan Lu, and Yi Yang. 2019. Pose invariant embedding for deep person re-identification. IEEE Transactions on Image Processing 28, 9 (2019), 4500–4509.
[59]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’15). 1116–1124.
[60]
Liang Zheng, Yi Yang, and Alexander G. Hauptmann. 2016. Person re-identification: Past, present and future. arXiv:1610.02984.
[61]
Meng Zheng, Srikrishna Karanam, Ziyan Wu, and Richard J. Radke. 2019. Re-identification with consistent attentive siamese networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19). 5735–5744.
[62]
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, and Jan Kautz. 2019. Joint discriminative and generative learning for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[63]
Zhedong Zheng, Liang Zheng, and Yi Yang. 2018. A discriminatively learned CNN embedding for person reidentification. ACM Transactions on Multimedia Computing, Communications, and Applications 14, 1 (2018), 13.
[64]
Zhun Zhong, Liang Zheng, Donglin Cao, and Shaozi Li. 2017. Re-ranking person re-identification with k-reciprocal encoding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 1318–1327.
[65]
Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, and Tao Xiang. 2019. Omni-scale feature learning for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’19).
[66]
Sanping Zhou, Jinjun Wang, Deyu Meng, Yudong Liang, Yihong Gong, and Nanning Zheng. 2019. Discriminative feature learning with foreground attention for person re-identification. arXiv:1807.01455.
[67]
Kuan Zhu, Haiyun Guo, Zhiwei Liu, Ming Tang, and Jinqiao Wang. 2020. Identity-guided human semantic parsing for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV’20).

Cited By

View all
  • (2024)Selective and orthogonal feature activation for pedestrian attribute recognitionProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i6.28419(6039-6047)Online publication date: 20-Feb-2024
  • (2024)Person in Uniforms Re-IdentificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/370383921:2(1-23)Online publication date: 11-Nov-2024
  • (2024)Text-and-Image Learning Transformer for Cross-Modal Person Re-IdentificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/368616021:1(1-18)Online publication date: 15-Oct-2024
  • Show More Cited By

Index Terms

  1. Dual-Stream Guided-Learning via a Priori Optimization for Person Re-identification

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 17, Issue 4
    November 2021
    529 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3492437
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 January 2022
    Accepted: 01 January 2021
    Revised: 01 January 2021
    Received: 01 July 2020
    Published in TOMM Volume 17, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Person re-identification
    2. a priori optimization
    3. guided-learning
    4. foreground images

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • Major Science and Technology Projects in Fujian, China

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)49
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 26 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Selective and orthogonal feature activation for pedestrian attribute recognitionProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i6.28419(6039-6047)Online publication date: 20-Feb-2024
    • (2024)Person in Uniforms Re-IdentificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/370383921:2(1-23)Online publication date: 11-Nov-2024
    • (2024)Text-and-Image Learning Transformer for Cross-Modal Person Re-IdentificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/368616021:1(1-18)Online publication date: 15-Oct-2024
    • (2024)Exploiting Blockchain to Make AI Trustworthy: A Software Development Lifecycle ViewACM Computing Surveys10.1145/361442456:7(1-31)Online publication date: 9-Apr-2024
    • (2024)A Two-Stream Hybrid Convolution-Transformer Network Architecture for Clothing-Change Person Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2023.333156926(5326-5339)Online publication date: 2024
    • (2024)Attribute-Guided Pedestrian Retrieval: Bridging Person Re-ID with Internal Attribute Variability2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01675(17689-17699)Online publication date: 16-Jun-2024
    • (2023)Blockchain-Based Privacy Preservation for IoT-Enabled Healthcare SystemACM Transactions on Sensor Networks10.1145/357792619:3(1-17)Online publication date: 2-Mar-2023
    • (2023)Lightweight branched blockchain security framework for Internet of VehiclesTransactions on Emerging Telecommunications Technologies10.1002/ett.452034:11Online publication date: 12-Nov-2023
    • (2022)Long-Term Person Re-Identification Based on Appearance and Gait Feature Fusion under Covariate ChangesProcesses10.3390/pr1004077010:4(770)Online publication date: 14-Apr-2022
    • (2022)A Secure Cloud Storage Algorithm for University Financial Data Based on the Blockchain TechnologyMobile Information Systems10.1155/2022/50307382022Online publication date: 1-Jan-2022
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media