[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3503161.3547764acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Dynamic Prototype Mask for Occluded Person Re-Identification

Published: 10 October 2022 Publication History

Abstract

Although person re-identification has achieved an impressive improvement in recent years, the common occlusion case caused by different obstacles is still an unsettled issue in real application scenarios. Existing methods mainly address this issue by employing body clues provided by an extra network to distinguish the visible part. Nevertheless, the inevitable domain gap between the assistant model and the ReID datasets has highly increased the difficulty to obtain an effective and efficient model. To escape from the extra pre-trained networks and achieve an automatic alignment in an end-to-end trainable network, we propose a novel Dynamic Prototype Mask (DPM) based on two self-evident prior knowledge. Specifically, we first devise a Hierarchical Mask Generator which utilizes the hierarchical semantic to select the visible pattern space between the high-quality holistic prototype and the feature representation of the occluded input image. Under this condition, the occluded representation could be well aligned in a selected subspace spontaneously. Then, to enrich the feature representation of the high-quality holistic prototype and provide a more complete feature space, we introduce a Head Enrich Module to encourage different heads to aggregate different patterns representation in the whole image. Extensive experimental evaluations conducted on occluded and holistic person re-identification benchmarks demonstrate the superior performance of the DPM over the state-of-the-art methods.

Supplementary Material

MP4 File (MM22-fp112.mp4)
This is the presentation video of dynamic prototype mask (DPM) for occluded person re-identification. DPM escapes from the extra pre-trained networks and achieve an automatic alignment in an end-to-end trainable network. DPM reaches superior performance on occluded and holistic person re-identification benchmarks over the state-of-the-art methods.

References

[1]
Binghui Chen, Weihong Deng, and Jiani Hu. 2019. Mixed High-Order Attention Network for Person Re-Identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).
[2]
Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, and Tat-Seng Chua. 2017. Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5659--5667.
[3]
Peixian Chen, Wenfeng Liu, Pingyang Dai, Jianzhuang Liu, Qixiang Ye, Mingliang Xu, Qi'an Chen, and Rongrong Ji. 2021b. Occlude them all: Occlusion-aware attention network for occluded person re-id. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 11833--11842.
[4]
Zhiwei Chen, Liujuan Cao, Yunhang Shen, Feihong Lian, Yongjian Wu, and Rongrong Ji. 2021a. E2Net: Excitative-expansile learning for weakly supervised object Localization. In Proceedings of the 29th ACM International Conference on Multimedia. 573--581.
[5]
Zhiwei Chen, Changan Wang, Yabiao Wang, Guannan Jiang, Yunhang Shen, Ying Tai, Chengjie Wang, Wei Zhang, and Liujuan Cao. 2022. Lctr: On awakening the local continuity of transformer for weakly supervised object localization. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 410--418.
[6]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In Proceedings of the CVPR. 248--255.
[7]
Jiankang Deng, Jia Guo, Niannan Xue, and Stefanos Zafeiriou. 2019. Arcface: Additive angular margin loss for deep face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4690--4699.
[8]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et al. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).
[9]
Chanho Eom and Bumsub Ham. 2019. Learning disentangled representation for robust person re-identification. In Proceedings of the NeurIPS. 5297--5308.
[10]
Shang Gao, Jingya Wang, Huchuan Lu, and Zimo Liu. 2020. Pose-guided visible part matching for occluded person ReID. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11744--11752.
[11]
Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, et al. 2018. Fd-gan: Pose-guided feature distilling gan for robust person re-identification. Advances in neural information processing systems, Vol. 31 (2018).
[12]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961--2969.
[13]
Lingxiao He, Jian Liang, Haiqing Li, and Zhenan Sun. 2018. Deep spatial feature reconstruction for partial person re-identification: Alignment-free approach. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7073--7082.
[14]
Lingxiao He, Yinggang Wang, Wu Liu, He Zhao, Zhenan Sun, and Jiashi Feng. 2019. Foreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In Proceedings of the IEEE/CVF international conference on computer vision. 8450--8459.
[15]
Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, and Wei Jiang. 2021. Transreid: Transformer-based object re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 15013--15022.
[16]
Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, and Xilin Chen. 2021. Feature completion for occluded person re-identification. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[17]
Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7132--7141.
[18]
Houjing Huang, Dangwei Li, Zhang Zhang, Xiaotang Chen, and Kaiqi Huang. 2018. Adversarially occluded samples for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5098--5107.
[19]
Martin Koestinger, Martin Hirzer, Paul Wohlhart, Peter M Roth, and Horst Bischof. 2012. Large scale metric learning from equivalence constraints. In 2012 IEEE conference on computer vision and pattern recognition. IEEE, 2288--2295.
[20]
Hanjun Li, Gaojie Wu, and Wei-Shi Zheng. 2021b. Combined depth space based architecture search for person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6729--6738.
[21]
Yulin Li, Jianfeng He, Tianzhu Zhang, Xiang Liu, Yongdong Zhang, and Feng Wu. 2021a. Diverse part discovery: Occluded person re-identification with part-aware transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2898--2907.
[22]
Shengcai Liao, Yang Hu, Xiangyu Zhu, and Stan Z Li. 2015. Person re-identification by local maximal occurrence representation and metric learning. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2197--2206.
[23]
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10012--10022.
[24]
Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, and Rongrong Ji. 2022. Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks. IEEE Transactions on Image Processing, Vol. 31 (2022), 3386--3398.
[25]
Hao Luo, Youzhi Gu, Xingyu Liao, Shenqi Lai, and Wei Jiang. 2019. Bag of tricks and a strong baseline for deep person re-identification. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. 0--0.
[26]
Bingpeng Ma, Yu Su, and Frederic Jurie. 2014. Covariance descriptor based on bio-inspired features for person re-identification and face verification. Image and Vision Computing, Vol. 32, 6--7 (2014), 379--390.
[27]
Jiaxu Miao, Yu Wu, Ping Liu, Yuhang Ding, and Yi Yang. 2019. Pose-guided feature alignment for occluded person re-identification. In Proceedings of the IEEE/CVF international conference on computer vision. 542--551.
[28]
Yongqiang Mou, Lei Tan, Hui Yang, Jingying Chen, Leyuan Liu, Rui Yan, and Yaohong Huang. 2020. Plugnet: Degradation aware scene text recognition supervised by a pluggable super-resolution unit. In European Conference on Computer Vision. Springer, 158--174.
[29]
Jun Peng, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, and Rongrong Ji. 2021. Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis. IEEE Transactions on Multimedia (2021).
[30]
Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, and Xiangyang Xue. 2018. Pose-normalized image generation for person re-identification. In Proceedings of the ECCV. 650--667.
[31]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the CVPR. 815--823.
[32]
Yumin Suh, Jingdong Wang, Siyu Tang, Tao Mei, and Kyoung Mu Lee. 2018. Part-aligned bilinear representations for person re-identification. In Proceedings of the European conference on computer vision (ECCV). 402--419.
[33]
Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the ECCV. 480--496.
[34]
Guan'an Wang, Shuo Yang, Huanyu Liu, Zhicheng Wang, Yang Yang, Shuliang Wang, Gang Yu, Erjin Zhou, and Jian Sun. 2020. High-order information matters: Learning relation and topology for occluded person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6449--6458.
[35]
Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, and Xi Zhou. 2018. Learning discriminative features with multiple granularities for person re-identification. In Proceedings of the ACM MM. 274--282.
[36]
Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person transfer gan to bridge domain gap for person re-identification. In Proceedings of the CVPR. 79--88.
[37]
Jinrui Yang, Jiawei Zhang, Fufu Yu, Xinyang Jiang, Mengdan Zhang, Xing Sun, Ying-Cong Chen, and Wei-Shi Zheng. 2021. Learning To Know Where To See: A Visibility-Aware Approach for Occluded Person Re-Identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 11885--11894.
[38]
Yang Yang, Jimei Yang, Junjie Yan, Shengcai Liao, Dong Yi, and Stan Z Li. 2014. Salient color names for person re-identification. In European conference on computer vision. Springer, 536--551.
[39]
Mang Ye, Weijian Ruan, Bo Du, and Mike Zheng Shou. 2021a. Channel Augmented Joint Learning for Visible-Infrared Recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 13567--13576.
[40]
Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, and Steven CH Hoi. 2021b. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[41]
Yunpeng Zhai, Shijian Lu, Qixiang Ye, Xuebo Shan, Jie Chen, Rongrong Ji, and Yonghong Tian. 2020. AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification. In Proceedings of the CVPR.
[42]
Yukang Zhang, Yan Yan, Yang Lu, and Hanzi Wang. 2021. Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification. In Proceedings of the 29th ACM International Conference on Multimedia. 788--796.
[43]
Feng Zheng, Cheng Deng, Xing Sun, Xinyang Jiang, Xiaowei Guo, Zongqiao Yu, Feiyue Huang, and Rongrong Ji. 2019a. Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training. In Proceedings of the CVPR. 8514--8522.
[44]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the IEEE international conference on computer vision. 1116--1124.
[45]
Wei-Shi Zheng, Shaogang Gong, and Tao Xiang. 2012. Reidentification by relative distance comparison. IEEE transactions on pattern analysis and machine intelligence, Vol. 35, 3 (2012), 653--668.
[46]
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, and Jan Kautz. 2019b. Joint discriminative and generative learning for person re-identification. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2138--2147.
[47]
Zhedong Zheng, Liang Zheng, and Yi Yang. 2017. Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In Proceedings of the IEEE international conference on computer vision. 3754--3762.
[48]
Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, and Yi Yang. 2020. Random Erasing Data Augmentation. In Proceedings of the AAAI.
[49]
Kuan Zhu, Haiyun Guo, Zhiwei Liu, Ming Tang, and Jinqiao Wang. 2020. Identity-guided human semantic parsing for person re-identification. In European Conference on Computer Vision. Springer, 346--363.
[50]
Jiaxuan Zhuo, Zeyu Chen, Jianhuang Lai, and Guangcong Wang. 2018. Occluded person re-identification. In 2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1--6.

Cited By

View all
  • (2024)Rethink Motion Information for Occluded Person Re-IdentificationApplied Sciences10.3390/app1406255814:6(2558)Online publication date: 19-Mar-2024
  • (2024)ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-IdentificationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680958(1583-1592)Online publication date: 28-Oct-2024
  • (2024)Occlusion-Aware Transformer With Second-Order Attention for Person Re-IdentificationIEEE Transactions on Image Processing10.1109/TIP.2024.339336033(3200-3211)Online publication date: 2024
  • Show More Cited By

Index Terms

  1. Dynamic Prototype Mask for Occluded Person Re-Identification

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '22: Proceedings of the 30th ACM International Conference on Multimedia
    October 2022
    7537 pages
    ISBN:9781450392037
    DOI:10.1145/3503161
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dynamic prototype mask
    2. occluded person re-identification

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MM '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)96
    • Downloads (Last 6 weeks)10
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Rethink Motion Information for Occluded Person Re-IdentificationApplied Sciences10.3390/app1406255814:6(2558)Online publication date: 19-Mar-2024
    • (2024)ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-IdentificationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680958(1583-1592)Online publication date: 28-Oct-2024
    • (2024)Occlusion-Aware Transformer With Second-Order Attention for Person Re-IdentificationIEEE Transactions on Image Processing10.1109/TIP.2024.339336033(3200-3211)Online publication date: 2024
    • (2024)Region Generation and Assessment Network for Occluded Person Re-IdentificationIEEE Transactions on Information Forensics and Security10.1109/TIFS.2023.331895619(120-132)Online publication date: 1-Jan-2024
    • (2024)Pedestrian 3D Shape Understanding for Person Re-Identification via Multi-View LearningIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.335885034:7(5589-5602)Online publication date: Jul-2024
    • (2024)3D Person Re-Identification Based on Global Semantic Guidance and Local Feature AggregationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.332871234:6(4698-4712)Online publication date: Jun-2024
    • (2024)Occlusion-aware Cross-Attention Fusion for Video-based Occluded Cloth-Changing Person Re-Identification2024 IEEE International Joint Conference on Biometrics (IJCB)10.1109/IJCB62174.2024.10744425(1-11)Online publication date: 15-Sep-2024
    • (2024)ACML: Attention-Based Cross-Modality Learning For Cloth-Changing and Occluded Person Re-Identification2024 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP51287.2024.10647794(2396-2402)Online publication date: 27-Oct-2024
    • (2024)Exploring Feature Uncertainty in Occluded Person Re-Identification via Entropy-Guided Fusion2024 IEEE 9th International Conference on Computational Intelligence and Applications (ICCIA)10.1109/ICCIA62557.2024.10719304(171-175)Online publication date: 9-Aug-2024
    • (2024)Cross-Modality Transformer with Mixed Data Augmentation Learning for Visible-Infrared Person Re-Identification2024 9th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA)10.1109/ICCCBDA61447.2024.10569946(168-175)Online publication date: 25-Apr-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media