[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Age-Invariant Face Recognition by Multi-Feature Fusionand Decomposition with Self-attention

Published: 25 January 2022 Publication History

Abstract

Different from general face recognition, age-invariant face recognition (AIFR) aims at matching faces with a big age gap. Previous discriminative methods usually focus on decomposing facial feature into age-related and age-invariant components, which suffer from the loss of facial identity information. In this article, we propose a novel Multi-feature Fusion and Decomposition (MFD) framework for age-invariant face recognition, which learns more discriminative and robust features and reduces the intra-class variants. Specifically, we first sample multiple face images of different ages with the same identity as a face time sequence. Then, the multi-head attention is employed to capture contextual information from facial feature series, extracted by the backbone network. Next, we combine feature decomposition with fusion based on the face time sequence to ensure that the final age-independent features effectively represent the identity information of the face and have stronger robustness against the aging process. Besides, we also mitigate imbalanced age distribution in the training data by a re-weighted age loss. We experimented with the proposed MFD over the popular CACD and CACD-VS datasets, where we show that our approach improves the AIFR performance than previous state-of-the-art methods. We simultaneously show the performance of MFD on LFW dataset.

References

[1]
Dong Cao, Xiangyu Zhu, Xingyu Huang, Jianzhu Guo, and Zhen Lei. 2020. Domain balancing: Face recognition on long-tailed domains. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5671–5679.
[2]
Kaidi Cao, Colin Wei, Adrien Gaidon, Nikos Arechiga, and Tengyu Ma. 2019. Learning imbalanced datasets with label-distribution-aware margin loss. In Advances in Neural Information Processing Systems. 1567–1578.
[3]
Jie Chang, Zhonghao Lan, Changmao Cheng, and Yichen Wei. 2020. Data uncertainty learning in face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5710–5719.
[4]
Bor-Chun Chen, Chu-Song Chen, and Winston H Hsu. 2014. Cross-age reference coding for age-invariant face recognition and retrieval. In Proceedings of the European Conference on Computer Vision. Springer, 768–783.
[5]
Bor-Chun Chen, Chu-Song Chen, and Winston H Hsu. 2015. Face recognition and retrieval using cross-age reference coding with cross-age celebrity dataset. IEEE Trans. Multimedia 17, 6 (2015), 804–815.
[6]
Dong Chen, Xudong Cao, Fang Wen, and Jian Sun. 2013. Blessing of dimensionality: High-dimensional feature and its efficient compression for face verification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3025–3032.
[7]
Ke Chen, Joni-Kristian Kämäräinen, and Zhaoxiang Zhang. 2016. Facial age estimation using robust label distribution. In Proceedings of the 24th ACM International Conference on Multimedia. 77–81.
[8]
Zhineng Chen, Shanshan Ai, and Caiyan Jia. 2019. Structure-aware deep learning for product image classification. ACM Trans. Multimedia Comput. Commun. Appl. 15, 1s (2019), 1–20.
[9]
Yin Cui, Menglin Jia, Tsung-Yi Lin, Yang Song, and Serge Belongie. 2019. Class-balanced loss based on effective number of samples. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9268–9277.
[10]
Jiankang Deng, Jia Guo, Niannan Xue, and Stefanos Zafeiriou. 2019. Arcface: Additive angular margin loss for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4690–4699.
[11]
Jincan Deng, Liang Li, Beichen Zhang, Shuhui Wang, Zhengjun Zha, and Qingming Huang. 2021. Syntax-guided hierarchical attention network for video captioning. IEEE Trans. Circ. Syst. Vid. Technol. (2021), 1–1. DOI:
[12]
Amanda Cardoso Duarte. 2019. Cross-modal neural sign language translation. In Proceedings of the 27th ACM International Conference on Multimedia. 1650–1654.
[13]
Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, and Tao Mei. 2016. You lead, we exceed: Labor-free video concept learning by jointly exploiting web videos and images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 923–932.
[14]
Dihong Gong, Zhifeng Li, Dahua Lin, Jianzhuang Liu, and Xiaoou Tang. 2013. Hidden factor analysis for age invariant face recognition. In Proceedings of the IEEE International Conference on Computer Vision. 2872–2879.
[15]
Lingxiao He, Haiqing Li, Qi Zhang, Zhenan Sun, and Zhaofeng He. 2016. Multiscale representation for partial face recognition under near infrared illumination. In Proceedings of the IEEE 8th International Conference on Biometrics Theory, Applications and Systems (BTAS’16). IEEE, 1–7.
[16]
Lingxiao He, Yinggang Wang, Wu Liu, He Zhao, Zhenan Sun, and Jiashi Feng. 2019. Foreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In Proceedings of the IEEE International Conference on Computer Vision. 8450–8459.
[17]
Gary B. Huang, Marwan Mattar, Tamara Berg, and Eric Learned-Miller. 2008. Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on Faces in ‘Real-Life’ Images: Detection, Alignment, and Recognition.
[18]
Qingqiu Huang, Lei Yang, Huaiyi Huang, Tong Wu, and Dahua Lin. 2020. Caption-supervised face recognition: Training a state-of-the-art face model without manual annotation. In Proceedings of the European Conference on Computer Vision. Springer, 139–155.
[19]
Salman Khan, Munawar Hayat, Syed Waqas Zamir, Jianbing Shen, and Ling Shao. 2019. Striking the right balance with uncertainty. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 103–112.
[20]
Jianing Li, Jingdong Wang, Qi Tian, Wen Gao, and Shiliang Zhang. 2019. Global-local temporal representations for video person re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 3958–3967.
[21]
Jianing Li, Shiliang Zhang, Qi Tian, Meng Wang, and Wen Gao. 2019. Pose-guided representation learning for person re-identification. IEEE Trans. Pattern Anal. Mach. Intell. (2019), 1–1. DOI:
[22]
Liang Li, Shuqiang Jiang, and Qingming Huang. 2012. Learning hierarchical semantic description via mixed-norm regularization for image understanding. IEEE Trans. Multimedia 14, 5 (2012), 1401–1413.
[23]
Liang Li, Xinge Zhu, Yiming Hao, Shuhui Wang, Xingyu Gao, and Qingming Huang. 2019. A hierarchical CNN-RNN approach for visual emotion classification. ACM Trans. Multimedia Comput. Commun. Appl. 15, 3s (2019), 1–17.
[24]
Shuangqun Li, Xinchen Liu, Wu Liu, Huadong Ma, and Haitao Zhang. 2016. A discriminative null space based deep learning approach for person re-identification. In Proceedings of the 4th International Conference on Cloud Computing and Intelligence Systems (CCIS’16). IEEE, 480–484.
[25]
Zhifeng Li, Dihong Gong, Xuelong Li, and Dacheng Tao. 2016. Aging face recognition: A hierarchical learning model based on local patterns selection. IEEE Trans. Image Process. 25, 5 (2016), 2146–2154.
[26]
Zhifeng Li, Unsang Park, and Anil K. Jain. 2011. A discriminative model for age invariant face recognition. IEEE Trans. Inf. Forens. Secur. 6, 3 (2011), 1028–1037.
[27]
Liang Lin, Guangrun Wang, Wangmeng Zuo, Xiangchu Feng, and Lei Zhang. 2016. Cross-domain visual matching via generalized similarity measure and feature learning. IEEE Trans. Pattern Anal. Mach. Intell. 39, 6 (2016), 1089–1102.
[28]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision. 2980–2988.
[29]
Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Li Su, and Qingming Huang. 2019. Knowledge-guided pairwise reconstruction network for weakly supervised referring expression grounding. In Proceedings of the 27th ACM International Conference on Multimedia. 539–547.
[30]
Xiaobin Liu and Shiliang Zhang. 2020. Domain adaptive person re-identification via coupling optimization. In Proceedings of the 28th ACM International Conference on Multimedia. 547–555.
[31]
Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2015. Deep learning face attributes in the wild. In Proceedings of the IEEE International Conference on Computer Vision. 3730–3738.
[32]
Xiang Long, Chuang Gan, and Gerard de Melo. 2018. Video captioning with multi-faceted attention. Trans. Assoc. Comput. Linguist. 6 (2018), 173–184.
[33]
Xiang Long, Chuang Gan, Gerard Melo, Xiao Liu, Yandong Li, Fu Li, and Shilei Wen. 2018. Multimodal keyless attention fusion for video classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
[34]
Ping Luo, Zhenyao Zhu, Ziwei Liu, Xiaogang Wang, and Xiaoou Tang. 2016. Face model compression by distilling knowledge from neurons. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 30.
[35]
Jinna Lv, Wu Liu, Meng Zhang, He Gong, Bin Wu, and Huadong Ma. 2017. Multi-feature fusion for predicting social media popularity. In Proceedings of the 25th ACM International Conference on Multimedia. 1883–1888.
[36]
Gayathri Mahalingam and Chandra Kambhamettu. 2010. Age invariant face recognition using graph matching. In Proceedings of the 4th IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS’10). IEEE, 1–7.
[37]
Lixuan Meng, Chenggang Yan, Jun Li, Jian Yin, Wu Liu, Hongtao Xie, and Liang Li. 2020. Multi-features fusion and decomposition for age-invariant face recognition. In Proceedings of the 28th ACM International Conference on Multimedia. 3146–3154.
[38]
Narayanan Ramanathan and Rama Chellappa. 2006. Modeling age progression in young faces. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), Vol. 1. IEEE, 387–394.
[39]
Mengye Ren, Wenyuan Zeng, Bin Yang, and Raquel Urtasun. 2018. Learning to reweight examples for robust deep learning. In Proceedings of the International Conference on Machine Learning. PMLR, 4334–4343.
[40]
Rasmus Rothe, Radu Timofte, and Luc Van Gool. 2015. DEX: Deep expectation of apparent age from a single image. In Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW’15). 10–15.
[41]
Jun Shu, Qi Xie, Lixuan Yi, Qian Zhao, Sanping Zhou, Zongben Xu, and Deyu Meng. 2019. Meta-weight-net: Learning an explicit mapping for sample weighting. In Advances in Neural Information Processing Systems. 1919–1930.
[42]
Jingkuan Song, Jingqiu Zhang, Lianli Gao, Xianglong Liu, and Heng Tao Shen. 2018. Dual conditional GANs for face aging and rejuvenation. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’18). 899–905.
[43]
Yu Sun, Yun Ye, Wu Liu, Wenpeng Gao, YiLi Fu, and Tao Mei. 2019. Human mesh recovery from monocular images via a skeleton-disentangled representation. In Proceedings of the IEEE International Conference on Computer Vision. 5349–5358.
[44]
Yaniv Taigman, Ming Yang, Marc’Aurelio Ranzato, and Lior Wolf. 2014. Deepface: Closing the gap to human-level performance in face verification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1701–1708.
[45]
Shangzhi Teng, Shiliang Zhang, Qingming Huang, and Nicu Sebe. 2021. Viewpoint and scale consistency reinforcement for uav vehicle re-identification. Int. J. Comput. Vis. 129, 3 (2021), 719–735.
[46]
Dongkai Wang and Shiliang Zhang. 2020. Unsupervised person re-identification via multi-label classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10981–10990.
[47]
Hao Wang, Dihong Gong, Zhifeng Li, and Wei Liu. 2019. Decorrelated adversarial learning for age-invariant face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3527–3536.
[48]
Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, and Wei Liu. 2018. Cosface: Large margin cosine loss for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5265–5274.
[49]
Wei Wang, Zhen Cui, Yan Yan, Jiashi Feng, Shuicheng Yan, Xiangbo Shu, and Nicu Sebe. 2016. Recurrent face aging. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2378–2386.
[50]
Xiao Wang, Wu Liu, Jun Chen, Xiaobo Wang, Chenggang Yan, and Tao Mei. 2020. Listen, look, and find the one: Robust person search with multimodality index. ACM Trans. Multimedia Comput. Commun. Appl. 16, 2 (2020), 1–20.
[51]
Yitong Wang, Dihong Gong, Zheng Zhou, Xing Ji, Hao Wang, Zhifeng Li, Wei Liu, and Tong Zhang. 2018. Orthogonal deep features decomposition for age-invariant face recognition. In Proceedings of the European Conference on Computer Vision (ECCV’18). 738–753.
[52]
Yandong Wen, Zhifeng Li, and Yu Qiao. 2016. Latent factor guided convolutional neural networks for age-invariant face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4893–4901.
[53]
Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A discriminative feature learning approach for deep face recognition. In Proceedings of the European Conference on Computer Vision. Springer, 499–515.
[54]
Bichen Wu, Alvin Wan, Xiangyu Yue, Peter Jin, Sicheng Zhao, Noah Golmant, Amir Gholaminejad, Joseph Gonzalez, and Kurt Keutzer. 2018. Shift: A zero flop, zero parameter alternative to spatial convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9127–9135.
[55]
Chenfei Xu, Qihe Liu, and Mao Ye. 2017. Age invariant face recognition and retrieval by coupled auto-encoder networks. Neurocomputing 222 (2017), 62–71.
[56]
Tongkun Xu, Xin Zhao, Jiamin Hou, Xinhong Hao, Jian Yin, et al. 2020. A general re-ranking method based on metric learning for person re-identification. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME’20). IEEE, 1–6.
[57]
Chenggang Yan, Biao Gong, Yuxuan Wei, and Yue Gao. 2021. Deep multi-view enhancement hashing for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 43, 4 (2021), 1445–1451.
[58]
Chenggang Yan, Liang Li, Chunjie Zhang, Bingtao Liu, Yongdong Zhang, and Qionghai Dai. 2019. Cross-modality bridging and knowledge transferring for image understanding. IEEE Trans. Multimedia 21, 10 (2019), 2675–2685.
[59]
Chenggang Yan, Zhisheng Li, Yongbing Zhang, Yutao Liu, Xiangyang Ji, and Yongdong Zhang. 2020. Depth image denoising using nuclear norm and learning graph model. ACM Trans. Multimedia Comput. Commun. Appl. 16, 4 (2020), 1–17.
[60]
Chenggang Yan, Biyao Shao, Hao Zhao, Ruixin Ning, Yongdong Zhang, and Feng Xu. 2020. 3d room layout estimation from a single rgb image. IEEE Trans. Multimedia 22, 11 (2020), 3014–3024.
[61]
Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang, and Qi Tian. 2019. Skeletonnet: A hybrid network with a skeleton-embedding process for multi-view image representation learning. IEEE Trans. Multimedia 21, 11 (2019), 2916–2929.
[62]
Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z Li. 2014. Learning face representation from scratch. arXiv:1411.7923. Retrieved from https://arxiv.org/abs/1411.7923.
[63]
Beichen Zhang, Liang Li, Shijie Yang, Shuhui Wang, Zheng-Jun Zha, and Qingming Huang. 2020. State-relabeling adversarial active learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8756–8765.
[64]
Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, and Yu Qiao. 2016. Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Sign. Process. Lett. 23, 10 (2016), 1499–1503.
[65]
Zhifei Zhang, Yang Song, and Hairong Qi. 2017. Age progression/regression by conditional adversarial autoencoder. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5810–5818.
[66]
Jian Zhao, Yu Cheng, Yi Cheng, Yang Yang, Fang Zhao, Jianshu Li, Hengzhu Liu, Shuicheng Yan, and Jiashi Feng. 2019. Look across elapse: Disentangled representation learning and photorealistic cross-age face synthesis for age-invariant face recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 9251–9258.
[67]
Tianyue Zheng, Weihong Deng, and Jiani Hu. 2017. Age estimation guided convolutional neural network for age-invariant face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 1–9.
[68]
Haiping Zhu, Qi Zhou, Junping Zhang, and James Z. Wang. 2018. Facial aging and rejuvenation by conditional multi-adversarial autoencoder with ordinal regression. arXiv:1804.02740. Retrieved from https://arxiv.org/abs/1804.02740.

Cited By

View all
  • (2025)Long–Short Observation-driven Prediction Network for pedestrian crossing intention prediction with momentary observationNeurocomputing10.1016/j.neucom.2024.128824614(128824)Online publication date: Jan-2025
  • (2025)Hybrid propagation modeling based clutch fault diagnosis of multi-mode electromechanical transmission system using particle filterMeasurement10.1016/j.measurement.2024.116385243(116385)Online publication date: Feb-2025
  • (2025) Adaptive event-triggered sliding mode control for platooning of heterogeneous vehicular systems and its input-to-output string stability Information Sciences10.1016/j.ins.2024.121342686(121342)Online publication date: Jan-2025
  • Show More Cited By

Index Terms

  1. Age-Invariant Face Recognition by Multi-Feature Fusionand Decomposition with Self-attention

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 1s
      February 2022
      352 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3505206
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 January 2022
      Accepted: 01 June 2021
      Revised: 01 May 2021
      Received: 01 January 2021
      Published in TOMM Volume 18, Issue 1s

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Age-invariant face recognition
      2. feature fusion
      3. feature decomposition
      4. self-attention

      Qualifiers

      • Research-article
      • Refereed

      Funding Sources

      • National Key Research and Development Program of China
      • National Natural Science Foundation of China
      • Zhejiang Province Natural Science Foundation of China
      • Youth Innovation Promotion Association of Chinese Academy of Sciences
      • 111 Project

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)527
      • Downloads (Last 6 weeks)64
      Reflects downloads up to 12 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Long–Short Observation-driven Prediction Network for pedestrian crossing intention prediction with momentary observationNeurocomputing10.1016/j.neucom.2024.128824614(128824)Online publication date: Jan-2025
      • (2025)Hybrid propagation modeling based clutch fault diagnosis of multi-mode electromechanical transmission system using particle filterMeasurement10.1016/j.measurement.2024.116385243(116385)Online publication date: Feb-2025
      • (2025) Adaptive event-triggered sliding mode control for platooning of heterogeneous vehicular systems and its input-to-output string stability Information Sciences10.1016/j.ins.2024.121342686(121342)Online publication date: Jan-2025
      • (2024)Application of Instance Segmentation to Identifying Insect Concentrations in Data from an Entomological RadarRemote Sensing10.3390/rs1617333016:17(3330)Online publication date: 8-Sep-2024
      • (2024)VALNet: Vision-Based Autonomous Landing with Airport Runway Instance SegmentationRemote Sensing10.3390/rs1612216116:12(2161)Online publication date: 14-Jun-2024
      • (2024)Hybrid Spatial-Channel Attention Mechanism for Cross-Age Face RecognitionElectronics10.3390/electronics1307125713:7(1257)Online publication date: 28-Mar-2024
      • (2024)The Effects of AI-Driven Face Restoration on Forensic Face RecognitionApplied Sciences10.3390/app1409378314:9(3783)Online publication date: 29-Apr-2024
      • (2024)Face Identification Based on Active Facial Patches Using Multi-Task Cascaded Convolutional NetworksJournal of Advances in Information Technology10.12720/jait.15.1.118-12615:1(118-126)Online publication date: 2024
      • (2024)Hybrid model-based early diagnosis of esophageal disorders using convolutional neural network and refined logistic regressionEURASIP Journal on Image and Video Processing10.1186/s13640-024-00634-32024:1Online publication date: 9-Aug-2024
      • (2024)Cartoon copyright recognition method based on character personality actionJournal on Image and Video Processing10.1186/s13640-024-00627-22024:1Online publication date: 24-May-2024
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media