Abstract
Human face retrieval has long been established as one of the most interesting research topics in computer vision. With the recent development of deep learning, many researchers have addressed this problem by building deep hashing models to learn binary code from face images, while performing face retrieval as a classification task. Nevertheless, the performance is still unsatisfactory since these models are incapable of handling inter-class variation between multiple persons, as we need to make a class label for each identity. In this backdrop, we propose in this paper an effective deep learning-based framework for face image retrieval. The key to our framework is mainly based on the matching of face pairs, where a two-stream network, named \(\chi Net+\chi Match\), is designed to learn similarities in terms of person identity. Such similarities are investigated by embedding both deep local representation via face components, and deep global face representation via the whole face image. Since the similarities captured over face components are supposed to diversify due to variation in pose, expression and occlusion, we also introduce a Sparse Score Fusion layer that learns automatically the weight of each component according to its contribution to face matching. To allow fast retrieval, we farther propose a method that generates binary codes corresponding to the groups of similar faces through the hierarchical k-means, where the path down binary tree is exploited as a binary code for indexing. The final retrieval is then conducted within a privileged subset of images in the database. Our experiments on different challenging datasets show that our approach obtains outstanding results while outperforming most existing methods.
Similar content being viewed by others
Data availibility
All datasets used in this work are publicly available and have been properly referenced in the text.
References
Mishra A (2019) Dhfml: deep heterogeneous feature metric learning for matching photograph and cartoon pairs. Int J Multimed Info Retr 8:135–142
Kafai M, An L, Bhanu B (2014) Reference face graph for face recognition. IEEE Trans Info Foren Secur 9(12):2132–2143
Khan AM, Jalal AS (2020) A framework for suspect face retrieval using linguistic descriptions. Expert Syst Appl 141:112925
Yang S, Kafai M, An L, Bhanu B (2014) Zapping index: using smile to measure advertisement zapping likelihood. IEEE Trans Affect Comput 5(4):432–444
Ameur B, Belahcene M, Masmoudi S, Ben Hamida A (2019) Hybrid descriptors and weighted PCA-EFMNet for face verification in the wild. Int J Multimed Info Retr 8:143–154
Chen BC, Chen YY, Kuo YH, Hsu WH (2013) Scalable face image retrieval using attribute-enhanced sparse codewords. IEEE Trans Multimedia 15(5):1163–1173
Sun Y, Chen Y, Wang X, Tang X (2014) Deep learning face representation by joint identification-verification. Advances in neural information processing systems
Wang D, Otto C, Jain AK (2015) Face search at scale: 80 million gallery, arXiv:1507.07242
Xia R, Pan Y, Lai H, Liu C, Yan S (2014) Supervised hashing for image retrieval via image representation learning. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence
Dong Z, Jia S, Wu T, Pei M (2016) Face video retrieval via deep learning of binary hash representations. In: Thirtieth AAAI Conference on Artificial Intelligence
Lin J, Li Z, Tang J (2017) Discriminative deep hashing for scalable face image retrieval. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Tang J, Li Z, Zhu X (2018) Supervised deep hashing for scalable face image retrieval. Pattern Recogn 75:25–32
Tang J, Lin J, Li Z, Yang J (2018) Discriminative deep quantization hashing for face image retrieval. IEEE Trans Neural Netw Learn Syst 99:1–9
Xiong Z, Li B, Gu X, Gu W, Wang W (2019) Discriminative deep attention-aware hashing for face image retrieval. In: Pacific Rim International Conference on Artificial Intelligence
Ng HW, Winkler S (2014) A data-driven approach to cleaning large face datasets. In: Proceedings of IEEE International Conference on Image Processing (ICIP)
Wolf L, Hassner T, Maoz I (2011) Face recognition in unconstrained videos with matched background similarity. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Cao Q, Shen L, Xie W, Parkhi OM, Zisserman A (2018) VGGFace2: a dataset for recognizing face across pose and age. In: 13th International Conference on Automatic Face and Gesture Recognition
Wang F, Chen L, Li C, Huang S, Chen Y, Qian C, Loy CC (2018) The devil of face recognition is in the noise, In: Proceedings of the European Conference on Computer Vision (ECCV’18) pp 1–16
Wang D, Hoi SC, He Y, Zhu J, Mei T, Luo J (2014) Retrieval-based face annotation by weak label regularized local coordinate coding. IEEE Trans Pattern Anal Mach Intell 36(3):550–563
Klare B.F, Blanton A, Klein B (2014) Efficient face retrieval using synecdoches, In: IEEE. IEEE International Joint Conference on Biometrics pp 1–7
Chen BC, Chen YY, Kuo YH, Ngo TD, Le DD, Satoh SI, Hsu WH (2017) Scalable face track retrieval in video archives using bag-of-faces sparse representation. IEEE Trans Circuits Syst Video Technol 27(7):1595–1603
Liu D, Li J, Wang N, Peng C, Gao X (2018) Composite components-based face sketch recognition. Neurocomputing 302:46–54
Zaeemzadeh A, Ghadar S, Faieta B, Lin Z, Rahnavard N, Shah M, Kalarot R (2021) Face image retrieval with attribute manipulation, In: Proceedings of the IEEE/CVF International Conference on Computer Vision pp 12116–12125
Dai P, Wang X, Zhang W, Zhang P, You W (2018) Implicit relative attribute enabled cross-modality hashing for face image-video retrieval. Multimedia Tools Appl 77(18):23547–23577
An L, Zou C, Zhang L, Denney B (2016) Scalable attribute-driven face image retrieval. Neurocomputing 172:215–224
Li Y, Wang R, Liu H, Jiang H, Shan S, Chen X (2015) Two birds, one stone: jointly learning binary code for large-scale face image retrieval and attributes prediction, In: Proceedings of the IEEE International Conference on Computer Vision pp 3819–3827
Li Y, Wang R, Huang Z, Shan S, Chen X (2015) Face video retrieval with image query via hashing across euclidean space and riemannian manifold. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
Li Y, Wang R, Cui Z, Shan S, Chen X (2016) Spatial pyramid covariance-based compact video code for robust face retrieval in tv-series. IEEE Trans Image Process 25(12):5905–5919
Kafai M, Eshghi K, Bhanu B (2014) Discrete cosine transform locality-sensitive hashes for face retrieval. IEEE Trans Multimedia 16(4):1090–1103
Li Y, Wang R, Shan S, Chen X (2015) Hierarchical hybrid statistic based video binary code and its application to face retrieval in tv-series, vol. 1, IEEE. In: 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition pp 1–8
Li Y, Wang R, Cui Z, Shan S, Chen X (2014) Compact video code and its application to robust face retrieval in TV-series
Gui J, Liu T, Sun Z, Tao D, Tan T (2017) Fast supervised discrete hashing. IEEE Trans Pattern Anal Mach Intell 40(2):490–496
Dubey SR (2017) Face retrieval using frequency decoded local descriptor. Multimedia Tools Appl 78(12):16411–16431
Geng X (2016) Label distribution learning. IEEE Trans Knowl Data Eng 28:1734–1748
Xu N, Liu YP, Geng X (2019) Label enhancement for label distribution learning. IEEE Trans Knowl Data Eng 33:1632–1643
Xu N, Shu J, Zheng R, Geng X, Meng D, Zhang ML (2022) Variational label enhancement. IEEE Trans Pattern Anal Mach Intell 45:6537–6551
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436
Wang D, Jain AK (2015) Face retriever: Pre-filtering the gallery via deep neural net, IEEE. In: International Conference on Biometrics pp 473–480
Li Q, Sun Z, He R, Tan T (2017) Deep supervised discrete hashing. In NIPS
Zhe X, Chen S, Yan H (2020) Deep class-wise hashing: semantics-preserving hashing via class-wise loss. IEEE Trans Neural Netw Learn Syst 31(5):1681–1695
Zhang R, Lin L, Zhang R, Zuo W, Zhang L (2015) Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Trans Image Process 24(12):4766–4779
Wang R, Wang R, Qiao S, Shan S, Chen X (2020) Deep position-aware hashing for semantic continuous image retrieval. In WACV
Zhang M, Zhe X, Chen S, Yan H (2021) Deep center-based dual-constrained hashing for discriminative face image retrieval. Pattern Recogn 117:107976
Zhang M, Zhe X, Yan H (2023) Orthonormal product quantization network for scalable face image retrieval. Pattern Recogn 141:109671
Tarawneh AS, Hassanat A, Celik C, Chetverikov D, Rahman MS, Verma C (2019) Deep face image retrieval: a comparative study with dictionary learning, IEEE. In: 10th International Conference on Information and Communication Systems (ICICS) pp 185–192
Jing C, Dong Z, Pei M, Jia Y (2019) Heterogeneous hashing network for face retrieval across image and video domains. IEEE Trans Multimedia 21(3):782–794
Xu C, Liu Q, Ye M (2017) Age invariant face recognition and retrieval by coupled auto-encoder networks. Neurocomputing 222:62–71
Liu D, Gao X, Wang N, Li J, Peng C (2020) Coupled attribute learning for heterogeneous face recognition. IEEE Trans Neural Netw Learn Syst 31:4699–4712
Liu D, Gao X, Peng C, Wang N, Li J (2022) Heterogeneous face interpretable disentangled representation for joint face recognition and synthesis. IEEE Trans Neural Netw Learn Syst 33:5611–5625
Liu D, Yang W, Peng C, Wang N, Hu R, Gao X (2023) Modality-agnostic augmented multi-collaboration representation for semi-supervised heterogenous face recognition, In: Proceedings of the 31st ACM International Conference on Multimedia pp 4647–4656
Jiang H, Learned-Miller E (2017) Face detection with the faster R-CNN. In: IEEE International Conference on Automatic Face and Gesture Recognition
Hannane R, Elboushaki A, Afdel K (2020) A divide-and-conquer strategy for facial landmark detection using dual-task cnn architecture. Pattern Recongnition 107:107504
Han X, Leung T, Jia Y, Sukthankar R, Berg AC (2015) Matchnet: Unifying feature and metric learning for patch-based matching, In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition pp 3279–3286
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition, In: Proceedings of the IEEE conference on computer vision and pattern recognition pp 770–778
Huang GB, Ramesh M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. University of Massachusetts, Amherst, Technical Report, pp 07–49
Yuan L, Wang T, Zhang X, Tay FE, Jie Z, Liu W, Feng J (2020) Central similarity quantization for efficient image and video retrieval, In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp 3083–3092
Jang YK, Cho NI (2020) Generalized product quantization network for semi-supervised image retrieval, In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition pp 3420–3429
Xiong Z, Wu D, Gu W, Zhang H, Li B, Wang W (2020) Deep discrete attention guided hashing for face image retrieval, In: Proceedings of the 2020 International Conference on Multimedia Retrieval pp 136–144
Feng P, Zhang H, Sun Y, Tang Z (2023) Lightweight image hashing based on knowledge distillation and optimal transport for face retrieval, In: 29th International Conference on Multimedia Modeling pp 423–434
Liu H, Wang R, Shan S, Chen X (2016) Deep supervised hashing for fast image retrieval, In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition pp 2064–2072
Li W.J, Wang S, Kang W.C (2016) Feature learning based deep supervised hashing with pairwise labels, In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence pp 1711–1717
Wang X, Shi Y, Kitani K.M (2016) Deep supervised hashing with triplet labels, In: 13th Asian Conference on Computer Vision pp 70–84
Zhou L, Wang Y, Xiao B, Xu Q (2021) Dfh-gan: a deep face hashing with generative adversarial network, In: 25th International Conference on Pattern Recognition (ICPR) pp 7012–7019
Acknowledgements
This work is supported in part by the PPR2-2015 project under grant number 14UIZ2015, and in part by the Al Khawarizmi project under grant number ALKHAWARIZMI/2020/02 financed by the Moroccan government through the CNRST funding program.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no Conflict of interest to declare that are relevant to the content of this article.
Human participants
This article does not contain any studies involving human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Elboushaki, A., Hannane, R. & Afdel, K. Similarity-based face image retrieval using sparsely embedded deep features and binary code learning. Int J Multimed Info Retr 13, 28 (2024). https://doi.org/10.1007/s13735-024-00337-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13735-024-00337-5