[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3243082.3267444acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
short-paper

A comprehensive analysis of Local Binary Convolutional Neural Network for fast face recognition in surveillance video

Published: 16 October 2018 Publication History

Abstract

Recent research in convolutional neural network (CNN) has provided a variety of new architectures for deep learning. One interesting new architecture is the local binary convolutional neural network (LBCNN), which has shown to provide significant reduction in the number of parameters to be learned at training. In this paper, we study the influence of network parameters in the scenario of face recognition, comparing LBCNN against other famous networks available in the literature in terms of sensibility and processing time. In our study, we also propose a pre-processing step on images to increase the accuracy of the model, besides investigating its behaviour with noisy images. Our experiments are carried on the Chokepoint dataset, whose face subimages were collected from video frames under real-world surveillance conditions, including variations in terms of illumination, sharpness, pose, and misalignment due to automatic face detection. The conclusion is that by using the Laplacian step and a reduced amount of LBC modules, it is possible to train LBCNN more quickly and with improved accuracy. In addition, it was found that LBCNN is very sensitive to noise and better results can be achieved when noisy images are inserted in the training set.

References

[1]
Timo Ahonen, Abdenour Hadid, and Matti Pietikäinen. 2004. Face Recognition with Local Binary Patterns. In Computer Vision - ECCV 2004, Tomás Pajdla and Jiří Matas (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 469--481.
[2]
Reshma Suresh Babu, B. Radhakrishnan, and L. Padma Suresh. 2016. Detection and extraction of roads from satellite images based on Laplacian of Gaussian operator. In 2016 International Conference on Emerging Technological Trends (ICETT). 1--7.
[3]
Saman Bashbaghi, Eric Granger, Robert Sabourin, and Mostafa Parchami. 2018. Deep Learning Architectures for Face Recognition in Video Surveillance. CoRR abs/1802.09990 (2018). arXiv:1802.09990 http://arxiv.org/abs/1802.09990
[4]
Moulay Smail Bouzakraoui, Abdelalim Sadiq, and Nourddine Enneya. 2017. A Customer Emotion Recognition Through Facial Expression Using POEM Descriptor and SVM Classifier. In Proceedings of the 2Nd International Conference on Big Data, Cloud and Applications (BDCA'17). ACM, New York, NY, USA, Article 80, 6 pages.
[5]
Qiong Cao, Li Shen, Weidi Xie, Omkar M. Parkhi, and Andrew Zisserman. 2017. VGGFace2: A dataset for recognising faces across pose and age. CoRR abs/1710.08092 (2017). arXiv:1710.08092 http://arxiv.org/abs/1710.08092
[6]
J. Cheng, J. Wu, C. Leng, Y. Wang, and Q. Hu. 2017. Quantized CNN: A Unified Approach to Accelerate and Compress Convolutional Networks. IEEE Transactions on Neural Networks and Learning Systems (2017), 1--14.
[7]
Ronan Collobert, Koray Kavukcuoglu, and Clément Farabet. 2011. Torch7: A Matlab-like Environment for Machine Learning. In BigLearn, NIPS Workshop.
[8]
Matthieu Courbariaux and Yoshua Bengio. 2016. BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1. CoRR abs/1602.02830 (2016). arXiv:1602.02830 http://arxiv.org/abs/1602.02830
[9]
Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. 2015. BinaryConnect: Training Deep Neural Networks with binary weights during propagations. CoRR abs/1511.00363 (2015). arXiv:1511.00363 http://arxiv.org/abs/1511.00363
[10]
S. Dodge and L. Karam. 2016. Understanding how image quality affects deep neural networks. In 2016 Eighth International Conference on Quality of Multimedia Experience (QoMEX). 1--6.
[11]
E. F. Ersi and J. S. Zelek. 2006. Local Feature Matching For Face Recognition. In The 3rd Canadian Conference on Computer and Robot Vision (CRV'06). 4--4.
[12]
K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770--778.
[13]
Felix Juefei-Xu, Vishnu Naresh Boddeti, and Marios Savvides. 2017. Local Binary Convolutional Neural Networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21-26, 2017. 4284--4293.
[14]
Idelette Laure Kambi Beli and Chunsheng Guo. 2017. Enhancing Face Identification Using Local Binary Patterns and K-Nearest Neighbors. Journal of Imaging 3, 3 (2017).
[15]
Christian Kraetzer, Andrey Makrushin, Tom Neubert, Mario Hildebrandt, and Jana Dittmann. 2017. Modeling Attacks on Photo-ID Documents and Applying Media Forensics for the Detection of Facial Morphing. In Proceedings of the 5th ACM Workshop on Information Hiding and Multimedia Security (IH&MMSec '17). ACM, New York, NY, USA, 21--32.
[16]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems 25, F. Pereira, C. J. C. Burges, L. Bottou, and K. Q. Weinberger (Eds.). Curran Associates, Inc., 1097--1105. http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
[17]
T. Ojala, M. Pietikainen, and T. Maenpaa. 2002. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 7 (Jul 2002), 971--987.
[18]
K. K. Pal and K. S. Sudeep. 2016. Preprocessing for image classification by convolutional neural networks. In 2016 IEEE International Conference on Recent Trends in Electronics, Information Communication Technology (RTEICT). 1778--1781.
[19]
Raghavendra Ramachandra and Christoph Busch. 2017. Presentation Attack Detection Methods for Face Recognition Systems: A Comprehensive Survey. ACM Comput. Surv. 50, 1, Article 8 (March 2017), 37 pages.
[20]
Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, and Ali Farhadi. 2016. XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks. In ECCV.
[21]
S. S. Roy, M. Ahmed, and M. A. H. Akhand. 2017. Classification of massive noisy image using auto-encoders and convolutional neural network. In 2017 8th International Conference on Information Technology (ICIT). 971--979.
[22]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. FaceNet: A Unified Embedding for Face Recognition and Clustering. CoRR abs/1503.03832 (2015). arXiv:1503.03832 http://arxiv.org/abs/1503.03832
[23]
K. Simonyan and A. Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. International Conference on Learning Representations (ICLR) (2015).
[24]
Sima Soltanpour, Boubakeur Boufama, and Q.M. Jonathan Wu. 2017. A survey of local feature methods for 3D face recognition. Pattern Recognition 72 (2017), 391 -- 406.
[25]
C. Szegedy, Wei Liu, Yangqing Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. 2015. Going deeper with convolutions. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1--9.
[26]
N. Tajbakhsh, J. Y. Shin, S. R. Gurudu, R. T. Hurst, C. B. Kendall, M. B. Gotway, and J. Liang. 2016. Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? IEEE Transactions on Medical Imaging 35, 5 (May 2016), 1299--1312.
[27]
Y. Tian, B. Fan, and F. Wu. 2017. L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6128--6136.
[28]
S. Widyarto, S. R. B. Kassim, and W. K. Sari. 2017. 2D-sigmoid enhancement prior to segment MRI Glioma tumour: Pre image-processing. In 2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI). 1--5.
[29]
Yongkang Wong, Shaokang Chen, Sandra Mau, Conrad Sanderson, and Brian C. Lovell. 2011. Patch-based Probabilistic Image Quality Assessment for Face Selection and Improved Video-based Face Recognition. In IEEE Biometrics Workshop, Computer Vision and Pattern Recognition (CVPR) Workshops. IEEE, 81--88.
[30]
Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z. Li. 2014. Learning Face Representation from Scratch. CoRR abs/1411.7923 (2014). arXiv:1411.7923 http://arxiv.org/abs/1411.7923
[31]
Jonghwa Yim and Kyung-Ah Sohn. 2017. Enhancing the Performance of Convolutional Neural Networks on Quality Degraded Datasets. CoRR abs/1710.06805 (2017). arXiv:1710.06805 http://arxiv.org/abs/1710.06805

Cited By

View all
  • (2023)VaBTFER: An Effective Variant Binary Transformer for Facial Expression RecognitionSensors10.3390/s2401014724:1(147)Online publication date: 27-Dec-2023
  • (2021)A comparison among keyframe extraction techniques for CNN classification based on video periocular imagesMultimedia Tools and Applications10.1007/s11042-020-10384-9Online publication date: 13-Jan-2021
  • (2021)Towards Automated Surveillance: A Review of Intelligent Video SurveillanceIntelligent Computing10.1007/978-3-030-80129-8_53(784-803)Online publication date: 6-Jul-2021
  • Show More Cited By

Index Terms

  1. A comprehensive analysis of Local Binary Convolutional Neural Network for fast face recognition in surveillance video

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        WebMedia '18: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web
        October 2018
        437 pages
        ISBN:9781450358675
        DOI:10.1145/3243082
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        In-Cooperation

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 16 October 2018

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Convolutional neural network
        2. Face recognition
        3. Local Binary Pattern

        Qualifiers

        • Short-paper
        • Research
        • Refereed limited

        Funding Sources

        Conference

        WebMedia '18
        WebMedia '18: Brazilian Symposium on Multimedia and the Web
        October 16 - 19, 2018
        BA, Salvador, Brazil

        Acceptance Rates

        WebMedia '18 Paper Acceptance Rate 37 of 111 submissions, 33%;
        Overall Acceptance Rate 270 of 873 submissions, 31%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)5
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 05 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)VaBTFER: An Effective Variant Binary Transformer for Facial Expression RecognitionSensors10.3390/s2401014724:1(147)Online publication date: 27-Dec-2023
        • (2021)A comparison among keyframe extraction techniques for CNN classification based on video periocular imagesMultimedia Tools and Applications10.1007/s11042-020-10384-9Online publication date: 13-Jan-2021
        • (2021)Towards Automated Surveillance: A Review of Intelligent Video SurveillanceIntelligent Computing10.1007/978-3-030-80129-8_53(784-803)Online publication date: 6-Jul-2021
        • (2020)Analysis of video surveillance images using computer vision in a controlled security environment2020 15th Iberian Conference on Information Systems and Technologies (CISTI)10.23919/CISTI49556.2020.9141068(1-6)Online publication date: Jun-2020
        • (2020)Deep-learned faces: a surveyEURASIP Journal on Image and Video Processing10.1186/s13640-020-00510-w2020:1Online publication date: 29-Jun-2020

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media