research-article

Pedestrian and Face Detection with Low Resolution Based on Improved MTCNN

Authors:

Jing Li

ICCPR '20: Proceedings of the 2020 9th International Conference on Computing and Pattern Recognition

Pages 174 - 180

https://doi.org/10.1145/3436369.3436492

Published: 11 January 2021

Abstract

In recent years, deep learning based on deep convolutional neural networks has achieved great success in face detection. However, the large visual variations of pedestrians and faces, such as large pose variations and dark lighting, result in lower-resolution targets and pose great challenges for these tasks in real-world applications. To solve this problem, we present a conceptually simple, end-to-end, and general framework for pedestrian and face detection. Our approach efficiently detects both pedestrians and faces in an image. First, an efficient improved P-Net is developed to detect pedestrians. Then, at the second level, an efficient improved R-Net1 filters the pedestrian targets, and an improved R-Net2 carries out preliminary detection of face targets within the remaining pedestrian targets. To improve the detection rate for small-scale faces, the improved R-Net2 introduces a multi-level feature fusion mechanism. Finally, an improved O-Net is proposed to identify pedestrian and face regions. Compared to state-of-the-art face detection methods such as Multiscale Cascade CNN, Faceness, Two-stage CNN, and MTCNN, the proposed method achieves promising performance on the WIDER FACE benchmarks. It also reaches promising results on the Caltech benchmarks.
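
The control flow of the cascade described in the abstract can be sketched roughly as follows. This is only an illustrative outline, not the author's implementation: the four networks are passed in as plain callables, and the names `Pedestrian`, `detect`, `keep_threshold`, and the `fake_*` stand-ins are all hypothetical. The abstract does not specify network architectures, thresholds, or how the multi-level feature fusion in R-Net2 works, so none of that is modeled here.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height); an assumed convention


@dataclass
class Pedestrian:
    box: Box
    score: float
    faces: List[Box] = field(default_factory=list)


def detect(image,
           p_net: Callable[[object], List[Pedestrian]],
           r_net1: Callable[[object, Pedestrian], float],
           r_net2: Callable[[object, Pedestrian], List[Box]],
           o_net: Callable[[object, Pedestrian], bool],
           keep_threshold: float = 0.5) -> List[Pedestrian]:
    """Run the four stages in order; every threshold here is an assumption."""
    # Stage 1: P-Net proposes pedestrian candidates.
    candidates = p_net(image)

    # Stage 2a: R-Net1 re-scores candidates and filters out weak ones.
    survivors = []
    for cand in candidates:
        cand.score = r_net1(image, cand)
        if cand.score >= keep_threshold:
            survivors.append(cand)

    # Stage 2b: R-Net2 looks for faces only inside surviving pedestrians.
    for cand in survivors:
        cand.faces = r_net2(image, cand)

    # Stage 3: O-Net makes the final decision on each pedestrian/face region.
    return [cand for cand in survivors if o_net(image, cand)]


# Hypothetical stand-ins for the trained networks, for demonstration only.
def fake_p_net(image):
    return [Pedestrian((0, 0, 10, 20), 0.9), Pedestrian((5, 5, 10, 20), 0.2)]


def fake_r_net1(image, cand):
    return cand.score


def fake_r_net2(image, cand):
    x, y, _, _ = cand.box
    return [(x + 2, y + 1, 4, 4)]


def fake_o_net(image, cand):
    return len(cand.faces) > 0
```

Calling `detect(None, fake_p_net, fake_r_net1, fake_r_net2, fake_o_net)` with these stand-ins keeps only the high-scoring candidate and attaches one face box to it. The point of the sketch is the ordering: face detection runs only on pedestrian regions that survive the earlier filter, which is what lets a cascade spend less work on rejected regions.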

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object detection

Information

Published In

ICCPR '20: Proceedings of the 2020 9th International Conference on Computing and Pattern Recognition

October 2020

552 pages

ISBN:9781450387835

DOI:10.1145/3436369

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Beijing University of Technology

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICCPR 2020

ICCPR 2020: 2020 9th International Conference on Computing and Pattern Recognition

October 30 - November 1, 2020

Xiamen, China
