[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3436369.3436492acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccprConference Proceedingsconference-collections
research-article

Pedestrian and Face Detection with Low Resolution Based on Improved MTCNN

Published: 11 January 2021 Publication History

Abstract

In recent years, the application of deep learning based on deep convolutional neural networks has gained great success in face detection. However, the large visual variations of pedestrians and faces, such as large pose variations and dark lightings, resulting in lower resolution targets, impose great challenges for these tasks in real-world applications. To solve this problem, we present a conceptually simple, end-to-end, and general framework for pedestrian and face detection. Our approach efficiently detects both pedestrian and face in an image. First, an efficient improved P-Net is developed to detect a pedestrian. Then an efficient improved R-Net1 is developed to filter pedestrian targets in the second level, and improved R-Net2 carries out the preliminary detection of face targets in the remaining pedestrian targets. In order to improve the face detection rate on a small scale, improved R-Net2 introduces a multi-level feature fusion mechanism. Last, an improved O-Net is proposed to identify pedestrian and face regions. Compared to state-of-the-art face detection methods such as Multiscale Cascade CNN、 Faceness、 Two-stage CNN、 MTCNN, the proposed method achieves promising performance on WIDER FACE benchmarks, our method also reaches promising results on the Caltech benchmarks.

References

[1]
K. Zhang, Z. Zhang, Z. Li, Y.J.I.S.P.L. Qiao, Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks, 23 (2016), 1499--1503.
[2]
P. Viola, M. Jones, Rapid object detection using a boosted cascade of simple features, in: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, 2001, pp. I-I.
[3]
P. Viola, M.J. Jones, Robust Real-Time Face Detection, International Journal of Computer Vision, 57 (2004), 137--154.
[4]
M.N. Chaudhari, M. Deshmukh, G. Ramrakhiani, R. Parvatikar, Face Detection Using Viola Jones Algorithm and Neural Networks, in: 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), 2018, pp. 1--6.
[5]
Y. Bin, J. Yan, Z. Lei, S.Z. Li, Aggregate channel features for multi-view face detection, in: IEEE International Joint Conference on Biometrics, 2014, pp. 1--8.
[6]
J. Li, Y. Zhang, Learning SURF Cascade for Fast and Accurate Object Detection, in: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.
[7]
Q. Zhu, M.C. Yeh, K.T. Cheng, S. Avidan, Fast Human Detection Using a Cascade of Histograms of Oriented Gradients, in: Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, 2006.
[8]
P. Felzenszwalb, D. Mcallester, D.J.C. Ramanan, A discriminatively trained, multiscale, deformable part model, 8: (2008), 1--8.
[9]
X. Wang, T.X. Han, S. Yan, An HOG-LBP human detector with partial occlusion handling, in: 2009 IEEE 12th International Conference on Computer Vision, 2009, pp. 32--39.
[10]
Kim Pyeongkang, Kim Hyunghun, and Kim Taewoo, Enhanced HoG Technique for Pedestrian Detection Based on Quarter Cell, Journal of Image and Graphics, 5(2017), 64--67.
[11]
P. Dollár, R. Appel, S. Belongie, P. Perona, Fast Feature Pyramids for Object Detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 36 (2014), 1532--1545.
[12]
O. Prakash, J. Gwak, M. Khare, A. Khare, M. Jeon, Human detection in complex real scenes based on combination of biorthogonal wavelet transform and Zernike moments, Optik, 157 (2018), 1267--1281.
[13]
R. Girshick, J. Donahue, T. Darrell, J. Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, in: 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 580--587.
[14]
R. Girshick, Fast R-CNN, in: 2015 IEEE International Conference on Computer Vision (ICCV), 2015, pp. 1440--1448.
[15]
S. Ren, K. He, R. Girshick, J. Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39 (2017), 1137--1149.
[16]
J. Redmon, S. Divvala, R. Girshick, A. Farhadi, You Only Look Once: Unified, Real-Time Object Detection, in: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 779--788.
[17]
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, SSD: Single Shot MultiBox Detector, in, Springer International Publishing, Cham, 2016, pp. 21--37.
[18]
C.-Y. Fu, W. Liu, A. Ranga, A. Tyagi, A. Berg, DSSD: Deconvolutional Single Shot Detector, (2017).
[19]
S. Liu, S. Lv, H. Zhang, J. Gong, Pedestrian Detection Algorithm Based on the Improved SSD, in: 2019 Chinese Control And Decision Conference (CCDC), 2019, pp. 3559--3563.
[20]
J. Redmon, A. Farhadi, YOLO9000: Better, Faster, Stronger, in: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 6517--6525.
[21]
J. Redmon, A. Farhadi, YOLOv3: An Incremental Improvement, (2018).
[22]
Z. Yi, S. Yongliang, Z. Jun, An improved tiny-yolov3 pedestrian detection algorithm, Optik, 183 (2019), 17--23.
[23]
Z. Ahmed, R. Iniyavan, P. M. M, Enhanced Vulnerable Pedestrian Detection using Deep Learning, in: 2019 International Conference on Communication and Signal Processing (ICCSP), 2019, pp. 0971--0974.
[24]
L. Gong, X.U. Meihua, D. Liu, F. Zhang, M. Ramp, D. Center, S.J.J.o.S.U. University, Novel model of pedestrian detection based on Gaussian mixture model and HOG+SVM, (2018).
[25]
C. Zhou, M. Wu, S.-K. Lam, Fast and Accurate Pedestrian Detection using Dual-Stage Group Cost-Sensitive RealBoost with Vector Form Filters, in: Proceedings of the 25th ACM international conference on Multimedia, Association for Computing Machinery, Mountain View, California, USA, 2017, pp. 735--743.
[26]
V. Mutneja, S. Singh, GPU accelerated face detection from low resolution surveillance videos using motion and skin color segmentation, Optik, 157 (2018), 1155--1165.
[27]
S. Yang, P. Luo, C. Loy, X. Tang, From Facial Parts Responses to Face Detection: A Deep Learning Approach, in: 2015 IEEE International Conference on Computer Vision (ICCV), 2015, pp. 3676--3684.
[28]
H. Li, Z. Lin, X. Shen, J. Brandt, G. Hua, A convolutional neural network cascade for face detection, in: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 5325--5334.
[29]
P. Hu, D. Ramanan, Finding Tiny Faces, in: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 1522--1530.
[30]
Z. Hao, Q. Feng, L. Kaidong, An Optimized Face Detection Based on Adaboost Algorithm, in: 2018 International Conference on Information Systems and Computer Aided Education (ICISCAE), 2018, pp. 375--378.
[31]
W. Yang, Z. Jiachun, Real-time face detection based on YOLO, in: 2018 1st IEEE International Conference on Knowledge Innovation and Invention (ICKII), 2018, pp. 221--224.

Index Terms

  1. Pedestrian and Face Detection with Low Resolution Based on Improved MTCNN

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCPR '20: Proceedings of the 2020 9th International Conference on Computing and Pattern Recognition
    October 2020
    552 pages
    ISBN:9781450387835
    DOI:10.1145/3436369
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Beijing University of Technology

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 January 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Pedestrian and face detection
    2. improved O-Net
    3. improved P-Net
    4. improved R-Net

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCPR 2020

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 98
      Total Downloads
    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media