Abstract
The growth in the number of vehicles in the world makes it hard to safely share the environment with pedestrians. Pedestrian’s safety is an important task that needs to be granted in the traffic environment. New cars are equipped with advanced driver assistance systems (ADAS) with a variety of applications. Pedestrian detection application is one of the most important applications for an ADAS that needs to be enhanced. In this paper, we propose a pedestrian detection system to be implemented in an ADAS. The proposed system is based on convolutional neural networks thanks to its performance when solving computer vision applications. On the other side, the proposed system ensures real-time processing and high detection performance. The proposed system will be designed by tacking the advantage of building lightweight convolution blocks and model compression techniques to ensure an embedded implementation. Those blocks will guarantee high precision and fast processing speed. To train and evaluate the proposed system, we used the Caltech dataset. The evaluation of the proposed system resulted in 87% of mean average precision and an inference speed of 35 frames per second.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Yosinski J, Clune J, Bengio Y, Lipson H (2014) How transferable are features in deep neural networks? In: Advances in neural information processing systems, pp 3320–3328
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp 249–256
Yu J, Tan M, Zhang H, Tao D, Rui Y (2019) Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2019.2932058
Yu J, Tao D, Wang M, Rui Y (2014) Learning to rank using user clicks and visual features for image retrieval. IEEE Trans Cybern 45(4):767–779
Yu J, Zhu C, Zhang J, Huang Q, Tao D (2019) Spatial pyramid-enhanced NetVLAD with weighted triplet loss for place recognition. IEEE Trans Neural Netw Learn Syst 31(2):661–674
Ayachi R, Said Y, Atri M (2019) To perform road signs recognition for autonomous vehicles using cascaded deep learning pipeline. Artif Intell Adv 1(1):1–10
Afif M, Ayachi R, Said Y, Pissaloux E, Atri M (2018) Indoor image recognition and classification via deep convolutional neural network. In: International conference on the sciences of electronics, technologies of information and telecommunications. Springer, Cham, pp 364–371
Deng J, Guo J, Xue N, Zafeiriou S (2019) Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4690–4699
Hong C, Yu J, Zhang J, Jin X, Lee K-H (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Ind Inform 15(7):3952–3961
Wang Y, Haichao YU, Gao D, Wang J (2019) Image segmentation and object detection using fully convolutional neural network. U.S. Patent Application 10/304,193, filed May 28
Ayachi R, Afif M, Said Y, Atri M (2020) Traffic signs detection for real-world application of an advanced driving assisting system using deep learning. Neural Process Lett 51(1):837–851
Yu J, Yao J, Zhang J, Yu Z, Tao D (2020) SPRNet: single-pixel reconstruction for one-stage instance segmentation. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2020.2969046
Hong C, Jun Yu, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670
Zhang J, Jun Yu, Tao D (2018) Local deep-feature alignment for unsupervised dimension reduction. IEEE Trans Image Process 27(5):2420–2432
Krishnamoorthi R (2018) Quantizing deep convolutional networks for efficient inference: a whitepaper. arXiv preprint arXiv:1806.08342
Yeom S-K, Seegerer P, Lapuschkin S, Wiedemann S, Müller K-R, Samek W (2019) Pruning by explaining: a novel criterion for deep neural network pruning. arXiv preprint arXiv:1912.08881
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Liu Z, Chen Z, Li Z, Hu W (2018) An efficient pedestrian detection method based on YOLOv2. Math Probl Eng. https://doi.org/10.1155/2018/3518959
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271
Dalal N, Triggs B (2005) INRIA person dataset. Online: http://pascal.inrialpes.fr/data/human. Accessed 21 Sept 2019
Dollár P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: a benchmark, pp 304–311
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Advances in neural information processing systems, pp 2672–2680
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision. Springer, Cham, pp 21–37
Dinakaran RK, Easom P, Bouridane A, Zhang L, Jiang R, Mehboob F, Rauf A (2019) Deep learning based pedestrian detection at distance in smart cities. In: Proceedings of SAI intelligent systems conference. Springer, Cham, pp 588–593
Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images, vol 1, no. 4. Technical report. University of Toronto
Zhang L, Lin L, Liang X, He K (2016) Is faster R-CNN doing well for pedestrian detection? In: European conference on computer vision. Springer, Cham, pp 443–457
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
Friedman J, Hastie T, Tibshirani R (2000) Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). Ann Stat 28(2):337–407
Appel R, Fuchs T, Dollár P, Perona P (2013) Quickly boosting decision trees—pruning underachieving features early. In: International conference on machine learning, pp 594–602
Said Y, Barr M (2019) Pedestrian detection for advanced driver assistance systems using deep learning algorithms. Int J Comput Sci Netw Secur 19(9):10
Tian Y, Luo P, Wang X, Tang X (2015) Pedestrian detection aided by deep learning semantic tasks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5079–5087
Ess A, Leibe B, Van Gool L (2007) Depth and appearance for mobile scene analysis. In: 2007 IEEE 11th international conference on computer vision. IEEE, pp 1–8
Du X, El-Khamy M, Morariu VI, Lee J, Davis L (2018) Fused deep neural networks for efficient pedestrian detection. arXiv preprint arXiv:1805.08688
Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. J Artif Intell Res 11:169–198
Deng L, Platt JC (2014) Ensemble deep learning for speech recognition. In: Fifteenth annual conference of the international speech communication association
Bjorck N, Gomes CP, Selman B, Weinberger KQ (2018) Understanding batch normalization. In: Advances in neural information processing systems, pp 7694–7705
Ayachi R, Afif M, Said Y, Atri M (2018) Strided convolution instead of max pooling for memory efficiency of convolutional neural networks. In: International conference on the sciences of electronics, technologies of information and telecommunications. Springer, Cham, pp 234–243
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Verbickas R, Laganiere R, Laroche D, Zhu C, Xu X, Ors A (2017) SqueezeMap: fast pedestrian detection on a low-power automotive processor using efficient convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 146–154
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest regarding the publication of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ayachi, R., Said, Y. & Ben Abdelaali, A. Pedestrian Detection Based on Light-Weighted Separable Convolution for Advanced Driver Assistance Systems. Neural Process Lett 52, 2655–2668 (2020). https://doi.org/10.1007/s11063-020-10367-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-020-10367-9