Abstract
In the contemporary era, the global explosion of traffic has created many eye-catching concerns for policymakers. This not only enhances pollution but also leads to several road accident fatalities which may be greatly reduced by proper monitoring and surveillance. Further, with the advent of UAV technology and due to the incompatibility of traditional techniques, surveillance has become one of UAVs prominent application domains. However, it requires algorithmic analysis of aerial images which becomes extremely challenging due to multi-scale rotating objects with large aspect ratios, extremely imbalanced categories, cluttered background, and birds-eye view. Therefore, this article presents the novel aerial image traffic monitoring and surveillance algorithms based on the most advanced and popular DL object detection models (Faster-RCNN, SSD, YOLOv3, and YOLOv4) using the AU-AIR dataset. This dataset is exceedingly imbalanced and to resolve this issue, another 500 images have been grabbed by web-mining techniques. The novel contribution of this work is two-fold. First, this article scientifically distinguishes the inappropriateness of ground-view images for aerial object detection. Second, a regress comparison of these algorithms has been made to investigate their effectiveness. Extensive experimental analysis endorses the efficiency of YOLOv4 as it outperforms the other developed models by a minimum mAP margin of 88%. Also, more than 6 times high detection speed and greater adaptability with stronger detection robustness ensure its real-time practical implementation.
Similar content being viewed by others
Abbreviations
- CNN:
-
Convolutional Neural Network
- R-CNN:
-
Region-based Convolutional Neural Network
- SPP:
-
Spatial Pyramid Pooling
- DL:
-
Deep Learning
- FPS:
-
Frames per second
- SSD:
-
Single-shot detector
- IoU:
-
Intersection over Union
- SVM:
-
Support Vector Machine
- mAP:
-
Mean Average Precision
- UAV:
-
Unmanned aerial Vehicles
- WHO:
-
World Health Organization
- MAV:
-
Manned Aerial Vehicle
- PANet:
-
Path Aggregation Network
- YOLO:
-
You Only Look Once
References
Ali N, Jhanjhi NZ, Nawaz S et al (2020) Smart traffic monitoring system using unmanned aerial vehicles ( UAVs ). Comput Commun 157:434–443. https://doi.org/10.1016/j.comcom.2020.04.049
Al-Turjman F (2019) A novel approach for drones positioning in mission critical applications. Trans Emerg Telecommun Technol n/a:e3603. https://doi.org/10.1002/ett.3603
Barmpounakis EN, Vlahogianni EI, Golias JC (2016) Unmanned aerial aircraft systems for transportation engineering: current practice and future challenges. Int J Transp Sci Technol 5:111–122. https://doi.org/10.1016/j.ijtst.2017.02.001
Barnich O, Van Droogenbroeck M (2011) ViBe: a universal background subtraction algorithm for video sequences. IEEE Trans Image Process 20:1709–1724. https://doi.org/10.1109/TIP.2010.2101613
Benjdira B, Khursheed T, Koubaa A, et al (2018) Car detection using unmanned aerial vehicles: comparison between faster R-CNN and YOLOv3. 1–6
Bochkovskiy A, Wang C-Y, Liao H-YM (2020) YOLOv4: optimal speed and accuracy of object detection
Bonali FL, Tibaldi A, Marchese F, Fallati L, Russo E, Corselli C, Savini A (2019) UAV-based surveying in volcano-tectonics: an example from the Iceland rift. J Struct Geol 121:46–64. https://doi.org/10.1016/j.jsg.2019.02.004
Bozcan I, Kayacan E (2020) AU-AIR: a multi-modal unmanned aerial vehicle dataset for low altitude traffic surveillance. In: 2020 IEEE International Conference on Robotics and Automation (ICRA). pp 8504–8510
Chang FR, Huang HL, Schwebel DC, Chan AHS, Hu GQ (2020) Global road traffic injury statistics: challenges, mechanisms and solutions. Chinese J Traumatol - English Ed 23:216–218
Choi Y, Kim N, Hwang S, Park K, Yoon JS, An K, Kweon IS (2018) KAIST multi-spectral day/night data set for autonomous and assisted driving. IEEE Trans Intell Transp Syst 19:934–948. https://doi.org/10.1109/TITS.2018.2791533
Chriki A, Touati H, Snoussi H, Kamoun F (2020) Deep learning and handcrafted features for one-class anomaly detection in UAV video. Multimed Tools Appl 80:1–22. https://doi.org/10.1007/s11042-020-09774-w
Christiansen MP, Laursen MS (2017) Designing and testing a UAV mapping system for agricultural field surveying 1:1–19. https://doi.org/10.3390/s17122703
Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks. Advances in Neural Information Processing Systems. Neural information processing systems foundation, In, pp 379–387
Deng J, Dong W, Socher R, et al (2009) ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. pp 248–255
Everingham M, Van Gool L, Williams CKI et al (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88:303–338. https://doi.org/10.1007/s11263-009-0275-4
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32:1627–1645. https://doi.org/10.1109/TPAMI.2009.167
Geiger A, Lenz P, Stiller C, Urtasun R (2013) Vision meets robotics : the KITTI dataset. 32:1231–1237. https://doi.org/10.1177/0278364913491297
Girshick R, Donahue J, Darrell T, Malik J (2016) Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans Pattern Anal Mach Intell 38:142–158. https://doi.org/10.1109/TPAMI.2015.2437384
Gomaa A, Abdelwahab MM, Abo-Zahhad M (2020) Efficient vehicle detection and tracking strategy in aerial videos by employing morphological operations and feature points motion analysis. Multimed Tools Appl 79:26023–26043. https://doi.org/10.1007/s11042-020-09242-5
Gomez M, Vergara A, Montenegro F et al (2020) Detection of banana plants and their major diseases through aerial images and machine learning methods : a case study in DR Congo and Republic of Benin. ISPRS J Photogramm Remote Sens 169:110–124. https://doi.org/10.1016/j.isprsjprs.2020.08.025
He K, Gkioxari G, Dollár P, Girshick R (2017) Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV). pp 2980–2988
Hendry CR-C (2019) Automatic license plate recognition via sliding-window darknet-YOLO deep learning. Image Vis Comput 87:47–56. https://doi.org/10.1016/j.imavis.2019.04.007
Hildmann H (2019) Review : using unmanned aerial vehicles ( UAVs ) as mobile sensing platforms ( MSPs ) for Disaster response. Civil Security and Public Safety. 13–19. https://doi.org/10.3390/drones3030059, 3
Karthik R, Hariharan M, Anand S et al (2020) Attention embedded residual CNN for disease detection in tomato leaves. Appl Soft Comput 86:105933. https://doi.org/10.1016/j.asoc.2019.105933
Khan NA, Jhanjhi NZ, Brohi SN, Nayyar A (2020) Chapter three - emerging use of UAV’s: secure communication protocol issues and challenges. In: Al-Turjman FBT-D in S-C (ed). Elsevier, pp 37–55
Kumar S, Yadav D, Gupta H, Verma OP, Ansari IA, Ahn CW (2021) A novel yolov3 algorithm-based deep learning approach for waste segregation: towards smart waste management. Electron 10:1–20. https://doi.org/10.3390/electronics10010014
Li Y, Dong H, Li H, Zhang X (2020) Multi-block SSD based on small object detection for UAV railway scene surveillance. Chinese J Aeronaut 33:1747–1755. https://doi.org/10.1016/j.cja.2020.02.024
Lin T-Y, Maire M, Belongie S, et al (2014) Microsoft COCO: common objects in context BT - computer vision – ECCV 2014. In: Fleet D, Pajdla T, Schiele B, Tuytelaars T (eds). Springer International Publishing, Cham, pp. 740–755
Liu W, Anguelov D, Erhan D, et al (2016) SSD: single shot multibox detector. In: lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). Springer Verlag, pp 21–37
Mittal P, Sharma A, Singh R (2020) Deep learning-based object detection in low-altitude UAV datasets: A survey Image Vis Comput 104046. https://doi.org/10.1016/j.imavis.2020.104046,, 104
Nie X, Yang M, Liu RW (2019) Deep neural network-based robust ship detection under different weather conditions. In: 2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019. Institute of Electrical and Electronics Engineers Inc., pp 47–52
Park MW, In Kim J, Lee YJ et al (2017) Vision-based surveillance system for monitoring traffic conditions. Multimed Tools Appl 76:25343–25367. https://doi.org/10.1007/s11042-017-4521-4
Pi Y, Nath ND, Behzadan AH (2020) Convolutional neural networks for object detection in aerial imagery for disaster response and recovery. Adv Eng Informatics 43:101009. https://doi.org/10.1016/j.aei.2019.101009
Rangel JC, Martínez-Gómez J, Romero-González C, García-Varea I, Cazorla M (2018) Semi-supervised 3D object recognition through CNN labeling. Appl Soft Comput 65:603–613. https://doi.org/10.1016/j.asoc.2018.02.005
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, In, pp 779–788
Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39:1137–1149. https://doi.org/10.1109/TPAMI.2016.2577031
Rohan A, Rabah M, Kim S (2019) Convolutional neural network-based real-time object detection and tracking for parrot AR drone 2. IEEE Access 7:69575–69584. https://doi.org/10.1109/ACCESS.2019.2919332
Sadykova D, Pernebayeva D, Bagheri M, James A (2020) IN-YOLO: real-time detection of outdoor high voltage insulators using UAV imaging. IEEE Trans Power Deliv 35:1599–1601. https://doi.org/10.1109/TPWRD.2019.2944741
Saleh M, Jhanjhi N, Abdullah A, Fatima-tuz-Zahra (2020) Proposing a privacy protection model in case of civilian drone. In: 2020 22nd International Conference on Advanced Communication Technology (ICACT). pp 596–602
Shastry AC, Schowengerdt RA (2005) Airborne video registration and traffic-flow parameter estimation. IEEE Trans Intell Transp Syst 6:391–405. https://doi.org/10.1109/TITS.2005.858621
Smitha JA, Rajkumar N (2020) Optimal feed forward neural network based automatic moving vehicle detection system in traffic surveillance system. Multimed Tools Appl 79:18591–18610. https://doi.org/10.1007/s11042-020-08757-1
Tian Y, Yang G, Wang Z, Wang H, Li E, Liang Z (2019) Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Comput Electron Agric 157:417–426. https://doi.org/10.1016/j.compag.2019.01.012
Tzutalin. LabelImg. Git code (2015). https://github.com/tzutalin/labelImg
Wang Z, Liu D, Lei Y, Niu X, Wang S, Shi L (2020) Small target detection based on bird’s visual information processing mechanism. Multimed Tools Appl 79:22083–22105. https://doi.org/10.1007/s11042-020-08807-8
Wu Y, Sui Y, Wang G (2017) Vision-based real-time aerial object localization and tracking for UAV sensing system. IEEE Access 5:23969–23978. https://doi.org/10.1109/ACCESS.2017.2764419
Wu D, Lv S, Jiang M, Song H (2020) Using channel pruning-based YOLO v4 deep learning algorithm for the real-time and accurate detection of apple flowers in natural environments. Comput Electron Agric 178:105742. https://doi.org/10.1016/j.compag.2020.105742
Xu Y, Yu G, Wang Y, Wu X, Ma Y (2017) Car detection from low-altitude UAV imagery with the faster R-CNN. 2017:1–10. https://doi.org/10.1155/2017/2823617
Zhang S, Wen L, Bian X, et al (2018) Single-shot refinement neural network for object detection. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. pp 4203–4212
Zhang J, Liang X, Wang M, Yang L, Zhuo L (2019) Coarse-to-fine object detection in unmanned aerial vehicle imagery using lightweight convolutional neural network and deep motion saliency. Neurocomputing. 398:555–565. https://doi.org/10.1016/j.neucom.2019.03.102
Zhu P, Wen L, Du D et al (2018) Vision meets drones : past. Present and Future:1–20
Acknowledgements
The first author would like to thank the Ministry of Human Resource Development, New Delhi, India for providing the Research Fellowship for carrying out this work. The authors would also like to thank ISRO, India for providing the support time to time to carry out this area of research.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gupta, H., Verma, O.P. Monitoring and surveillance of urban road traffic using low altitude drone images: a deep learning approach. Multimed Tools Appl 81, 19683–19703 (2022). https://doi.org/10.1007/s11042-021-11146-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11146-x