Abstract
Efficient vehicle detection has played an important role in Intelligent Transportation Systems (ITS) in smart cities. With the development of the Convolutional Neural Network (CNN) for objection detection, new applications have been designed to enable on-road vehicle detection algorithms. Therefore, this work aims to further improve the conventional CNN model for real-time detection on low-cost embedded hardware. In this study, a lightweight CNN model is proposed based on YOLOv4 Tiny to detect vehicles from the VEDAI dataset. In the proposed method, one additional scale feature map is added to make a total of three prediction boxes in the architecture. Then, the output image size of the second and third prediction boxes are upscaled in order to improve detection accuracy in detecting small size vehicles in the aerial images. The proposed model has been evaluated on NVIDIA Geforce 940MX GPU-based computer, Google Collab (TESLA K80) and Jetson Nano. Based on the experimental results, this study has demonstrated that the proposed model achieved better mean average precision (mAP) compared to the conventional YOLOv4 Tiny and previous works.
Similar content being viewed by others
References
Xu, D., Wu, Y.: Fe-Yolo: a feature enhancement network for remote sensing target detection. Remote Sensing 13(7), 1311 (2021)
Koay, H.V., Chuah, J.H., Chow, C.-O., Chang, Y.-L., Yong, K.K.: Yolo-RTUAV: Towards real-time vehicle detection through aerial images with low-cost edge devices. Remote Sensing 13(21), 4196 (2021)
Yang, Z., Pun-Cheng, L.S.C.: Vehicle detection in intelligent transportation systems and its applications under varying environments: a review. Image Vis. Comput. 69, 143–154 (2018)
Baran, R., Rusc, T., Fornalski, P.: A smart camera for the surveillance of vehicles in intelligent transportation systems. Multimed Tools Appl 75(17), 10471–10493 (2015)
Khalifa, O.O., Wajdi, M.H., Saeed, R.A., Hashim, A.H., Ahmed, M.Z., Ali, E.S.: Vehicle detection for vision-based intelligent transportation systems using convolutional neural network algorithm. J. Adv. Transp. (2022). https://doi.org/10.1155/2022/9189600
Xu, Y., Yu, G., Wu, X., Wang, Y., Ma, Y.: An enhanced viola-jones vehicle detection method from unmanned aerial vehicles imagery. IEEE Trans. Intell. Transp. Syst. 18, 1845–1856 (2017)
Chen, Z., Wang, C., Wen, C., Teng, X., Chen, Y., Guan, H., Luo, H., Cao, L., Li, J.: Vehicle detection in high-resolution aerial images via sparse representation and superpixels. IEEE Trans. Geosci. Remote Sensing 54, 103–116 (2016)
Cao, S., Yu, Y., Guan, H., Peng, D., Yan, W.: Affine-function transformation-based object matching for vehicle detection from unmanned aerial vehicle imagery. Remote Sensing 11, 1708 (2019)
Ringwald, T., Sommer, L., Schumann, A., Beyerer, J., Stiefelhagen, R.: UAV-net: a fast aerial vehicle detector for mobile platforms. In: proceedings of the 2019 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW), Long Beach, CA, USA, pp. 544–552. (16–17 June 2019)
He, Y., Pan, Z., Li, L., Shan, Y., Cao, D., Chen, L.: Real-time vehicle detection from short-range aerial image with compressed MobileNet. In: proceedings of the 2019 international conference on robotics and automation (ICRA), Montreal, Canada, pp. 8339–8345. (20–24 May 2019).
Zhang, P., Zhong, Y., Li, X.: SlimYOLOv3: Narrower, faster and better for real-time UAV applications. In: proceedings of the 2019 IEEE/CVF international conference on computer vision workshop (ICCVW), Seoul, Korea. (2019).
Shivappriya, S.N., Priyadarsini, M.J., Stateczny, A., Puttamadappa, C., Parameshachari, B.D.: Cascade object detection and remote sensing object detection method based on trainable activation function. Remote Sensing 13(2), 200 (2021)
Supreeth, H.S., Patil, C.M.: Efficient multiple moving object detection and tracking using combined background subtraction and clustering. SIViP 12(6), 1097–1105 (2018)
Tang, T., Zhou, S., Deng, Z., Zou, H., Lei, L.: Vehicle detection in aerial images based on region convolutional neural networks and hard negative example mining. Sensors 17(2), 336 (2017)
Husain, A.A., Maity, T., Yadav, R.K.: Vehicle detection in intelligent transport system under a hazy environment: a survey. IET Image Proc. 14(1), 1–10 (2020)
Bouguettaya, A., Ahmed, K., Taberkit, A.M.: A survey on lightweight CNN-based object detection algorithms for platforms with limited computational resources. Int. J. Inf. Appl. Math. 2(2), 28–44 (2019)
Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., Adam, H.: MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv:1704.0486. (2017).
Xiao, D., Shan, F., Li, Z., Le, B.T., Liu, X., Li, X.: A target detection model based on improved tiny-yolov3 under the environment of Mining Truck. IEEE Access 7, 123757–123764 (2019)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., & Berg, A. C.): SSD: single shot multibox detector. In: proceedings of the European conference on computer vision, Amsterdam, The Netherlands, pp. 21–37. (2016).
Wang, T., Anwer, R.M., Cholakkal, H., Khan, F.S., Pang, Y. and Shao, F.: Learning rich features at high-speed for single-shot object detection. In: IEEE/CVF international conference on computer vision (ICCV), 2019, pp. 1971-1980. (2019).
Li, J., Wang, H.: Surface defect detection of vehicle light guide plates based on an improved RetinaNet. Meas. Sci. Technol. 33(4), 045401 (2022)
Zhang, S., Wen, L., Bian, X., Lei, Z., & Li, S. Z.: Single-shot refinement neural network for object detection. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. (2018).
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.: You only look once: unified real-time object detection. In: proceedings IEEE conference computer vision pattern recognition, pp. 779–788. (2016).
Redmon, J., & Farhadi, A. Yolo9000: Better, faster, stronger. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR). (2017).
Jiang, Z., Zhao, L., Li, S., & Jia, Y.: Real-time object detection method based on improved Yolov4-Tiny. In: proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), Honolulu, HI, USA; pp. 6517–6525. (2020).
Redmon, J., & Farhadi, A.: Yolov3: An incremental improvement. arXiv.org. (2018).
Bochkovskiy, A., Wang, C.-Y., & Liao, H.-Y. M.: Yolov4: Optimal Speed and accuracy of object detection. arXiv.org. (2020).
Junos, M.H., Mohd Khairuddin, A.S., Dahari, M.: Automated object detection on aerial images for limited capacity embedded device using a lightweight CNN model. Alex. Eng. J. 61(8), 6023–6041 (2022)
Ghasemi Darehnaei, Z., Shokouhifar, M., Yazdanjouei, H., Rastegar Fatemi, S.M.: SI-EDTL: swarm intelligence ensemble deep transfer learning for multiple vehicle detection in UAV images. Concurr. Comput. Pract. Exp. (2021). https://doi.org/10.1002/cpe.6726
Bazi, Y., Melgani, F.: Convolutional SVM networks for object detection in UAV imagery. IEEE Trans Geosci. Remote Sens. 56(6), 3107–3118 (2018)
Ju, M., Luo, J., Zhang, P., He, M., Luo, H.: A simple and efficient network for small target detection. IEEE Access 7, 85771–85781 (2019)
Chen, C., Zhong, J., Tan, Y.: Multiple-oriented and small object detection with convolutional neural networks for aerial image. Remote Sensing 11(18), 2176 (2019)
Zhong, J., Lei, T., Yao, G.: Robust vehicle detection in aerial images based on cascaded convolutional neural networks. Sensors 17(12), 2720 (2017)
Razakarivony, S., & Jurie, F. Vehicle detection in aerial imagery (Vedai) : A benchmark. J. Vis. Commun. Image Rep. hal-01122605v2. (2015).
Acknowledgements
The research funding is provided by Industry-Driven Innovation Grant (IDIG) by Universiti Malaya with project number PPSI-2020-CLUSTER-SD01.
Author information
Authors and Affiliations
Contributions
Md Abdul Momin and Anis Salwa wrote the main manuscript. Mohamad Haniff Junos and Mohamad Sofian Abu Talip prepared the figures and tables. All authors reviewed the manuscript
Corresponding author
Ethics declarations
Conflict of interest
The authors declared that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Momin, M.A., Junos, M.H., Mohd Khairuddin, A.S. et al. Lightweight CNN model: automated vehicle detection in aerial images. SIViP 17, 1209–1217 (2023). https://doi.org/10.1007/s11760-022-02328-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02328-7