LS-YOLO: A Lightweight Selective YOLOv8 Algorithm for UAV Aerial Photography

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 15042))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

225 Accesses

Abstract

Object detection for unmanned aerial vehicles (UAV) aerial photography presents challenges such as tiny and densely distributed objects, and unbalanced categories. Furthermore, the hardware limitations of UAV restrict the scalability of models, leading to reduced accuracy. In response to these challenges, an enhanced YOLOv8m model which incorporates multiple lightweight strategies is proposed. Specifically, GDC (Ghost Dynamic Conv) is introduced into the backbone network to improve feature extraction, and more features are generated with fewer parameters to achieve efficient feature extraction. Additionally, the feature fusion mechanism has been optimized, and the LS-FPN-PAN feature fusion mechanism has been devised to globally reduce the number of feature channels and amount of calculation. Through adaptive feature selection, the channel weight was given to achieve better fusion. Furthermore, a lightweight selective detection head was proposed, and shared convolution was employed to facilitate the learning of target features by three detection heads. The WMPDIoU loss function was designed to reduce the penalty caused by the geometric factors of the detection box of tiny objects. The cost-free approach of substituting NMS function and implementing knowledge distillation is employed to enhance the model’s performance. The experimental results show that the model size and parameter number of the improved model are only 42.1\(\%\) and 55.1\(\%\) of the original model, but the performance is considerably improved. On the Visdrone2019 test dataset, P, mAP@0.5, mAP@0.5:0.95 are increased by 12.9\(\%\), 26.5\(\%\) and 38.8\(\%\) respectively, indicating a successful realization of lightweight design with enhanced performance capabilities suitable for effective application in object detection tasks on UAV platforms.

Supported by the Ministry of Education (20230104440, 2023122991102), and Priority Academic Program Development of Jiangsu Higher Education Institutions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 54.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 69.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Lightweight Network for Detecting Small Targets in the Air

Small Object Detection in UAV Images Based on YOLOv8n

Article Open access 26 August 2024

Htfd-yolo: Small target detection in drone aerial photography based on YOLOv8s

Article 25 February 2025

References

Li, Y., Fan, Q., Huang, H., Han, Z., Gu, Q.: A modified YOLOv8 detection network for UAV aerial image recognition. Drones 7(5), 304 (2023)
Article MATH Google Scholar
Lou, H., Duan, X., Guo, J., Liu, H., Gu, J., Bi, L., Chen, H.: DC-YOLOv8: small-size object detection algorithm based on camera sensor. Electronics 12(10), 2323 (2023)
Article Google Scholar
Guo, J., Lou, H., Chen, H., Liu, H., Gu, J., Bi, L., Duan, X.: A new detection algorithm for alien intrusion on highway. Sci. Rep. 13(1), 10667 (2023)
Article Google Scholar
Wang, G., Chen, Y., An, P., Hong, H., Hu, J., Huang, T.: UAV-YOLOv8: a small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios. Sensors 23(16), 7190 (2023)
Article Google Scholar
Han, K., Wang, Y., Guo, J., Wu, E.: ParameterNet: parameters are all you need for large-scale visual pretraining of mobile networks (2023). arXiv:2306.14525
Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., Xu, C.: Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1580–1589(2020)
Google Scholar
Chen, Y., Zhang, C., Chen, B., Huang, Y., Sun, Y., Wang, C., Fu, X., Dai, Y., Qin, F., Peng, Y., et al.: Accurate leukocyte detection based on deformable-DETR and multi-level feature fusion for aiding diagnosis of blood diseases. Comput. Biol. Med. 170, 107917 (2024)
Article MATH Google Scholar
Liu, W., Lu, H., Fu, H., Cao, Z.: Learning to Upsample by Learning to Sample. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6027–6037 (2023)
Google Scholar
Tian, Z., Shen, C., Chen, H., He, T.: FCOS: a simple and strong anchor-free object detector. IEEE Trans. Pattern Anal. Mach. Intell. 44(4), 1922–1933 (2020)
MATH Google Scholar
Tong, Z., Chen, Y., Xu, Z., Yu, R.: Wise-IoU: bounding box regression loss with dynamic focusing mechanism (2023). arXiv:2301.10051
Siliang, M., Yong, X.: MPDIoU: a loss for efficient and accurate bounding box regression (2023). arXiv:2307.07662
Bodla, N., Singh, B., Chellappa, R., Davis, L. S.: Soft-NMS–improving object detection with one line of code. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5561–5569 (2017)
Google Scholar
Shu, C., Liu, Y., Gao, J., Yan, Z., Shen, C.: Channel-wise knowledge distillation for dense prediction. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5311–5320 (2021)
Google Scholar
Du, D., Zhu, P., Wen, L., Bian, X., Lin, H., Hu, Q., Zhang, L.: VisDrone-DET2019: the vision meets drone object detection in image challenge results. In: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (2019)
Google Scholar
Zhang, Z., Yi, H.-H., Zheng, J.: Focusing on small objects detector in aerial images. Acta Electonica Sinica 51(4), 944–955 (2023)
MATH Google Scholar
Hsieh, M.R., Lin, Y.L., Hsu, W.H.: Drone-based object counting by spatially regularized regional proposal network. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4145–4153 (2017)
Google Scholar
Wu, X., Li, W., Hong, D., Tao, R., Du, Q.: Deep learning for unmanned aerial vehicle-based object detection and tracking: a survey. IEEE Geosci. Remote Sens. Mag. 10(1), 91–124 (2021)
Article MATH Google Scholar
Feng, C., Zhong, Y., Gao, Y., Scott, M. R., Huang, W.: Tood: task-aligned one-stage object detection. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3490–3499. IEEE Computer Society (2021)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Soochow University, Suzhou, 215006, China
Wei Pan & Zhe Yang

Authors

Wei Pan
View author publications
You can also search for this author in PubMed Google Scholar
Zhe Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhe Yang .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Zhouchen Lin
Nankai University, Tianjin, China
Ming-Ming Cheng
Chinese Academy of Sciences, Beijing, China
Ran He
Xinjiang University, Ürümqi, Xinjiang, China
Kurban Ubul
Xinjiang University, Ürümqi, China
Wushouer Silamu
Peking University, Beijing, China
Hongbin Zha
Tsinghua University, Beijing, China
Jie Zhou
Chinese Academy of Sciences, Beijing, China
Cheng-Lin Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pan, W., Yang, Z. (2025). LS-YOLO: A Lightweight Selective YOLOv8 Algorithm for UAV Aerial Photography. In: Lin, Z., et al. Pattern Recognition and Computer Vision. PRCV 2024. Lecture Notes in Computer Science, vol 15042. Springer, Singapore. https://doi.org/10.1007/978-981-97-8858-3_13

Download citation

DOI: https://doi.org/10.1007/978-981-97-8858-3_13
Published: 03 November 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-8857-6
Online ISBN: 978-981-97-8858-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics