[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3594315.3594327acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

A Sea-Surface Target Detection Method based on Weighted Aggregation Network

Published: 02 August 2023 Publication History

Abstract

Aiming at the problem that the detection performance is easily affected by objective conditions such as light environment and background noise in the complicated ocean environment and there is image scale transformation in congested scenarios and loss of deep semantic information during forwarding, this paper proposed an actual sea surface target detection algorithm combined with the Convolutional Block Attention Module (CBAM) and the Bidirectional Feature Pyramid Network (BiFPN). Firstly, an efficient weighted feature aggregation network was constructed based on the YOLOv5s model by adding the skip connection between input and output nodes, which integrated the deep and shallow semantic information and achieved a unified description of the cross-scale features. On the basis of original feature network, a series of attention feature map information with channel and space two dimensions was generated for meeting the needs of surface target detection for environmental noise suppression and enhancing the feature expression ability of the network in complex backgrounds. And then, for improving the training speed and inference accuracy, the SIoU loss function was introduced as the location loss, which redefined the penalty metrics considering the vector angle between the required regressions to ensure the accurate localization of the prediction box. Finally, the validation was carried out on the actual sea image dataset provided by the Key Laboratory of Marine Intelligent Equipment and System, Ministry of Education. The experimental results showed that the improved model had higher detection accuracy than the YOLOv5s model, the [email protected]:0.95 was increased by 1.8 percentage points, and the speed reached 77FPS. The model showed good generalization performance, and it could meet the requirements of both target accuracy and real-time detection, which could be effectively applied to the ship intelligent perception field.

References

[1]
Redmon Joseph, Santosh Divvala, Ross Girshick, (2016). “You only look once: Unified, real-time object detection”. Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 779-788.
[2]
Redmon Joseph and Ali Farhadi. (2018). “Yolov3: An incremental improvement”. arXiv preprint arXiv:1804.02767.
[3]
Bochkovskiy Alexey, Chien-Yao Wang and Hong-Yuan Mark Liao. (2020). “Yolov4: Optimal speed and accuracy of object detection”. arXiv preprint arXiv:2004.10934.
[4]
Chuyi Li, Lulu Li, Hongliang Jiang, (2022). “YOLOv6: A single-stage object detection framework for industrial applications”. arXiv preprint arXiv:2209.02976.
[5]
Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao. (2022). “YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors”. arXiv preprint arXiv:2207.02696.
[6]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, (2016). “Ssd: Single shot multibox detector”. Proceedings of European Conference on Computer Vision. pp.21-37.
[7]
Ross Girshick, Jeff Donahue, Trevor Darrell, (2014). “Rich feature hierarchies for accurate object detection and semantic segmentation”. Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 580-587.
[8]
Ross Girshick. (2015). “Fast r-cnn”. Proceedings of the IEEE international conference on computer vision. pp. 1440-1448.
[9]
Xudong Sun, Pengcheng Wu and Steven CH Hoi. (2018). “Face detection using deep learning: An improved faster RCNN approach”. Neurocomputing, 299:42-50.
[10]
Liang Jin and Guodong Liu. (2021). “An approach on image processing of deep learning based on improved ssd”. Symmetry, 13(3):495.
[11]
Jeremiah W. Johnson. (2018). “Adapting mask-rcnn for automatic nucleus segmentation”. arXiv preprint arXiv:1805.00500.
[12]
Yunong Tian, Guodong Yang, Zhe Wang, (2019). “Apple detection during different growth stages in orchards using the improved YOLO-V3 model”. Computers and electronics in agriculture, 157:417-426.
[13]
Haiyan Yu, Yu Li, and Dexian Zhang. (2021). “An Improved YOLOv3 Small-Scale Ship Target Detection Algorithm”. Proceedings of 2021 6th International Conference on Smart Grid and Electrical Automation. pp. 560-563.
[14]
Junchi Zhou, Ping Jiang, Airu Zou, (2021). “Ship Target Detection Algorithm Based on Improved YOLOv5”. Journal of Marine Science and Engineering, 9(8). 908.
[15]
Dehai Chen, Shiru Sun, Zhijun Lei, (2021). “Ship target detection algorithm based on improved YOLOv3 for maritime image”. Journal of Advanced Transportation, 2021:1-11.
[16]
Mingxing Tan, Ruoming Pang, and Quoc V. Le. (2020). “Efficientdet: Scalable and efficient object detection”. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 10781-10790.
[17]
Sanghyun Woo, Jongchan Park, Joon-Young Lee, (2018). “CBAM: Convolutional Block Attention Module”. Proceedings of European Conference on Computer Vision. pp. 3-19.
[18]
Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, (2019). “Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression”. Proceedings of 2019 IEEE/CVF conference on computer vision and pattern recognition. pp. 658-666.
[19]
Zhaohui Zheng, Ping Wang, Wei Liu, (2020). “Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression”. Proceedings of the AAAI conference on artificial intelligence, 34(07):12993-13000.
[20]
Zhora Gevorgyan. (2022). “SIoU Loss: More Powerful Learning for Bounding Box Regression”. arXiv preprint arXiv:2205.12740.
[21]
Yi Huang, Hongdong Wang, Jilin Ma, (2021). “Research and Practical Exploration of Test and Validation Technologies Applied on Unmanned Surface Vehicle Optical Recognition”. Proceedings of 2021 IEEE International Conference on Unmanned Systems. pp. 976-981.
[22]
Yuguang Shi, Yu Guo, Zhenqiang Mi, (2022). “Stereo CenterNet-based 3D object detection for autonomous driving”. Neurocomputing, 471:219-229.

Index Terms

  1. A Sea-Surface Target Detection Method based on Weighted Aggregation Network

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence
    March 2023
    824 pages
    ISBN:9781450399029
    DOI:10.1145/3594315
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 August 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCAI 2023

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 31
      Total Downloads
    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 03 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media