[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3462676.3462683acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiceccConference Proceedingsconference-collections
research-article

Improved Detection and Recognition of Sea Surface Ships Based on YOLOv3

Published: 07 September 2021 Publication History

Abstract

In order to improve the accuracy and real-time performance of ship target detection on the sea surface, an improved ship target detection method based on the YOLOv3 algorithm is proposed. Firstly, add the CBAM attention mechanism layer to the network prediction layer to improve the expression of the correlation between the feature map target information and important channels and space; secondly, the spatial pyramid pooling (SPP) module is introduced in YOLOv3 to solve the problem of information loss and scale inconsistency; thirdly, GIoU is selected as the loss function to improve the accuracy of the positioning information of the target prediction frame; then the adaptive spatial feature fusion (ASFF) method is used in the prediction layer to make full use of features of different scales to improve the accuracy of target detection; finally refer to the PASCAL VOC data setting format, established a sea surface ship dataset containing 6041 pictures, and manually annotated it for network training. The test results show that compared with YOLOv3, the improved algorithm can effectively improve the accuracy and speed of ship detection on the sea surface. The detection accuracy (mAP) of ship targets on the sea surface has increased by 2.97% to 94.17%, and the frame rate has reached 48fps. Meet the requirements of real-time detection of ship targets.

References

[1]
Wang Xiaoyu. Research on detection technology of surface unmanned craft based on deep learning[D]. Jiangsu University of Science and Technology, 2019.
[2]
Fukushima K . Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position[J]. Biological Cybernetics, 1980, 36(4):193-202.
[3]
Krizhevsky A, Sutskever I, Hinton G. Image Net Classification with Deep Convolutionalal Neural Networks[C]//Pereira F, Burges CJC, Bottou.L, Advances in Neural Information Processing Systems 25. Lake Tahoe, Nevada, USA: Curran Associates Inc. 2012, 1097-1105.
[4]
Girshick R,Donahue J,Daarrell T,et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR).2014.
[5]
Girshick R. Fast r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440-1448.
[6]
Ren Shaoqing,He Kaiming,Girshick Ross,Sun Jian. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.[J]. IEEE transactions on pattern analysis and machine intelligence,2017,39(6).
[7]
HE Kaiming, ZHANG Xiangyu, REN Shaoqing,et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015,37(9): 1904–1916.
[8]
He K, Gkioxari G, Dollár P, Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.
[9]
Redmon J, Divvala S, Girshick R, You only look once: Unified, real-time object detection[C]// Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779-788.
[10]
Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263-7271.
[11]
Liu W, Anguelov D, Erhan D, Ssd: Single shot multibox detector[C]//European conference on computer vision. Springer, Cham, 2016: 21-37.
[12]
Fu C Y, Liu W, Ranga A, Dssd: Deconvolutional single shot detector[J]. arXiv preprint arXiv:1701.06659, 2017.
[13]
Li Z, Zhou F. FSSD: feature fusion single shot multibox detector[J]. arXiv preprint arXiv:1712.00960, 2017.
[14]
Lin T Y, Dollár P, Girshick R, Feature pyramid networks for object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2117-2125.
[15]
Redmon J,Farhadi A.Yolov3:An incremental improvement[J].arXiv preprint arXiv:1804.02767,2018.
[16]
Yu J, Jiang Y, Wang Z, Unitbox: An advanced object detection network[C]//Proceedings of the 24th ACM international conference on Multimedia. 2016: 516-520.
[17]
Rezatofighi H, Tsoi N, Gwak J Y, Generalized intersection over union: A metric and a loss for bounding box regression[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 658-666.
[18]
HE Kaiming, ZHANG Xiangyu, REN Shaoqing,et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015,37(9): 1904–1916.
[19]
Liu S, Huang D, Wang Y. Learning Spatial Fusion for Single-Shot Object Detection[EB/OL]. arXiv preprint arXiv:1911.09516, 2019.
[20]
Woo S, Park J, Lee J Y, CBAM:convolutional block attention module[C]// The European Conference on Compute

Cited By

View all
  • (2024)Two-stage ship detection at long distances based on deep learning and slicing techniquePLOS ONE10.1371/journal.pone.031314519:11(e0313145)Online publication date: 19-Nov-2024
  • (2023)Toward Enhanced Support for Ship SailingIEEE Access10.1109/ACCESS.2023.330380811(87047-87061)Online publication date: 2023
  • (2023)Sw-YoloXExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.119560217:COnline publication date: 1-May-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICECC '21: Proceedings of the 4th International Conference on Electronics, Communications and Control Engineering
April 2021
122 pages
ISBN:9781450389129
DOI:10.1145/3462676
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 September 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ASFF
  2. CBAM
  3. GIoU
  4. Keywords: Ship
  5. SPP

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Natural Science Foundation of China?Shandong Province Postgraduate Education Quality Curriculum Project?Shandong Province Postgraduate Education Joint Training Base Project

Conference

ICECC 2021

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Two-stage ship detection at long distances based on deep learning and slicing techniquePLOS ONE10.1371/journal.pone.031314519:11(e0313145)Online publication date: 19-Nov-2024
  • (2023)Toward Enhanced Support for Ship SailingIEEE Access10.1109/ACCESS.2023.330380811(87047-87061)Online publication date: 2023
  • (2023)Sw-YoloXExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.119560217:COnline publication date: 1-May-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media