[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3633637.3633665acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccprConference Proceedingsconference-collections
research-article

Remote Sensing Object Detection based on Attention and Feature Fusion

Published: 28 February 2024 Publication History

Abstract

In addressing the challenges of significant scale variation and high resemblance between the foreground and background, we present a one-stage remote sensing object detection strategy in this paper. Specifically, we have developed a multi-scale information mining module, integrated into the backbone network, to enhance feature representation capabilities and address the issue of vast scale variations in objects. Subsequently, we utilize a deep and shallow feature fusion module to harmonize shallow and deep features. This module not only effectively detects multi-scale objects but also improves the precision detection of smaller objects. To alleviate the problem of foreground and background similarities, we have incorporated a dual path attention mechanism into our feature pyramid networks. This adaptation enables the network to focus more intensively on object information. Comprehensive experiments on two distinct remote sensing object detection datasets, namely, DIOR and NWPU VHR-10, validate the efficacy of our proposed method. Furthermore, our approach demonstrates its superiority over current state-of-the-art methodologies, improving the baseline method by 3.3% and 15% respectively.

References

[1]
Huang X, He B, Tong M, “Few-shot object detection on remote sensing images via shared attention module and balanced fine-tuning strategy,” Remote Sensing, 2021, 13(19): 3816.
[2]
Wang P, Sun X, Diao W, “FMSSD: Feature-merged single-shot detection for multiscale objects in large-scale remote sensing imagery,” IEEE Transactions on Geoscience and Remote Sensing, 2019, 58(5): 3377-3390.
[3]
Shi, Huimin, “DPNET: Dual-Path Network for Efficient Object Detection with Lightweight Self-Attention,” 2022 IEEE International Conference on Image Processing (ICIP). IEEE, 2022.
[4]
Girshick, Ross, “Rich feature hierarchies for accurate object detection and semantic segmentation,” Proceedings of the IEEE conference on computer vision and pattern recognition. 2014.
[5]
R. Girshick, “Fast R-CNN,” in Proc. IEEE Int. Conf. Comput. Vis.,Dec. 2015, pp.1440–1448.
[6]
Sarker, M., Rashwan, H. A., Akram, F., : Slsdeep: skin lesion segmentation based on dilated residual and pyramid pooling networks. Springer, Cham (2018).
[7]
He K, Gkioxari G, Dollár P, “Mask r-cnn,” Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.
[8]
Bochkovskiy A, Wang C Y, Liao H Y M. “Yolov4: Optimal speed and accuracy of ob-ject detection,” arXiv preprint arXiv:2004.10934, 2020.
[9]
Liu W, Anguelov D, Erhan D, “Ssd: Single shot multibox detector,” Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37.
[10]
Law H, Deng J. “Cornernet: Detecting objects as paired keypoints,” Proceedings of the European conference on computer vision (ECCV). 2018: 734-750.
[11]
Wang J, Gong Z, Liu X, “Multi-Feature Information Complementary Detector: A High-Precision Object Detection Model for Remote Sensing Images,” Remote Sensing, 2022, 14(18): 4519.
[12]
Xue J, Zheng Y, Dong-Ye C, “Improved YOLOv5 network method for remote sensing image-based ground objects recognition,” Soft Computing, 2022, 26(20): 10879-10889.
[13]
Hu, Jie, Li Shen, and Gang Sun. “Squeeze-and-excitation networks,” Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.
[14]
Yang X, Zhao J, Zhang H, “Remote Sensing Image Detection Based on YOLOv4 Improvements,” IEEE Access, 2022, 10: 95527-95538.
[15]
Li K, Wan G, Cheng G, “Object detection in optical remote sensing images: A survey and a new benchmark,” ISPRS journal of photogrammetry and remote sensing, 2020, 159: 296-307.
[16]
Li K, Wan G, Cheng G, “Object detection in optical remote sensing images: A survey and a new benchmark,” ISPRS journal of photogrammetry and remote sensing, 2020, 159: 296-307.
[17]
Cheng G, Zhou P, Han J. “Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images,” IEEE Transactions on Geoscience and Remote Sensing, 2016, 54(12): 7405-7415.
[18]
He K, Zhang X, Ren S, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9): 1904- 1916.
[19]
Weber M, Wang H, Qiao S, “Deeplab2: A tensorflow library for deep labeling,” arXiv preprint arXiv:2106.09748, 2021.
[20]
Woo S, Park J, Lee J Y, “Cbam: Convolutional block attention module,” Proceedings of the European conference on computer vision (ECCV). 2018: 3-19.
[21]
Liu S, Huang D. “Receptive field block net for accurate and fast object detection,” Proceedings of the European conference on computer vision(ECCV). 2018: 385-400.
[22]
Li Q, Chen Y, Zeng Y. “Transformer with transfer CNN for remote-sensing-image object detection,” Remote Sensing, 2022, 14(4): 984.
[23]
Jiang S, Yao W, Wong M S, “An optimized deep neural network detecting small and narrow rectangular objects in Google Earth Images,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2020, 13:1068-1081.
[24]
L. Li, G. Cao, J. Liu, and Y . Tong, “Efficient detection in aerial images for resource-limited satellites,” IEEE Geosci. Remote Sens. Lett., vol. 19,2021, Art. no. 6001605.
[25]
Li, Q.; Chen, Y.; Zeng, Y. Transformer with Transfer CNN for Remote-Sensing-Image Object Detection. Remote Sens. 2022, 14, 984.
[26]
Olugboja A, Wang Z, Sun Y. Parallel convolutional neural networks for object detection[J]. Journal of Advances in Information Technology Vol, 2021, 12(4).

Index Terms

  1. Remote Sensing Object Detection based on Attention and Feature Fusion

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCPR '23: Proceedings of the 2023 12th International Conference on Computing and Pattern Recognition
    October 2023
    589 pages
    ISBN:9798400707988
    DOI:10.1145/3633637
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 February 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Dual path attention mechanism
    2. Remote sensing object detection
    3. deep and shallow feature fusion module
    4. multi scale information mining module

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCPR 2023

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 19
      Total Downloads
    • Downloads (Last 12 months)19
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 15 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media