Strip Object Detection Method for Multiscale Optical Remote Sensing Images Without Anchors

被引:0
|
作者
Qi, He [1 ]
Hao, Shen [1 ]
机构
[1] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
关键词
remote sensing image; strip object detection; images without anchors; attention mechanism;
D O I
10.3788/LOP241242
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A multiscale remote sensing image strip target detection method without anchor points is proposed to address the issue of the poor detection performance of strip targets in optical remote sensing image target detection. Accordingly, an atrous spatial pyramid pooling feature extraction network that integrates a coordinate attention module and strip pooling module is designed in this study. Consequently, the extraction of strip target feature information is significantly improved. The proposed method was validated through effectiveness experiments on the DIOR dataset, and the optimal dilation rate was selected. Based on this, ablation experiments were conducted to verify the feasibility of the designed framework. Comparative experiments were further conducted considering four classic object detection methods. Compared with the classic anchor-free method CenterNet, the average precision mean of the proposed method improves by 8. 63 percentage points and 11. 45 percentage points at thresholds of 0. 5 and 0. 75, respectively. Moreover, compared with the other three methods, the proposed method is competitive. The experimental results show that the proposed method is more accurate in locating strip targets (such as bridges and airports) in optical remote sensing images and has better detection performance.
引用
收藏
页数:11
相关论文
共 33 条
  • [1] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [2] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
    Dai, Zhigang
    Cai, Bolun
    Lin, Yugeng
    Chen, Junying
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1601 - 1610
  • [3] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [4] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [5] He KM, 2015, Arxiv, DOI arXiv:1512.03385
  • [6] Coordinate Attention for Efficient Mobile Network Design
    Hou, Qibin
    Zhou, Daquan
    Feng, Jiashi
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13708 - 13717
  • [7] Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
    Hou, Qibin
    Zhang, Li
    Cheng, Ming-Ming
    Feng, Jiashi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4002 - 4011
  • [8] Hu Z H, 2024, Laser & Optoelectronics Progress
  • [9] DETR with Improved DeNoising Training for Multi-Scale Oriented Object Detection in Optical Remote Sensing Images (Invited)
    Jin Ruijiao
    Wang Kun
    Liu Minhao
    Teng Xichao
    Li Zhang
    Yu Qifeng
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (02)
  • [10] Jocher G, 2024, YOLO by ultralyticsEB/ OL