Remote Sensing Target Detection Based on Multilevel Self- Attention Enhancement

被引:1
|
作者
Wei Xiegen [1 ,2 ]
Cao Lin [2 ,3 ]
Tian Shu [3 ]
Du Kangning [3 ]
Song Peiran [3 ]
Guo Yanan [3 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Instrument Sci & Optoelect Engn, Beijing 100101, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Key Lab, Minist Educ Optoelect Measurement Technol & Instr, Beijing 100101, Peoples R China
[3] Beijing Informat Sci & Technol Univ, Key Lab Informat & Commun Syst, Minist Informat Ind, Beijing 100101, Peoples R China
关键词
rotating object detection; remote sensing image; Swin Transformer; Gaussian distance;
D O I
10.3788/LOP223048
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Remote sensing image target detection technology has gained considerable attention with the improvement of remote sensing image resolution. This thesis proposes a remote sensing target detection algorithm based on multilevel local self- attention enhancement to solve such problems as complex background noise, arbitrary target direction, and large changes in target size in remote sensing images. First, the proposed algorithm adopts the Swin Transformer feature extraction module in an Oriented region- based convolutional neural network (R-CNN) backbone network, and the multilevel local information of feature-extracted semantic information is modeled using the Transformer module with shifted window operations and hierarchical design. Second, Oriented RPN is used to generate high-quality directed candidate boxes. Finally, the Kullback-Leibler divergence ( KLD) between Gaussian distributions is regarded as the regression loss function, allowing the parameter gradient to be dynamically adjusted based on the object's characteristics for more accurate regression of the detection boxes. The mean average precision (mAP) of the proposed algorithm reaches 77. 2% and 90. 6% on the DOTA dataset and HRSC2016 dataset, respectively, and it is increased by 1. 8 percentage points and 0. 5 percentage points compared with the Oriented R-CNN algorithm. The results reveal that the proposed algorithm can effectively advance the target detection accuracy of remote sensing images.
引用
收藏
页数:11
相关论文
共 25 条
  • [1] Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery
    Azimi, Seyed Majid
    Vig, Eleonora
    Bahmanyar, Reza
    Koerner, Marco
    Reinartz, Peter
    [J]. COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 150 - 165
  • [2] Ding J, 2018, Arxiv, DOI arXiv:1812.00155
  • [3] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [4] Align Deep Features for Oriented Object Detection
    Han, Jiaming
    Ding, Jian
    Li, Jie
    Xia, Gui-Song
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] Joo HB, 2017, INT C CONTR AUTOMAT, P345, DOI 10.23919/ICCAS.2017.8204463
  • [6] Li MJ, 2018, INT GEOSCI REMOTE SE, P673, DOI 10.1109/IGARSS.2018.8519094
  • [7] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [8] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [9] Rotation-Invariant Object Detection in Remote Sensing Images Based on Radial-Gradient Angle
    Lin, Yudong
    He, Hongjie
    Yin, Zhongke
    Chen, Fan
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (04) : 746 - 750
  • [10] Multi-Dimensional CNN Fused Algorithm for Hyperspectral Remote Sensing Image Classification
    Liu Jinxiang
    Ban Wei
    Chen Yu
    Sun Yaqin
    Zhuang Huifu
    Fu Erjiang
    Zhang Kefei
    [J]. CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2021, 48 (16):