Channel Self-Attention Based Multiscale Spatial-Frequency Domain Network for Oriented Object Detection in Remote Sensing Imagery

被引:0
|
作者
Xu, Yang [1 ]
Pan, Yushan [1 ]
Wu, Zebin [1 ]
Wei, Zhihui [1 ]
Zhan, Tianming [2 ,3 ]
机构
[1] Nanjing Univ Sci & Technol NJUST, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Audit Univ, Jiangsu Key Construct Lab Audit Informat Engn, Nanjing 211815, Peoples R China
[3] Nanjing Audit Univ, Sch Informat Engn, Nanjing 211815, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Frequency-domain analysis; Detectors; Remote sensing; Object detection; Data mining; Attention mechanisms; Wavelet transforms; Convolution; Semantics; Fusion features; Haar wavelet transform; oriented object detection; remote sensing imagery; spatial-frequency domain;
D O I
10.1109/TGRS.2024.3500013
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The detection of oriented objects in remote sensing images remains a daunting challenge due to their complex backgrounds, various sizes, and especially arbitrary orientations. However, most of the existing methods only model the structural features of the images in the spatial domain, while the horizontal convolution kernels limit the model's ability to perceive object direction information. Furthermore, the frequency features contain rich information about scale, texture, and angle, which can be a good complement to the spatial features. Inspired by this, we propose a multiscale spatial-frequency domain network (MSFN) to utilize spatial-frequency information for oriented object detection, which can be integrated into any convolutional neural network (CNN) architectures seamlessly and perform end-to-end training easily. Firstly, multiscale Haar wavelet transforms are leveraged to extract the multiscale frequency domain features from the image. Subsequently, channel alignment feature fusion module (CA-FFM) is proposed to fuse the high-level semantic features extracted by CNN with the low-level texture features extracted by the wavelet transform in multiscale. Finally, a channel self-attention (CSA)-based spatial-frequency feature perception module (SFPM) is designed to perform self-attention weighted aggregation on the fused features along the channel dimension, thereby constructing a novel spatial-frequency feature extraction backbone network for oriented object detector in remote sensing images. Experimental results on the DOTA and HRSC2016 datasets validate the effectiveness and universality of the proposed method.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] A Strip Steel Surface Defect Salient Object Detection Based on Channel, Spatial and Self-Attention Mechanisms
    Sun, Yange
    Geng, Siyu
    Guo, Huaping
    Zheng, Chengyi
    Zhang, Li
    ELECTRONICS, 2024, 13 (21)
  • [32] Few-Shot Object Detection in Remote Sensing Images With Multiscale Spatial Selective Attention
    Yu, Yingnan
    Chen, Si-Bao
    Huang, Li-Li
    Tang, Jin
    Luo, Bin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 1
  • [33] Progressive Domain Adaptive Object Detection Based on Self-Attention in Foggy Weather
    Lin, Meng
    Zhou, Gang
    Yang, Yawei
    Shi, Jun
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2023, 18 (12) : 1923 - 1931
  • [34] Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery
    Chen, Luchang
    Liu, Chunsheng
    Chang, Faliang
    Li, Shuang
    Nie, Zhaoying
    NEUROCOMPUTING, 2021, 451 : 67 - 80
  • [35] Joint self-attention and branch sampling for object detection on drone imagery
    Zhang Y.
    Wu C.
    Liu Y.
    Zhang T.
    Zheng Y.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (18): : 2723 - 2735
  • [36] A self-attention capsule feature pyramid network for water body extraction from remote sensing imagery
    Yu, Yongtao
    Yao, Yuting
    Guan, Haiyan
    Li, Dilong
    Liu, Zuojun
    Wang, Lanfang
    Yu, Changhui
    Xiao, Shaozhang
    Wang, Wenhao
    Chang, Lv
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (05) : 1801 - 1822
  • [37] Robust Building Extraction for High Spatial Resolution Remote Sensing Images with Self-Attention Network
    Zhou, Dengji
    Wang, Guizhou
    He, Guojin
    Long, Tengfei
    Yin, Ranyu
    Zhang, Zhaoming
    Chen, Sibao
    Luo, Bin
    SENSORS, 2020, 20 (24) : 1 - 19
  • [38] Object Detection For Remote Sensing Image Based on Multiscale Feature Fusion Network
    Tian Tingting
    Yang Jun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (16)
  • [39] Single Shot Anchor Refinement Network for Oriented Object Detection in Optical Remote Sensing Imagery
    Bao, Songze
    Zhong, Xing
    Zhu, Ruifei
    Zhang, Xiaonan
    Li, Zhuqiang
    Li, Mengyang
    IEEE ACCESS, 2019, 7 : 87150 - 87161
  • [40] Optical Remote Sensing Image Cloud Detection with Self-Attention and Spatial Pyramid Pooling Fusion
    Pu, Weihua
    Wang, Zhipan
    Liu, Di
    Zhang, Qingling
    REMOTE SENSING, 2022, 14 (17)