Channel Self-Attention Based Multiscale Spatial-Frequency Domain Network for Oriented Object Detection in Remote Sensing Imagery

被引:0
|
作者
Xu, Yang [1 ]
Pan, Yushan [1 ]
Wu, Zebin [1 ]
Wei, Zhihui [1 ]
Zhan, Tianming [2 ,3 ]
机构
[1] Nanjing Univ Sci & Technol NJUST, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Audit Univ, Jiangsu Key Construct Lab Audit Informat Engn, Nanjing 211815, Peoples R China
[3] Nanjing Audit Univ, Sch Informat Engn, Nanjing 211815, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Frequency-domain analysis; Detectors; Remote sensing; Object detection; Data mining; Attention mechanisms; Wavelet transforms; Convolution; Semantics; Fusion features; Haar wavelet transform; oriented object detection; remote sensing imagery; spatial-frequency domain;
D O I
10.1109/TGRS.2024.3500013
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The detection of oriented objects in remote sensing images remains a daunting challenge due to their complex backgrounds, various sizes, and especially arbitrary orientations. However, most of the existing methods only model the structural features of the images in the spatial domain, while the horizontal convolution kernels limit the model's ability to perceive object direction information. Furthermore, the frequency features contain rich information about scale, texture, and angle, which can be a good complement to the spatial features. Inspired by this, we propose a multiscale spatial-frequency domain network (MSFN) to utilize spatial-frequency information for oriented object detection, which can be integrated into any convolutional neural network (CNN) architectures seamlessly and perform end-to-end training easily. Firstly, multiscale Haar wavelet transforms are leveraged to extract the multiscale frequency domain features from the image. Subsequently, channel alignment feature fusion module (CA-FFM) is proposed to fuse the high-level semantic features extracted by CNN with the low-level texture features extracted by the wavelet transform in multiscale. Finally, a channel self-attention (CSA)-based spatial-frequency feature perception module (SFPM) is designed to perform self-attention weighted aggregation on the fused features along the channel dimension, thereby constructing a novel spatial-frequency feature extraction backbone network for oriented object detector in remote sensing images. Experimental results on the DOTA and HRSC2016 datasets validate the effectiveness and universality of the proposed method.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Arbitrary-Oriented Dense Object Detection in Remote Sensing Imagery
    Chen Yingxue
    Ding Wenrui
    Li Hongguang
    Wang Yufeng
    Liu Shuo
    Xiao, Zhifeng
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 436 - 440
  • [42] Oriented Object Detection by Searching Corner Points in Remote Sensing Imagery
    Chen, Xueqing
    Ma, Li
    Du, Qian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [43] Self-Attention Guidance and Multiscale Feature Fusion-Based UAV Image Object Detection
    Zhang, Yunzuo
    Wu, Cunyu
    Zhang, Tian
    Liu, Yameng
    Zheng, Yuxin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [44] CAG-FPN: CHANNEL SELF-ATTENTION GUIDED FEATURE PYRAMID NETWORK FOR OBJECT DETECTION
    Chang, Jie
    Dai, Huhe
    Zheng, Yuan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 9616 - 9620
  • [45] TSMGA: Temporal-Spatial Multiscale Graph Attention Network for Remote Sensing Change Detection
    Zhang, Xiaoyang
    Yuan, Genji
    Hua, Zhen
    Li, Jinjiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 3696 - 3712
  • [46] Multigranularity Self-Attention Network for Fine-Grained Ship Detection in Remote Sensing Images
    Ouyang, Lihan
    Fang, Leyuan
    Ji, Xinyu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9722 - 9732
  • [47] Self-attention module and FPN-based remote sensing image target detection
    Zhongyu Li
    Huajun Wang
    Hengfei Zhong
    Yuting Dai
    Arabian Journal of Geosciences, 2021, 14 (23)
  • [48] Cloud Detection From Remote Sensing Imagery Based on Domain Translation Network
    Guo, Jianhua
    Yang, Jingyu
    Yue, Huanjing
    Chen, Yang
    Hou, Chunping
    Li, Kun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [49] Cloud Detection from Remote Sensing Imagery Based on Domain Translation Network
    Guo, Jianhua
    Yang, Jingyu
    Yue, Huanjing
    Chen, Yang
    Hou, Chunping
    Li, Kun
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [50] FSAU-Net: a network for extracting buildings from remote sensing imagery using feature self-attention
    Hu, Minghong
    Li, Jiatian
    Xiaohui, A.
    Zhao, Yunfei
    Lu, Mei
    Li, Wen
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (05) : 1643 - 1664