Remote Sensing Object Detection Based on Convolution and Swin Transformer

Cited by: 14
Authors
Jiang, Xuzhao [1]
Wu, Yonghong [1]
Affiliation
[1] Wuhan Univ Technol, Dept Stat, Wuhan 430070, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object detection; Feature extraction; Transformers; Remote sensing; Prediction algorithms; Detection algorithms; Classification algorithms; Remote sensing images; attention mechanism; swin transformer; multi-scale features
DOI
10.1109/ACCESS.2023.3267435
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Remote sensing object detection is an essential task for surveying the earth. Detection algorithms designed for natural scenes struggle to achieve satisfactory results on remote sensing images. In this paper, the RAST-YOLO (You Only Look Once with Region Attention and Swin Transformer) algorithm is proposed to address the main difficulties of remote sensing object detection: large differences in target scale, complex backgrounds, and tightly arranged small targets. To widen the range of information interaction in the feature map, make full use of the background information around an object, and improve detection accuracy for objects with complex backgrounds, a Region Attention (RA) mechanism combined with a Swin Transformer is proposed as the feature-extraction backbone. To improve the detection accuracy of small objects, the C3D module is used to fuse deep and shallow semantic information and to mitigate the multi-scale problem of remote sensing targets. To evaluate RAST-YOLO, extensive experiments are performed on the DIOR and TGRS-HRRSD datasets. The experimental results show that RAST-YOLO achieves state-of-the-art detection accuracy with high efficiency and robustness. Specifically, compared with the baseline network, the mean average precision (mAP) is improved by 5% and 2.3% on the DIOR and TGRS-HRRSD datasets, respectively, demonstrating the effectiveness of RAST-YOLO. Moreover, the lightweight structure of RAST-YOLO maintains real-time detection speed while delivering these detection results.
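The abstract's remark about the "information interaction range" of the feature map is easier to picture with a small sketch of the window partitioning used by a generic Swin Transformer, where self-attention is computed only inside non-overlapping windows and a cyclic shift lets the next layer mix information across window borders. The sketch below is illustrative only: it is not the paper's RA mechanism or the RAST-YOLO backbone, and the function names `window_partition` and `shift_feature_map`, the 8x8 toy map, and the window size of 4 are all assumptions chosen for the example.

```python
import numpy as np

def window_partition(feature_map, window_size):
    """Split an (H, W, C) feature map into non-overlapping square windows.

    Returns an array of shape (num_windows, window_size, window_size, C).
    H and W are assumed to be divisible by window_size (hypothetical helper).
    """
    h, w, c = feature_map.shape
    assert h % window_size == 0 and w % window_size == 0
    x = feature_map.reshape(h // window_size, window_size,
                            w // window_size, window_size, c)
    # Bring the two window-grid axes together, then flatten them.
    x = x.transpose(0, 2, 1, 3, 4)
    return x.reshape(-1, window_size, window_size, c)

def shift_feature_map(feature_map, shift):
    """Cyclically shift the map so a new partition straddles the old borders."""
    return np.roll(feature_map, shift=(-shift, -shift), axis=(0, 1))

# Toy 8x8 single-channel map whose values are the flattened pixel index.
fmap = np.arange(64, dtype=np.float32).reshape(8, 8, 1)

# Regular partition: four 4x4 windows; attention would stay inside each one.
windows = window_partition(fmap, window_size=4)

# Shifted partition: the first window now covers pixels from all four
# original windows, which is how information crosses window boundaries.
shifted_windows = window_partition(shift_feature_map(fmap, shift=2), window_size=4)
```

In this toy setting each pixel only interacts with the 16 pixels of its own window per layer, so alternating regular and shifted partitions is what gradually enlarges the interaction range across layers.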
Pages: 38643-38656
Page count: 14