Swin-Transformer-Based YOLOv5 for Small-Object Detection in Remote Sensing Images

被引:10
|
作者
Cao, Xuan [2 ]
Zhang, Yanwei [1 ]
Lang, Song [1 ]
Gong, Yan [1 ,3 ]
机构
[1] Chinese Acad Sci, Suzhou Inst Biomed Engn & Technol, Suzhou 215613, Peoples R China
[2] Suzhou Univ Sci & Technol, Sch Phys Sci & Technol, Suzhou 215009, Peoples R China
[3] Jinan Guoke Med Technol Dev Co Ltd, Jinan 250104, Peoples R China
基金
中国国家自然科学基金;
关键词
Swin Transformer; YOLOv5; multi-scale feature fusion; attention mechanism; small-object detection; remote sensing;
D O I
10.3390/s23073634
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This study aimed to address the problems of low detection accuracy and inaccurate positioning of small-object detection in remote sensing images. An improved architecture based on the Swin Transformer and YOLOv5 is proposed. First, Complete-IOU (CIOU) was introduced to improve the K-means clustering algorithm, and then an anchor of appropriate size for the dataset was generated. Second, a modified CSPDarknet53 structure combined with Swin Transformer was proposed to retain sufficient global context information and extract more differentiated features through multi-head self-attention. Regarding the path-aggregation neck, a simple and efficient weighted bidirectional feature pyramid network was proposed for effective cross-scale feature fusion. In addition, extra prediction head and new feature fusion layers were added for small objects. Finally, Coordinate Attention (CA) was introduced to the YOLOv5 network to improve the accuracy of small-object features in remote sensing images. Moreover, the effectiveness of the proposed method was demonstrated by several kinds of experiments on the DOTA (Dataset for Object detection in Aerial images). The mean average precision on the DOTA dataset reached 74.7%. Compared with YOLOv5, the proposed method improved the mean average precision (mAP) by 8.9%, which can achieve a higher accuracy of small-object detection in remote sensing images.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Swin-Transformer-Enabled YOLOv5 with Attention Mechanism for Small Object Detection on Satellite Images
    Gong, Hang
    Mu, Tingkui
    Li, Qiuxia
    Dai, Haishan
    Li, Chunlai
    He, Zhiping
    Wang, Wenjing
    Han, Feng
    Tuniyazi, Abudusalamu
    Li, Haoyang
    Lang, Xuechan
    Li, Zhiyuan
    Wang, Bin
    [J]. REMOTE SENSING, 2022, 14 (12)
  • [2] Small-object detection based on YOLOv5 in autonomous driving systems
    Mahaur, Bharat
    Mishra, K. K.
    [J]. PATTERN RECOGNITION LETTERS, 2023, 168 : 115 - 122
  • [3] Improved YOLOv5 Object Detection Algorithm for Remote Sensing Images
    Yang, Chen
    She, Lu
    Yang, Lu
    Feng, Zixian
    [J]. Computer Engineering and Applications, 2023, 59 (15) : 76 - 86
  • [4] Improved YOLOv5 Based on Hybrid Domain Attention for Small Object Detection in Optical Remote Sensing Images
    Deng, Tianmin
    Liu, Xuhui
    Mao, Guotao
    [J]. ELECTRONICS, 2022, 11 (17)
  • [5] Small target detection with remote sensing images based on an improved YOLOv5 algorithm
    Pei, Wenjing
    Shi, Zhanhao
    Gong, Kai
    [J]. FRONTIERS IN NEUROROBOTICS, 2023, 16
  • [6] C3TB-YOLOv5: integrated YOLOv5 with transformer for object detection in high-resolution remote sensing images
    Wu, Qinggang
    Li, Yang
    Huang, Wei
    Chen, Qiqiang
    Wu, Yonglei
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (08) : 2622 - 2650
  • [7] Remote Sensing Object Detection Based on Convolution and Swin Transformer
    Jiang, Xuzhao
    Wu, Yonghong
    [J]. IEEE ACCESS, 2023, 11 : 38643 - 38656
  • [8] OAB-YOLOv5: One-Anchor-Based YOLOv5 for Rotated Object Detection in Remote Sensing Images
    Liu, Jie
    Qiao, Wensheng
    Xiong, Zhaolong
    [J]. JOURNAL OF SENSORS, 2022, 2022
  • [9] Aircraft Target Detection in Remote Sensing Images Based on Improved YOLOv5
    Luo, Shun
    Yu, Juan
    Xi, Yunjiang
    Liao, Xiao
    [J]. IEEE ACCESS, 2022, 10 : 5184 - 5192
  • [10] CNTR-YOLO: Improved YOLOv5 Based on ConvNext and Transformer for Aircraft Detection in Remote Sensing Images
    Zhou, Fengyun
    Deng, Honggui
    Xu, Qiguo
    Lan, Xin
    [J]. ELECTRONICS, 2023, 12 (12)