Improved Architecture and Training Strategies of YOLOv7 for Remote Sensing Image Object Detection

Cited by: 0
Authors
Zhao, Dewei [1]
Shao, Faming [1]
Liu, Qiang [1]
Zhang, Heng [1]
Zhang, Zihan [1]
Yang, Li [1]
Affiliations
[1] Army Engn Univ PLA, Coll Field Engn, Nanjing 210007, Peoples R China
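Funding
National Natural Science Foundation of China
Keywords
remote sensing; object detection; improvement; YOLOv7; small object
DOI
10.3390/rs16173321
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract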
Object detection in remote sensing images is widely used in industry and everyday life, and improving detection accuracy is a pressing need. To that end, this paper analyzes the requirements and difficulties of remote sensing image detection and proposes a set of improvements to the widely used YOLOv7 algorithm. Specifically, we remove some standard convolution and pooling modules from the bottom of the network and adopt stride-free convolution, which reduces the loss of small-object information as features pass through the network. We also introduce a new, more efficient attention mechanism module for feature extraction, which strengthens the network's semantic extraction capability. In addition, multiple cross-layer connections make better use of the feature information from each layer of the backbone network, improving the network's overall feature extraction capability. During training, we introduce an auxiliary network to intensify the training of the underlying network, and we adopt a new activation function and a more efficient loss function to provide more effective gradient feedback, which further improves network performance. In experiments, the improved network achieves mAP scores of 91.2% and 80.8% on the DIOR and DOTA version 1.0 remote sensing datasets, respectively. These are improvements of 4.5% and 7.0% over the original YOLOv7 network, with the detection of small objects benefiting in particular.
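The abstract's point about stride and small objects can be illustrated with a little arithmetic. The sketch below is not the authors' architecture: the 640-pixel input, the 3x3 kernels with padding 1, the three-layer stems, and the 12-pixel object are all hypothetical values chosen for illustration. It only shows how many feature-map cells a small object still covers when the first layer's stride is 2 versus 1.

```python
from typing import Sequence


def conv_output_size(size: int, kernel: int, stride: int, padding: int) -> int:
    """Spatial size after one convolution or pooling layer (floor rounding)."""
    return (size + 2 * padding - kernel) // stride + 1


def feature_map_size(input_size: int, strides: Sequence[int],
                     kernel: int = 3, padding: int = 1) -> int:
    """Apply one layer per stride value and return the final spatial size."""
    size = input_size
    for stride in strides:
        size = conv_output_size(size, kernel, stride, padding)
    return size


def object_extent(object_px: float, strides: Sequence[int]) -> float:
    """Approximate width of an object, in feature-map cells, after the layers."""
    total_stride = 1
    for stride in strides:
        total_stride *= stride
    return object_px / total_stride


# Hypothetical stems, not taken from the paper.
STRIDED_STEM = [2, 2, 2]        # every layer halves the resolution
STRIDE_FREE_STEM = [1, 2, 2]    # first layer keeps full resolution

if __name__ == "__main__":
    for name, strides in (("strided", STRIDED_STEM),
                          ("stride-free first layer", STRIDE_FREE_STEM)):
        print(name, feature_map_size(640, strides), object_extent(12, strides))
```

Under these assumed settings, a 12-pixel object spans 1.5 cells of an 80-cell map with the strided stem and 3.0 cells of a 160-cell map when the first stride is removed, at the cost of larger feature maps to process.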
Pages: 32
Related papers
50 in total
  • [31] Small object detection model for UAV aerial image based on YOLOv7
    Chen, Jinguang
    Wen, Ronghui
    Ma, Lili
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2695 - 2707
  • [32] HB-YOLO: An Improved YOLOv7 Algorithm for Dim-Object Tracking in Satellite Remote Sensing Videos
    Yu, Chaoran
    Feng, Zhejun
    Wu, Zengyan
    Wei, Runxi
    Song, Baoming
    Cao, Changqing
    REMOTE SENSING, 2023, 15 (14)
  • [33] Improved YOLOv5 for Remote Sensing Image Detection
    Liu, Tao
    Ding, Xueyan
    Zhang, Bingbing
    Zhang, Jianxin
    Computer Engineering and Applications, 2023, 59 (10) : 253 - 261
  • [34] Automatic detection of standing dead trees based on improved YOLOv7 from airborne remote sensing imagery
    Zhou, Hongwei
    Wu, Shangxin
    Xu, Zihan
    Sun, Hong
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [35] An Improved YOLOX for Remote Sensing Image Object Detection
    Fang, Zhou
    He, Lin
    Li, Yingqi
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [36] Remote Sensing Image Object Detection Based on Improved YOLOv3 in Deep Learning Environment
    Yang, Tianle
    Li, Jinghui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (15)
  • [37] Efficient Object Detection and Recognition of Body Welding Studs Based on Improved YOLOv7
    Huang, Hong
    Peng, Xiangqian
    Hu, Xiaoping
    Ou, Wenchu
    IEEE ACCESS, 2024, 12 : 41531 - 41541
  • [38] Improved YOLOv7 for Small Object Detection Algorithm Based on Attention and Dynamic Convolution
    Li, Kai
    Wang, Yanni
    Hu, Zhongmian
    APPLIED SCIENCES-BASEL, 2023, 13 (16)
  • [39] Mask wearing detection based on improved YOLOv7
    Fu Hui-chen
    Gao Jun-wei
    Che Lu-yang
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (08) : 1139 - 1147
  • [40] Helmet Detection Algorithm Based on Improved YOLOv7
    Yajie Yaermaimaiti Yilihamu
    Lingfei Liu
    Ruohao Xi
    Wang
    Automatic Control and Computer Sciences, 2024, 58 (6) : 642 - 655