Improved Architecture and Training Strategies of YOLOv7 for Remote Sensing Image Object Detection

被引:0
|
作者
Zhao, Dewei [1 ]
Shao, Faming [1 ]
Liu, Qiang [1 ]
Zhang, Heng [1 ]
Zhang, Zihan [1 ]
Yang, Li [1 ]
机构
[1] Army Engn Univ PLA, Coll Field Engn, Nanjing 210007, Peoples R China
基金
中国国家自然科学基金;
关键词
remote sensing; object detection; improvement; YOLOv7; small object;
D O I
10.3390/rs16173321
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The technology for object detection in remote sensing images finds extensive applications in production and people's lives, and improving the accuracy of image detection is a pressing need. With that goal, this paper proposes a range of improvements, rooted in the widely used YOLOv7 algorithm, after analyzing the requirements and difficulties in the detection of remote sensing images. Specifically, we strategically remove some standard convolution and pooling modules from the bottom of the network, adopting stride-free convolution to minimize the loss of information for small objects in the transmission. Simultaneously, we introduce a new, more efficient attention mechanism module for feature extraction, significantly enhancing the network's semantic extraction capabilities. Furthermore, by adding multiple cross-layer connections in the network, we more effectively utilize the feature information of each layer in the backbone network, thereby enhancing the network's overall feature extraction capability. During the training phase, we introduce an auxiliary network to intensify the training of the underlying network and adopt a new activation function and a more efficient loss function to ensure more effective gradient feedback, thereby elevating the network performance. In the experimental results, our improved network achieves impressive mAP scores of 91.2% and 80.8% on the DIOR and DOTA version 1.0 remote sensing datasets, respectively. These represent notable improvements of 4.5% and 7.0% over the original YOLOv7 network, significantly enhancing the efficiency of detecting small objects in particular.
引用
收藏
页数:32
相关论文
共 50 条
  • [21] Dense Small Object Detection Based on an Improved YOLOv7 Model
    Chen, Xun
    Deng, Linyi
    Hu, Chao
    Xie, Tianyi
    Wang, Chengqi
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [22] Improved Complex Road Scene Object Detection Algorithm of YOLOv7
    Du, Juan
    Cui, Shaohua
    Jin, Meijuan
    Ru, Chen
    Computer Engineering and Applications, 2024, 60 (01) : 96 - 103
  • [23] Underwater optical image object detection based on YOLOv7 algorithm
    Wang, Shaojie
    Wu, Weichao
    Wang, Xinyuan
    Han, Yongchen
    Ma, Yuwei
    OCEANS 2023 - LIMERICK, 2023,
  • [24] YOLOv7-sea: Object Detection of Maritime UAV Images based on Improved YOLOv7
    Zhao, Hangyue
    Zhang, Hongpu
    Zhao, Yanyun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 233 - 238
  • [25] Enhanced object detection in remote sensing images by applying metaheuristic and hybrid metaheuristic optimizers to YOLOv7 and YOLOv8
    Khaled Mohammed Elgamily
    M. A. Mohamed
    Ahmed Mohamed Abou-Taleb
    Mohamed Maher Ata
    Scientific Reports, 15 (1)
  • [26] Pedestrian Detection Method in Infrared Image Based on Improved YOLOv7
    Liu, Zhengyan
    Dai, Chaoyue
    Li, Xu
    Proceedings of 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence, ICIBA 2023, 2023, : 946 - 954
  • [27] Improved YOLOv7 Small Object Detection Algorithm for Seaside Aerial Images
    Yu, Miao
    Jia, YinShan
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 483 - 491
  • [28] Remote-Sensing Image Object Detection Based on Improved YOLOv8 Algorithm
    Zhang Xiuzai
    Shen Tao
    Xu Dai
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (10)
  • [29] A Small Object Detection Algorithm for Traffic Signs Based on Improved YOLOv7
    Li, Songjiang
    Wang, Shilong
    Wang, Peng
    SENSORS, 2023, 23 (16)
  • [30] Small object detection model for UAV aerial image based on YOLOv7
    Jinguang Chen
    Ronghui Wen
    Lili Ma
    Signal, Image and Video Processing, 2024, 18 : 2695 - 2707