A Multi- scale Hierarchical Residual Network- based Method for Tiny Object Detection in Optical Remote Sensing Images

被引:0
|
作者
Zeng, Xiangjin [1 ,2 ,3 ]
Liu, Genghuan [1 ,2 ]
Chen, Jianming [1 ,2 ,3 ]
Dou, Jiazhen [1 ,2 ]
Ren, Zhenbo [4 ,5 ]
Di, Jianglei [1 ,2 ]
Qin, Yuwen [1 ,2 ,3 ]
机构
[1] Guangdong Univ Technol, Inst Adv Photon Technol, Sch Informat Engn, Key Lab Photon Technol Integrated Sensing & Commun, Guangzhou 510006, Peoples R China
[2] Guangdong Univ Technol, Inst Adv Photon Technol, Sch Informat Engn, Guangdong Prov Key Lab Informat Photon Technol, Guangzhou 510006, Peoples R China
[3] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519082, Peoples R China
[4] Northwestern Polytech Univ, Minist Ind & Informat Technol, Key Lab Light Field Manipulat & Informat Acquisit, Xian 710129, Peoples R China
[5] Northwestern Polytech Univ, Shaanxi Key Lab Opt Informat Technol, Sch Phys Sci & Technol, Xian 710129, Peoples R China
基金
中国国家自然科学基金;
关键词
Optical remote sensing images; Tiny object detection; Deep learning; Multi scale; Convolutional neural network;
D O I
10.3788/gzxb20245308.0810001
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Optical remote sensing image object detection aims to precisely locate and categorize targets such as aircraft, vehicles, and ships. Challenges arise due to the vast distances in remote sensing, leading to numerous tiny objects that are hard to characterize. Additionally, complex backgrounds and environmental factors like lighting and weather conditions reduce signal-to-noise ratios, increasing detection difficulties. Although Convolutional Neural Networks (CNNs), especially those from the YOLO family, are employed for their efficient feature extraction capabilities, they perform poorly in detecting these tiny objects. The key to realize the detection of tiny objects in optical remote sensing images is to obtain sufficiently rich multi-scale feature information and clear tiny object features. Aiming at the above problems, this paper proposes a multi-scale hierarchical residual network based optical remote sensing image tiny object detection algorithm MHRM-YOLO on the basis of YOLOv5, and designs a simple and efficient Multi-scale Hierarchical Residual tiny object feature extraction Module (MHRM). This module expands on Cross Stage Partial (CSP) module by doing more layered design and using different convolutional combinations to extract features from different layered, which allows the network to obtain richer gradient information flow and output richer feature map combinations. In addition, MHRM can be easily embedded into the existing mainstream YOLO detection algorithm backbone network, which can obtain richer sensory fields at a finer granularity level and can effectively capture the contextual information of tiny objects and retain their spatial feature information. The network structure of the MHRM-YOLO algorithm is mainly divided into three parts, namely the backbone, the neck, and the head for prediction. The backbone consists of MHRM and basic convolution module, which performs finegrained feature extraction to obtain more multi-scale information and larger sensory field; the neck part uses the conventional CSP plus Path Aggregation Network (PAN) feature pyramid structure to perform multi-scale feature fusion; and the prediction part uses the optimized localization loss function to perform computation. Since tiny object detection is sensitive to positional offsets during regression, the localization loss penalty term is further improved to enhance the algorithm's ability to perceive positional offsets. The shape penalty term of the baseline CIoU localization loss has lost its effect, in this regard, the optimized loss function retains the Euclidean distance penalty term of the centroid and adjusts it to a scalable exponential function, and improves the shape penalty term to a bounding box distance penalty term, which weakens the detection algorithm's sensitivity to positional offsets, and further improves the performance of the detection algorithm. In order to validate the effectiveness of the proposed detection algorithm, MHRM-YOLO conducts systematic experiments on the challenging optical remote sensing image tiny object detection dataset AITODv2 and the tiny pedestrian dataset TinyPerson. Systematic ablation experiments are conducted for the effects between different module combinations, the effects of the loss function, the performance difference between different backbone network modules and the portability of the algorithm, and the experimental results show that both the MHRM module and the localization loss function can improve the performance of the detection algorithm. Compared with the benchmark YOLOv5 algorithm, the average detection accuracy of MHRM-YOLO on the two datasets is improved by 5.5 degrees o and 1.8 degrees o respectively, which effectively reduces the false detection rate and the leakage rate of the detection of tiny objects in optical remote sensing images. Of course, due to the use of larger-scale feature layers for detection, the MHRMYOLO detection algorithm has an increased computational volume and a slight decrease in inference speed compared with the benchmark algorithm. The algorithm still has the problem of missed detection for relatively irregularly shaped target algorithms. In addition, the experimental results show that although the detection accuracy of the MHRM-YOLO algorithm has an advantage over the mainstream detection algorithms, the detection results are generally low, much lower than the accuracy of conventional target detection, and the algorithm still has room for further optimization.
引用
收藏
页数:13
相关论文
共 36 条
  • [1] [Anonymous], 2024, GLENN J. YOLOv5
  • [2] [Anonymous], 2024, GLENN J. YOLOv8
  • [3] [Anonymous], 2015, Chinese Journal of Lasers
  • [4] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [5] RRNet: A Hybrid Detector for Object Detection in Drone-captured Images
    Chen, Changrui
    Zhang, Yu
    Lv, Qingxuan
    Wei, Shuo
    Wang, Xiaorui
    Sun, Xin
    Dong, Junyu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 100 - 108
  • [6] Chen K, 2019, Arxiv, DOI [arXiv:1906.07155, DOI 10.48550/ARXIV.1906.07155]
  • [7] Towards Large-Scale Small Object Detection: Survey and Benchmarks
    Cheng, Gong
    Yuan, Xiang
    Yao, Xiwen
    Yan, Kebing
    Zeng, Qinghua
    Xie, Xingxing
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
  • [8] Small object detection in remote sensing images based on super-resolution
    Fang Xiaolin
    Hu Fan
    Yang Ming
    Zhu Tongxin
    Bi Ran
    Zhang Zenghui
    Gao Zhiyuan
    [J]. PATTERN RECOGNITION LETTERS, 2022, 153 : 107 - 112
  • [9] Object Detection Method of Optical Remote Sensing Image with Multi- attention Mechanism
    Fu Hongjian
    Bai Hongyang
    Guo Hongwei
    Yuan Yuman
    Qin Weiwei
    [J]. ACTA PHOTONICA SINICA, 2022, 51 (12)
  • [10] Gaussian similarity-based adaptive dynamic label assignment for tiny object detection
    Fu, Ronghao
    Chen, Chengcheng
    Yan, Shuang
    Heidari, Ali Asghar
    Wang, Xianchang
    Escorcia-Gutierrez, Jose
    Mansour, Romany F.
    Chene, Huiling
    [J]. NEUROCOMPUTING, 2023, 543