A Multi- scale Hierarchical Residual Network- based Method for Tiny Object Detection in Optical Remote Sensing Images

被引：0

作者：

Zeng, Xiangjin ^{[1
,2
,3
]}

Liu, Genghuan ^{[1
,2
]}

Chen, Jianming ^{[1
,2
,3
]}

Dou, Jiazhen ^{[1
,2
]}

Ren, Zhenbo ^{[4
,5
]}

Di, Jianglei ^{[1
,2
]}

Qin, Yuwen ^{[1
,2
,3
]}

机构：

[1] Guangdong Univ Technol, Inst Adv Photon Technol, Sch Informat Engn, Key Lab Photon Technol Integrated Sensing & Commun, Guangzhou 510006, Peoples R China

[2] Guangdong Univ Technol, Inst Adv Photon Technol, Sch Informat Engn, Guangdong Prov Key Lab Informat Photon Technol, Guangzhou 510006, Peoples R China

[3] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519082, Peoples R China

[4] Northwestern Polytech Univ, Minist Ind & Informat Technol, Key Lab Light Field Manipulat & Informat Acquisit, Xian 710129, Peoples R China

[5] Northwestern Polytech Univ, Shaanxi Key Lab Opt Informat Technol, Sch Phys Sci & Technol, Xian 710129, Peoples R China

来源：

ACTA PHOTONICA SINICA | 2024年 / 53卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Optical remote sensing images; Tiny object detection; Deep learning; Multi scale; Convolutional neural network;

D O I：

10.3788/gzxb20245308.0810001

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

Optical remote sensing image object detection aims to precisely locate and categorize targets such as aircraft, vehicles, and ships. Challenges arise due to the vast distances in remote sensing, leading to numerous tiny objects that are hard to characterize. Additionally, complex backgrounds and environmental factors like lighting and weather conditions reduce signal-to-noise ratios, increasing detection difficulties. Although Convolutional Neural Networks (CNNs), especially those from the YOLO family, are employed for their efficient feature extraction capabilities, they perform poorly in detecting these tiny objects. The key to realize the detection of tiny objects in optical remote sensing images is to obtain sufficiently rich multi-scale feature information and clear tiny object features. Aiming at the above problems, this paper proposes a multi-scale hierarchical residual network based optical remote sensing image tiny object detection algorithm MHRM-YOLO on the basis of YOLOv5, and designs a simple and efficient Multi-scale Hierarchical Residual tiny object feature extraction Module (MHRM). This module expands on Cross Stage Partial (CSP) module by doing more layered design and using different convolutional combinations to extract features from different layered, which allows the network to obtain richer gradient information flow and output richer feature map combinations. In addition, MHRM can be easily embedded into the existing mainstream YOLO detection algorithm backbone network, which can obtain richer sensory fields at a finer granularity level and can effectively capture the contextual information of tiny objects and retain their spatial feature information. The network structure of the MHRM-YOLO algorithm is mainly divided into three parts, namely the backbone, the neck, and the head for prediction. The backbone consists of MHRM and basic convolution module, which performs finegrained feature extraction to obtain more multi-scale information and larger sensory field; the neck part uses the conventional CSP plus Path Aggregation Network (PAN) feature pyramid structure to perform multi-scale feature fusion; and the prediction part uses the optimized localization loss function to perform computation. Since tiny object detection is sensitive to positional offsets during regression, the localization loss penalty term is further improved to enhance the algorithm's ability to perceive positional offsets. The shape penalty term of the baseline CIoU localization loss has lost its effect, in this regard, the optimized loss function retains the Euclidean distance penalty term of the centroid and adjusts it to a scalable exponential function, and improves the shape penalty term to a bounding box distance penalty term, which weakens the detection algorithm's sensitivity to positional offsets, and further improves the performance of the detection algorithm. In order to validate the effectiveness of the proposed detection algorithm, MHRM-YOLO conducts systematic experiments on the challenging optical remote sensing image tiny object detection dataset AITODv2 and the tiny pedestrian dataset TinyPerson. Systematic ablation experiments are conducted for the effects between different module combinations, the effects of the loss function, the performance difference between different backbone network modules and the portability of the algorithm, and the experimental results show that both the MHRM module and the localization loss function can improve the performance of the detection algorithm. Compared with the benchmark YOLOv5 algorithm, the average detection accuracy of MHRM-YOLO on the two datasets is improved by 5.5 degrees o and 1.8 degrees o respectively, which effectively reduces the false detection rate and the leakage rate of the detection of tiny objects in optical remote sensing images. Of course, due to the use of larger-scale feature layers for detection, the MHRMYOLO detection algorithm has an increased computational volume and a slight decrease in inference speed compared with the benchmark algorithm. The algorithm still has the problem of missed detection for relatively irregularly shaped target algorithms. In addition, the experimental results show that although the detection accuracy of the MHRM-YOLO algorithm has an advantage over the mainstream detection algorithms, the detection results are generally low, much lower than the accuracy of conventional target detection, and the algorithm still has room for further optimization.

引用

页数：13

共 36 条

[1] [Anonymous], 2024, GLENN J. YOLOv5
[2] [Anonymous], 2024, GLENN J. YOLOv8
[3] [Anonymous], 2015, Chinese Journal of Lasers
[4] Cascade R-CNN: Delving into High Quality Object Detection
Cai, Zhaowei
Vasconcelos, Nuno
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
[5] RRNet: A Hybrid Detector for Object Detection in Drone-captured Images
Chen, Changrui
Zhang, Yu
Lv, Qingxuan
Wei, Shuo
Wang, Xiaorui
Sun, Xin
Dong, Junyu
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 100 - 108
[6] Chen K, 2019, Arxiv, DOI [arXiv:1906.07155, DOI 10.48550/ARXIV.1906.07155]
[7] Towards Large-Scale Small Object Detection: Survey and Benchmarks
Cheng, Gong
Yuan, Xiang
Yao, Xiwen
Yan, Kebing
Zeng, Qinghua
Xie, Xingxing
Han, Junwei
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
[8] Small object detection in remote sensing images based on super-resolution
Fang Xiaolin
Hu Fan
Yang Ming
Zhu Tongxin
Bi Ran
Zhang Zenghui
Gao Zhiyuan
[J]. PATTERN RECOGNITION LETTERS, 2022, 153 : 107 - 112
[9] Object Detection Method of Optical Remote Sensing Image with Multi- attention Mechanism
Fu Hongjian
Bai Hongyang
Guo Hongwei
Yuan Yuman
Qin Weiwei
[J]. ACTA PHOTONICA SINICA, 2022, 51 (12)
[10] Gaussian similarity-based adaptive dynamic label assignment for tiny object detection
Fu, Ronghao
Chen, Chengcheng
Yan, Shuang
Heidari, Ali Asghar
Wang, Xianchang
Escorcia-Gutierrez, Jose
Mansour, Romany F.
Chene, Huiling
[J]. NEUROCOMPUTING, 2023, 543

← 1 2 3 4 →