An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

被引：1

作者：

Qian, Rui ^{[1
]}

Ding, Yong ^{[2
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

[2] Nanjing Univ Aeronaut & Astronaut, Coll Automat Engn, Nanjing 210016, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 20期

关键词：

UAV; object detection; global attention; feature fusion;

D O I：

10.3390/electronics13203989

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Object detection technology holds significant promise in unmanned aerial vehicle (UAV) applications. However, traditional methods face challenges in detecting denser, smaller, and more complex targets within UAV aerial images. To address issues such as target occlusion and dense small objects, this paper proposes a multi-scale object detection algorithm based on YOLOv5s. A novel feature extraction module, DCNCSPELAN4, which combines CSPNet and ELAN, is introduced to enhance the receptive field of feature extraction while maintaining network efficiency. Additionally, a lightweight Vision Transformer module, the CloFormer Block, is integrated to provide the network with a global receptive field. Moreover, the algorithm incorporates a three-scale feature fusion (TFE) module and a scale sequence feature fusion (SSFF) module in the neck network to effectively leverage multi-scale spatial information across different feature maps. To address dense small objects, an additional small object detection head was added to the detection layer. The original large object detection head was removed to reduce computational load. The proposed algorithm has been evaluated through ablation experiments and compared with other state-of-the-art methods on the VisDrone2019 and AU-AIR datasets. The results demonstrate that our algorithm outperforms other baseline methods in terms of both accuracy and speed. Compared to the YOLOv5s baseline model, the enhanced algorithm achieves improvements of 12.4% and 8.4% in AP50 and AP metrics, respectively, with only a marginal parameter increase of 0.3 M. These experiments validate the effectiveness of our algorithm for object detection in drone imagery.

引用

页数：20

共 50 条

[1] Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV
Liu Fang
Wu Zhiwei
Yang Anzhe
Han Xiao
ACTA OPTICA SINICA, 2020, 40 (10)
[2] SiamMFF: UAV Object Tracking Algorithm Based on Multi-Scale Feature Fusion
Hou, Yanli
Gai, Xilin
Wang, Xintao
Zhang, Yongqiang
IEEE ACCESS, 2024, 12 : 24725 - 24734
[3] Object Detection of Remote Sensing Image Based on Multi-Scale Feature Fusion and Attention Mechanism
Du, Zuoqiang
Liang, Yuan
IEEE ACCESS, 2024, 12 : 8619 - 8632
[4] Multi-scale object detection in UAV images based on adaptive feature fusion
Tan, Siqi
Duan, Zhijian
Pu, Longzhong
PLOS ONE, 2024, 19 (03):
[5] SGMFNet: a remote sensing image object detection network based on spatial global attention and multi-scale feature fusion
Gong, Xiaolin
Liu, Daqing
REMOTE SENSING LETTERS, 2024, 15 (05) : 466 - 477
[6] Underwater image object detection based on multi-scale feature fusion
Yang, Chao
Zhang, Ce
Jiang, Longyu
Zhang, Xinwen
MACHINE VISION AND APPLICATIONS, 2024, 35 (06)
[7] Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion
She, Xiangyang
Liu, Zhe
Dong, Lihong
Computer Engineering and Applications, 2024, 60 (01) : 198 - 206
[8] Pedestrian detection algorithm based on multi-scale feature extraction and attention feature fusion
Xia, Hao
Ma, Jun
Ou, Jiayu
Lv, Xinyao
Bai, Chengjie
DIGITAL SIGNAL PROCESSING, 2022, 121
[9] Multi-scale fusion and efficient feature extraction for enhanced sonar image object detection
Shi, Pengfei
He, Qi
Zhu, Sisi
Li, Xinyu
Fan, Xinnan
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
[10] UAV image object detection based on self-attention guidance and global feature fusion
Bai, Jing
Hu, Haiyang
Liu, Xiaojing
Zhuang, Shanna
Wang, Zhengyou
IMAGE AND VISION COMPUTING, 2024, 151

← 1 2 3 4 5 →