An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

被引:1
|
作者
Qian, Rui [1 ]
Ding, Yong [2 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanjing Univ Aeronaut & Astronaut, Coll Automat Engn, Nanjing 210016, Peoples R China
关键词
UAV; object detection; global attention; feature fusion;
D O I
10.3390/electronics13203989
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection technology holds significant promise in unmanned aerial vehicle (UAV) applications. However, traditional methods face challenges in detecting denser, smaller, and more complex targets within UAV aerial images. To address issues such as target occlusion and dense small objects, this paper proposes a multi-scale object detection algorithm based on YOLOv5s. A novel feature extraction module, DCNCSPELAN4, which combines CSPNet and ELAN, is introduced to enhance the receptive field of feature extraction while maintaining network efficiency. Additionally, a lightweight Vision Transformer module, the CloFormer Block, is integrated to provide the network with a global receptive field. Moreover, the algorithm incorporates a three-scale feature fusion (TFE) module and a scale sequence feature fusion (SSFF) module in the neck network to effectively leverage multi-scale spatial information across different feature maps. To address dense small objects, an additional small object detection head was added to the detection layer. The original large object detection head was removed to reduce computational load. The proposed algorithm has been evaluated through ablation experiments and compared with other state-of-the-art methods on the VisDrone2019 and AU-AIR datasets. The results demonstrate that our algorithm outperforms other baseline methods in terms of both accuracy and speed. Compared to the YOLOv5s baseline model, the enhanced algorithm achieves improvements of 12.4% and 8.4% in AP50 and AP metrics, respectively, with only a marginal parameter increase of 0.3 M. These experiments validate the effectiveness of our algorithm for object detection in drone imagery.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV
    Liu Fang
    Wu Zhiwei
    Yang Anzhe
    Han Xiao
    ACTA OPTICA SINICA, 2020, 40 (10)
  • [2] SiamMFF: UAV Object Tracking Algorithm Based on Multi-Scale Feature Fusion
    Hou, Yanli
    Gai, Xilin
    Wang, Xintao
    Zhang, Yongqiang
    IEEE ACCESS, 2024, 12 : 24725 - 24734
  • [3] Object Detection of Remote Sensing Image Based on Multi-Scale Feature Fusion and Attention Mechanism
    Du, Zuoqiang
    Liang, Yuan
    IEEE ACCESS, 2024, 12 : 8619 - 8632
  • [4] Multi-scale object detection in UAV images based on adaptive feature fusion
    Tan, Siqi
    Duan, Zhijian
    Pu, Longzhong
    PLOS ONE, 2024, 19 (03):
  • [5] SGMFNet: a remote sensing image object detection network based on spatial global attention and multi-scale feature fusion
    Gong, Xiaolin
    Liu, Daqing
    REMOTE SENSING LETTERS, 2024, 15 (05) : 466 - 477
  • [6] Underwater image object detection based on multi-scale feature fusion
    Yang, Chao
    Zhang, Ce
    Jiang, Longyu
    Zhang, Xinwen
    MACHINE VISION AND APPLICATIONS, 2024, 35 (06)
  • [7] Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion
    She, Xiangyang
    Liu, Zhe
    Dong, Lihong
    Computer Engineering and Applications, 2024, 60 (01) : 198 - 206
  • [8] Pedestrian detection algorithm based on multi-scale feature extraction and attention feature fusion
    Xia, Hao
    Ma, Jun
    Ou, Jiayu
    Lv, Xinyao
    Bai, Chengjie
    DIGITAL SIGNAL PROCESSING, 2022, 121
  • [9] Multi-scale fusion and efficient feature extraction for enhanced sonar image object detection
    Shi, Pengfei
    He, Qi
    Zhu, Sisi
    Li, Xinyu
    Fan, Xinnan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [10] UAV image object detection based on self-attention guidance and global feature fusion
    Bai, Jing
    Hu, Haiyang
    Liu, Xiaojing
    Zhuang, Shanna
    Wang, Zhengyou
    IMAGE AND VISION COMPUTING, 2024, 151