An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

被引:1
|
作者
Qian, Rui [1 ]
Ding, Yong [2 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanjing Univ Aeronaut & Astronaut, Coll Automat Engn, Nanjing 210016, Peoples R China
关键词
UAV; object detection; global attention; feature fusion;
D O I
10.3390/electronics13203989
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection technology holds significant promise in unmanned aerial vehicle (UAV) applications. However, traditional methods face challenges in detecting denser, smaller, and more complex targets within UAV aerial images. To address issues such as target occlusion and dense small objects, this paper proposes a multi-scale object detection algorithm based on YOLOv5s. A novel feature extraction module, DCNCSPELAN4, which combines CSPNet and ELAN, is introduced to enhance the receptive field of feature extraction while maintaining network efficiency. Additionally, a lightweight Vision Transformer module, the CloFormer Block, is integrated to provide the network with a global receptive field. Moreover, the algorithm incorporates a three-scale feature fusion (TFE) module and a scale sequence feature fusion (SSFF) module in the neck network to effectively leverage multi-scale spatial information across different feature maps. To address dense small objects, an additional small object detection head was added to the detection layer. The original large object detection head was removed to reduce computational load. The proposed algorithm has been evaluated through ablation experiments and compared with other state-of-the-art methods on the VisDrone2019 and AU-AIR datasets. The results demonstrate that our algorithm outperforms other baseline methods in terms of both accuracy and speed. Compared to the YOLOv5s baseline model, the enhanced algorithm achieves improvements of 12.4% and 8.4% in AP50 and AP metrics, respectively, with only a marginal parameter increase of 0.3 M. These experiments validate the effectiveness of our algorithm for object detection in drone imagery.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Remote Sensing Small Object Detection Network Based on Attention Mechanism and Multi-Scale Feature Fusion
    Qu, Junsuo
    Tang, Zongbing
    Zhang, Le
    Zhang, Yanghai
    Zhang, Zhenguo
    REMOTE SENSING, 2023, 15 (11)
  • [32] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [33] Binocular Depth Estimation Algorithm Based on Multi-Scale Attention Feature Fusion
    Yang Huitong
    Lei Lang
    Lin Yongchun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [34] Spatial Attention for Multi-Scale Feature Refinement for Object Detection
    Wang, Haoran
    Wang, Zexin
    Jia, Meixia
    Li, Aijin
    Feng, Tuo
    Zhang, Wenhua
    Jiao, Licheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 64 - 72
  • [35] A multi-scale feature fusion target detection algorithm
    Dong, Chong
    Li, Jingmei
    Wang, Jiaxiang
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [36] Lightweight Fire Detection Algorithm Based on Multi-Scale Feature Fusion
    Yang, Guowei
    Liu, Xuan
    Gao, Min
    Xu, Di
    Computer Engineering and Applications, 60 (23): : 229 - 237
  • [37] Small Object Detection in UAV Remote Sensing Images Based on Intra-Group Multi-Scale Fusion Attention and Adaptive Weighted Feature Fusion Mechanism
    Yuan, Zhe
    Gong, Jianglei
    Guo, Baolong
    Wang, Chao
    Liao, Nannan
    Song, Jiawei
    Wu, Qiming
    REMOTE SENSING, 2024, 16 (22)
  • [38] Few-shot based learning recaptured image detection with multi-scale feature fusion and attention☆
    Hussain, Israr
    Tan, Shunquan
    Huang, Jiwu
    PATTERN RECOGNITION, 2025, 161
  • [39] A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images
    Cheng, Yong
    Wang, Wei
    Zhang, Wenjie
    Yang, Ling
    Wang, Jun
    Ni, Huan
    Guan, Tingzhao
    He, Jiaxin
    Gu, Yakang
    Tran, Ngoc Nguyen
    REMOTE SENSING, 2023, 15 (08)
  • [40] AMFT-YOLO: A Adaptive Multi-scale YOLO Algorithm with Multi-level Feature Fusion for Object Detection in UAV Scenes
    Wang, Tiebiao
    Cui, Zhenchao
    Li, Xiaoyang
    MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 72 - 85