SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes

被引:10
|
作者
Wang, Yuming [1 ,2 ]
Zou, Hua [1 ]
Yin, Ming [2 ]
Zhang, Xining [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Wuhan Text Univ, Sch Elect & Elect Engn, Wuhan 430077, Peoples R China
关键词
object detection; unmanned aerial vehicles; tiny objects; complex scenarios; multi-level feature information fusion; NETWORK;
D O I
10.3390/rs15184580
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Object detection in images captured by unmanned aerial vehicles (UAVs) holds great potential in various domains, including civilian applications, urban planning, and disaster response. However, it faces several challenges, such as multi-scale variations, dense scenes, complex backgrounds, and tiny-sized objects. In this paper, we present a novel scale-adaptive YOLO framework called SMFF-YOLO, which addresses these challenges through a multi-level feature fusion approach. To improve the detection accuracy of small objects, our framework incorporates the ELAN-SW object detection prediction head. This newly designed head effectively utilizes both global contextual information and local features, enhancing the detection accuracy of tiny objects. Additionally, the proposed bidirectional feature fusion pyramid (BFFP) module tackles the issue of scale variations in object sizes by aggregating multi-scale features. To handle complex backgrounds, we introduce the adaptive atrous spatial pyramid pooling (AASPP) module, which enables adaptive feature fusion and alleviates the negative impact of cluttered scenes. Moreover, we adopt the Wise-IoU(WIoU) bounding box regression loss to enhance the competitiveness of different quality anchor boxes, which offers the framework a more informed gradient allocation strategy. We validate the effectiveness of SMFF-YOLO using the VisDrone and UAVDT datasets. Experimental results demonstrate that our model achieves higher detection accuracy, with AP50 reaching 54.3% for VisDrone and 42.4% for UAVDT datasets. Visual comparative experiments with other YOLO-based methods further illustrate the robustness and adaptability of our approach.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] MLSA-YOLO: a multi-level feature fusion and scale-adaptive framework for small object detectionMLSA-YOLO: a multi-level feature fusion and scale-adaptive...J. Peng et al.
    Jiayu Peng
    Kai Lv
    Guoliang Wang
    Wendong Xiao
    Teng Ran
    Liang Yuan
    The Journal of Supercomputing, 81 (4)
  • [2] LFF-YOLO: A YOLO Algorithm With Lightweight Feature Fusion Network for Multi-Scale Defect Detection
    Qian, Xiaohong
    Wang, Xu
    Yang, Shengying
    Lei, Jingsheng
    IEEE ACCESS, 2022, 10 : 130339 - 130349
  • [3] Adaptive Fusion of Multi-Scale YOLO for Pedestrian Detection
    Hsu, Wei-Yen
    Lin, Wen-Yen
    IEEE ACCESS, 2021, 9 : 110063 - 110073
  • [4] A Lightweight YOLO Object Detection Algorithm Based on Bidirectional Multi-Scale Feature Enhancement
    Liu, Qunpo
    Zhang, Jingwen
    Zhang, Zhuoran
    Bu, Xuhui
    Hanajima, Naohiko
    ADVANCED THEORY AND SIMULATIONS, 2024, 7 (05)
  • [5] HDR-YOLO: Adaptive Object Detection in Haze, Dark, and Rain Scenes Based on YOLO
    Lyu, Zonglei
    An, Wei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (05)
  • [6] FE-YOLO: YOLO ship detection algorithm based on feature fusion and feature enhancement
    Shouwen Cai
    Hao Meng
    Junbao Wu
    Journal of Real-Time Image Processing, 2024, 21
  • [7] YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion
    Ma, Chengji
    Fu, Yanyun
    Wang, Deyong
    Guo, Rui
    Zhao, Xueyi
    Fang, Jian
    IEEE ACCESS, 2023, 11 : 126857 - 126878
  • [8] FE-YOLO: YOLO ship detection algorithm based on feature fusion and feature enhancement
    Cai, Shouwen
    Meng, Hao
    Wu, Junbao
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (02)
  • [9] Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV
    Liu Fang
    Wu Zhiwei
    Yang Anzhe
    Han Xiao
    ACTA OPTICA SINICA, 2020, 40 (10)
  • [10] EBFF-YOLO: enhanced bimodal feature fusion network for UAV image object detection
    Xue, Ping
    Zhang, Zhen
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) : 6591 - 6600