SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes

被引:10
|
作者
Wang, Yuming [1 ,2 ]
Zou, Hua [1 ]
Yin, Ming [2 ]
Zhang, Xining [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Wuhan Text Univ, Sch Elect & Elect Engn, Wuhan 430077, Peoples R China
关键词
object detection; unmanned aerial vehicles; tiny objects; complex scenarios; multi-level feature information fusion; NETWORK;
D O I
10.3390/rs15184580
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Object detection in images captured by unmanned aerial vehicles (UAVs) holds great potential in various domains, including civilian applications, urban planning, and disaster response. However, it faces several challenges, such as multi-scale variations, dense scenes, complex backgrounds, and tiny-sized objects. In this paper, we present a novel scale-adaptive YOLO framework called SMFF-YOLO, which addresses these challenges through a multi-level feature fusion approach. To improve the detection accuracy of small objects, our framework incorporates the ELAN-SW object detection prediction head. This newly designed head effectively utilizes both global contextual information and local features, enhancing the detection accuracy of tiny objects. Additionally, the proposed bidirectional feature fusion pyramid (BFFP) module tackles the issue of scale variations in object sizes by aggregating multi-scale features. To handle complex backgrounds, we introduce the adaptive atrous spatial pyramid pooling (AASPP) module, which enables adaptive feature fusion and alleviates the negative impact of cluttered scenes. Moreover, we adopt the Wise-IoU(WIoU) bounding box regression loss to enhance the competitiveness of different quality anchor boxes, which offers the framework a more informed gradient allocation strategy. We validate the effectiveness of SMFF-YOLO using the VisDrone and UAVDT datasets. Experimental results demonstrate that our model achieves higher detection accuracy, with AP50 reaching 54.3% for VisDrone and 42.4% for UAVDT datasets. Visual comparative experiments with other YOLO-based methods further illustrate the robustness and adaptability of our approach.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] YOLO V5-MAX: A Multi-object Detection Algorithm in Complex Scenes
    Li, Xingkun
    Tian, Guangyu
    Lu, Zhenghong
    Zhang, Guojun
    Proceedings - 2023 IEEE 6th International Conference on Industrial Cyber-Physical Systems, ICPS 2023, 2023,
  • [22] YOLO V5-MAX: A Multi-object Detection Algorithm in Complex Scenes
    Li, Xingkun
    Tian, Guangyu
    Lu, Zhenghong
    Zhang, Guojun
    2023 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2023,
  • [23] Multi-dimensional, multi-functional and multi-level attention in YOLO for underwater object detection
    Shen, Xin
    Sun, Xudong
    Wang, Huibing
    Fu, Xianping
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19935 - 19960
  • [24] Multi-dimensional, multi-functional and multi-level attention in YOLO for underwater object detection
    Xin Shen
    Xudong Sun
    Huibing Wang
    Xianping Fu
    Neural Computing and Applications, 2023, 35 : 19935 - 19960
  • [25] Lightweight Underwater Object Detection Based on YOLO v4 and Multi-Scale Attentional Feature Fusion
    Zhang, Minghua
    Xu, Shubo
    Song, Wei
    He, Qi
    Wei, Quanmiao
    REMOTE SENSING, 2021, 13 (22)
  • [26] MRD-YOLO: A Multispectral Object Detection Algorithm for Complex Road Scenes
    Sun, Chaoyue
    Chen, Yajun
    Qiu, Xiaoyang
    Li, Rongzhen
    You, Longxiang
    SENSORS, 2024, 24 (10)
  • [27] Multi-level feature fusion pyramid network for object detection
    Guo, Zebin
    Shuai, Hui
    Liu, Guangcan
    Zhu, Yisheng
    Wang, Wenqing
    VISUAL COMPUTER, 2023, 39 (09): : 4267 - 4277
  • [28] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    The Visual Computer, 2023, 39 : 4267 - 4277
  • [29] LMSD-YOLO: A Lightweight YOLO Algorithm for Multi-Scale SAR Ship Detection
    Guo, Yue
    Chen, Shiqi
    Zhan, Ronghui
    Wang, Wei
    Zhang, Jun
    REMOTE SENSING, 2022, 14 (19)
  • [30] FC-YOLO: an aircraft skin defect detection algorithm based on multi-scale collaborative feature fusion
    Zhang, Wei
    Liu, Jiyuan
    Yan, Zhiqi
    Zhao, Minghang
    Fu, Xuyun
    Zhu, Hengjia
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (11)