RPS-YOLO: A Recursive Pyramid Structure-Based YOLO Network for Small Object Detection in Unmanned Aerial Vehicle Scenarios

被引:0
|
作者
Lei, Penghui [1 ,2 ]
Wang, Chenkang [1 ,2 ]
Liu, Peigang [1 ,2 ]
机构
[1] China Univ Petr East China, Qingdao Inst Software, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Shandong Key Lab Intelligent Oil & Gas Ind Softwar, Qingdao 266580, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 04期
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
YOLOv8; small target detection; unmanned aerial vehicle; attention mechanisms;
D O I
10.3390/app15042039
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The fast advancement of unmanned aerial vehicle (UAV) technology has facilitated its use across a wide range of scenarios. Due to the high mobility and flexibility of drones, the images they capture often exhibit significant scale variations and severe object occlusions, leading to a high density of small objects. However, the existing object detection algorithms struggle with detecting small objects effectively in cross-scale detection scenarios. To overcome these difficulties, we introduce a new object detection model, RPS-YOLO, based on the YOLOv8 architecture. Unlike the existing methods that rely on traditional feature pyramids, our approach introduces a recursive feature pyramid (RFP) structure. This structure performs two rounds of feature extraction, and we reduce one downsampling step in the first round to enhance attention to small objects during cross-scale detection. Additionally, we design a novel attention mechanism that improves feature representation and mitigates feature degradation during convolution by capturing spatial- and channel-specific details. Another key innovation is the proposed Localization IOU (LIOU) loss function for bounding box regression, which accelerates the regression process by incorporating angular constraints. Experiments conducted on the VisDrone-DET2021 and UAVDT datasets show that RPS-YOLO surpasses YOLOv8s, with an mAP50 improvement of 8.2% and 3.4%, respectively. Our approach demonstrates that incorporating recursive feature extraction and exploiting detailed information for multi-scale detection significantly improves detection performance, particularly for small objects in UAV images.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] YOLO glass: video-based smart object detection using squeeze and attention YOLO network
    T. Sugashini
    G. Balakrishnan
    Signal, Image and Video Processing, 2024, 18 : 2105 - 2115
  • [42] ESOD-YOLO: an enhanced efficient small object detection framework for aerial images
    Xu, Xin
    Li, Qi
    Pan, Jie
    Lu, Xingzheng
    Wei, Hongwei
    Sun, Mingzheng
    Zhang, Haoze
    COMPUTING, 2025, 107 (02)
  • [43] YOLO-Tiny: A lightweight small object detection algorithm for UAV aerial imagery
    Feng, Fei
    Yang, Lu
    Zhou, Quanxing
    Li, Weipeng
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [44] A Multiscale Information Fusion Network Based on PixelShuffle Integrated With YOLO for Aerial Remote Sensing Object Detection
    Xi, Li Hu
    Hou, Jing Wei
    Ma, Guang Lin
    Hei, Yong Qiang
    Li, Wen Tao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [45] A YOLO-Based Insect Detection: Potential Use of Small Multirotor Unmanned Aerial Vehicles (UAVs) Monitoring
    Berger, Guido S.
    Mendes, Joao
    Chellal, Arezki Abderrahim
    Bonzatto, Luciano, Jr.
    da Silva, Yago M. R.
    Zorawski, Matheus
    Pereira, Ana I.
    Pinto, Milena F.
    Castro, Joao
    Valente, Antonio
    Lima, Jose
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT I, OL2A 2023, 2024, 1981 : 3 - 17
  • [46] Improved YOLOv7 Algorithm for Small Object Detection in Unmanned Aerial Vehicle Image Scenarios
    Li, Xinmin
    Wei, Yingkun
    Li, Jiahui
    Duan, Wenwen
    Zhang, Xiaoqiang
    Huang, Yi
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [47] Lightweight vehicle object detection network for unmanned aerial vehicles aerial images
    Liu, Lu-Chen
    Jia, Xiang-Yu
    Han, Dong-Nuo
    Li, Zhen-Dong
    Sun, Hong-Mei
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [48] DFTD-YOLO: Lightweight Multi-Target Detection From Unmanned Aerial Vehicle Viewpoints
    Chen, Yuteng
    Liu, Zhaoguang
    IEEE ACCESS, 2025, 13 : 24672 - 24680
  • [49] Small Object Detection in Traffic Scenes Based on YOLO-MXANet
    He, Xiaowei
    Cheng, Rao
    Zheng, Zhonglong
    Wang, Zeji
    SENSORS, 2021, 21 (21)
  • [50] A deep neural network for small object detection in complex environments with unmanned aerial vehicle imagery
    Jobaer, Sayed
    Tang, Xue-song
    Zhang, Yihong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148