RPS-YOLO: A Recursive Pyramid Structure-Based YOLO Network for Small Object Detection in Unmanned Aerial Vehicle Scenarios

Cited by: 0
Authors
Lei, Penghui [1 ,2 ]
Wang, Chenkang [1 ,2 ]
Liu, Peigang [1 ,2 ]
Affiliations
[1] China Univ Petr East China, Qingdao Inst Software, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Shandong Key Lab Intelligent Oil & Gas Ind Softwar, Qingdao 266580, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2025 / Vol. 15 / Issue 04
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords
YOLOv8; small target detection; unmanned aerial vehicle; attention mechanisms;
DOI
10.3390/app15042039
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
The rapid advancement of unmanned aerial vehicle (UAV) technology has enabled its use across a wide range of scenarios. Because drones are highly mobile and flexible, the images they capture often exhibit significant scale variations and severe object occlusion, leading to a high density of small objects. Existing object detection algorithms, however, struggle to detect small objects effectively in cross-scale detection scenarios. To overcome these difficulties, we introduce a new object detection model, RPS-YOLO, based on the YOLOv8 architecture. Unlike existing methods that rely on traditional feature pyramids, our approach uses a recursive feature pyramid (RFP) structure. This structure performs two rounds of feature extraction, and we remove one downsampling step in the first round to enhance attention to small objects during cross-scale detection. Additionally, we design a novel attention mechanism that improves feature representation and mitigates feature degradation during convolution by capturing spatial- and channel-specific details. Another key innovation is the proposed Localization IOU (LIOU) loss function for bounding box regression, which accelerates the regression process by incorporating angular constraints. Experiments on the VisDrone-DET2021 and UAVDT datasets show that RPS-YOLO surpasses YOLOv8s, improving mAP50 by 8.2% and 3.4%, respectively. Our approach demonstrates that incorporating recursive feature extraction and exploiting detailed information for multi-scale detection significantly improves detection performance, particularly for small objects in UAV images.
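The abstract refers to IoU-based box regression with an angular constraint and to mAP50, which counts a detection as correct when its IoU with the ground truth is at least 0.5. The sketch below is only an illustration of those two ideas, not the paper's LIOU: the record does not give the LIOU formula, so the function `illustrative_angle_iou_loss`, its `sin(2 * angle)` penalty, and the `weight` parameter are all hypothetical choices made here for demonstration.

```python
import math


def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def center_angle(a, b):
    """Angle in [0, pi/2] between the line joining the box centers and the x-axis."""
    dx = abs((b[0] + b[2]) / 2 - (a[0] + a[2]) / 2)
    dy = abs((b[1] + b[3]) / 2 - (a[1] + a[3]) / 2)
    return math.atan2(dy, dx)


def illustrative_angle_iou_loss(pred, target, weight=0.5):
    """Hypothetical loss (NOT the paper's LIOU): 1 - IoU plus an angle penalty.

    The penalty sin(2 * angle) is zero when the centers are aligned along an
    axis and largest when the offset is diagonal (45 degrees).
    """
    angle = center_angle(pred, target)
    return (1.0 - iou(pred, target)) + weight * math.sin(2.0 * angle)


# The same 2-pixel horizontal shift applied to a small and a large box.
small_iou = iou((0, 0, 4, 4), (2, 0, 6, 4))
large_iou = iou((0, 0, 100, 100), (2, 0, 102, 100))
```

With these inputs the 4x4 box drops to an IoU of 1/3 and would fail the 0.5 threshold used by mAP50, while the 100x100 box stays at about 0.96. This sensitivity of small boxes to small localization errors is one reason small-object detection is hard.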
Pages: 20
Related papers
50 in total
  • [31] SAM-YOLO: An Improved Small Object Detection Model for Vehicle Detection
    Liao, JiaWang
    Jiang, SuYu
    Chen, MingHua
    Sun, ChengJiao
    EUROPEAN JOURNAL ON ARTIFICIAL INTELLIGENCE, 2025,
  • [32] Object Detection Technique for Small Unmanned Aerial Vehicle
    Bin Ramli, M. Faiz
    Legowo, Ari
    Shamsudin, Syariful Syafiq
    6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260
  • [33] DAID-YOLO: Small Object Detection Algorithm for Drone Aerial Images
    Han Ping
    Luo Jie
    Zuo Huahong
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 591 - 595
  • [34] Vis-YOLO: a lightweight and efficient image detector for unmanned aerial vehicle small objects
    Deng, Xiangyu
    Du, Jiangyong
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [35] Lightweight Object Detection Networks for UAV Aerial Images Based on YOLO
    Li, Yanshan
    Wang, Jiarong
    Zhang, Kunhua
    Yi, Jiawei
    Wei, Miaomiao
    Zheng, Lirong
    Xie, Weixin
    CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (04) : 997 - 1009
  • [36] FDI-YOLO: Feature disentanglement and interaction network based on YOLO for SAR object detection
    Wang, Peng
    Luo, Yuan
    Zhu, Zhilin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
  • [37] GCN-YOLO: YOLO Based on Graph Convolutional Network for SAR Vehicle Target Detection
    Chen, Peiyao
    Wang, Yinghua
    Liu, Hongwei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 1
  • [38] Small Object Detection Algorithm Based on ATO-YOLO
    Su, Jia
    Qin, Yichang
    Jia, Ze
    Wang, Jing
    Computer Engineering and Applications, 2024, 60 (06) : 68 - 77
  • [39] Object Detection Method Based on Improved YOLO Lightweight Network
    Li Chengyue
    Yao Jianmin
    Lin Zhixian
    Yan Qun
    Fan Baoqing
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (14)
  • [40] YOLO glass: video-based smart object detection using squeeze and attention YOLO network
    Sugashini, T.
    Balakrishnan, G.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2105 - 2115