Small target detection algorithm for aerial images based on feature reuse mechanism

被引:0
|
作者
Deng T. [1 ]
Cheng X. [1 ]
Liu J. [1 ]
Zhang X. [1 ]
机构
[1] School of Traffic and Transportation, Chongqing Jiaotong University, Chongqing
关键词
CARAFE; lightweight backbone; object detection; unmanned aerial vehicle (UAV) image; YOLOv8;
D O I
10.3785/j.issn.1008-973X.2024.03.001
中图分类号
学科分类号
摘要
A lightweight and efficient aerial image detection algorithm called Functional ShuffleNet YOLO (FS-YOLO) was proposed based on YOLOv8s, in order to address the issues of low detection accuracy for small targets and a large number of model parameters in current unmanned aerial vehicle (UAV) aerial image detection. A lightweight feature extraction network was introduced by reducing channel dimensions and improving the network architecture. This facilitated the efficient reuse of redundant feature information, generating more feature maps with fewer parameters, enhancing the model’s ability to extract and express feature information while significantly reducing the model size. Additionally, a content-aware feature recombination module was introduced during the feature fusion stage to enhance the attention on salient semantic information of small targets, thereby improving the detection performance of the network for aerial images. Experimental validation was conducted using the VisDrone dataset, and the results indicated that the proposed algorithm achieved a detection accuracy of 47.0% mAP0.5 with only 5.48 million parameters. This represented a 50.7% reduction in parameter count compared to the YOLOv8s benchmark algorithm, along with a 6.1% improvement in accuracy. Experimental results of DIOR dataset showed that FS-YOLO had strong generalization and was more competitive than other state-of-the-art algorithms. © 2024 Zhejiang University. All rights reserved.
引用
收藏
页码:437 / 448
页数:11
相关论文
共 26 条
  • [1] ZHANG X, WANG C, JIN J, Et al., Object detection of VisDrone by stronger feature extraction FasterRCNN [J], Journal of Electronic Imaging, 32, 1, pp. 13-18, (2023)
  • [2] HUANG H, LI L, MA H., An improved Cascade R-CNN based target detection algorithm for UAV aerial images [C], 2022 7th International Conference on Image, Vision and Computing, pp. 232-237, (2022)
  • [3] ZHU X, LYU S, WANG X, Et al., TPH-YOLOv5: improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios [C], Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pp. 2778-2788, (2021)
  • [4] LIU C, YANG D, TANG L, Et al., A lightweight object detector based on spatial-coordinate self-attention for UAV aerial images, Remote Sensing, 15, 1, (2023)
  • [5] HOWARD A, ZHU M, CHEN B, Et al., Mobilenets: efficient convolutional neural networks for mobile vision applications
  • [6] ZHANG X, ZHOU X, LIN M, Et al., Shufflenet: an extremely efficient convolutional neural network for mobile devices [C], 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6848-6856, (2018)
  • [7] MA N, ZHANG X, ZHENG H T, Et al., Shufflenet v2: practical guidelines for efficient CNN architecture sign [C], Proceedings of the European Conference on Computer Vision, pp. 116-131, (2018)
  • [8] YU G, CHANG Q, LV W, Et al., PP-PicoDet: a bett-er real-time object detector on mobile devices
  • [9] WANG J, CHEN K, XU R, Et al., CARAFE: content-aware reassembly of features [C], 2019 IEEE/CVF International Conference on Computer Vision, pp. 3007-3016, (2019)
  • [10] JOCHER G, NISHIMURA K, MINEEVA T, Et al., YOLOv8