Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm

被引:3
|
作者
Liu, Shi [1 ]
Zhu, Meng [2 ]
Tao, Rui [1 ,3 ]
Ren, Honge [1 ,4 ]
机构
[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China
[2] Harbin Univ, Coll Informat Engn, Harbin 150086, Peoples R China
[3] Hulunbuir Univ, Coll Artificial Intelligence & Big Data, Hulunbuir 021008, Peoples R China
[4] Heilongjiang Forestry Intelligent Equipment Engn R, Harbin 150040, Peoples R China
关键词
unmanned aerial vehicle; small object detection; Fine-Grained Feature; YOLOv8; NETWORK;
D O I
10.3390/drones8050181
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Unmanned aerial vehicle (UAV) aerial images often present challenges such as small target sizes, high target density, varied shooting angles, and dynamic poses. Existing target detection algorithms exhibit a noticeable performance decline when confronted with UAV aerial images compared to general scenes. This paper proposes an outstanding small target detection algorithm for UAVs, named Fine-Grained Feature Perception YOLOv8s-P2 (FGFP-YOLOv8s-P2), based on YOLOv8s-P2 architecture. We specialize in improving inspection accuracy while meeting real-time inspection requirements. First, we enhance the targets' pixel information by utilizing slice-assisted training and inference techniques, thereby reducing missed detections. Then, we propose a feature extraction module with deformable convolutions. Decoupling the learning process of offset and modulation scalar enables better adaptation to variations in the size and shape of diverse targets. In addition, we introduce a large kernel spatial pyramid pooling module. By cascading convolutions, we leverage the advantages of large kernels to flexibly adjust the model's attention to various regions of high-level feature maps, better adapting to complex visual scenes and circumventing the cost drawbacks associated with large kernels. To match the excellent real-time detection performance of the baseline model, we propose an improved Random FasterNet Block. This block introduces randomness during convolution and captures spatial features of non-linear transformation channels, enriching feature representations and enhancing model efficiency. Extensive experiments and comprehensive evaluations on the VisDrone2019 and DOTA-v1.0 datasets demonstrate the effectiveness of FGFP-YOLOv8s-P2. This achievement provides robust technical support for efficient small target detection by UAVs in complex scenarios.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] A fast feature extraction and matching algorithm for unmanned aerial vehicle images
    Yu H.
    Yang W.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2016, 38 (03): : 509 - 516
  • [22] Assessing fine-grained feature dependencies
    Rodrigues, Iran
    Ribeiro, Marcio
    Medeiros, Flavio
    Borba, Paulo
    Fonseca, Baldoino
    Gheyi, Rohit
    INFORMATION AND SOFTWARE TECHNOLOGY, 2016, 78 : 27 - 52
  • [23] Target Position Compensation Algorithm for Unmanned Aerial Vehicle Radar Image
    Yao, Xue
    Liu, Yu
    Cui, Guolong
    Nie, Xiangfei
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 808 - 811
  • [24] Motion Detection Algorithm for Unmanned Aerial Vehicle Nighttime Surveillance
    Xiao, Huaxin
    Liu, Yu
    Wang, Wei
    Zhang, Maojun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (12): : 3248 - 3251
  • [25] A NOVEL PART FEATURE INTEGRATION AND FUSION METHOD FOR FINE-GRAINED VEHICLE RECOGNITION
    Wang, Ping
    Cao, Yijie
    Lu, Lei
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1990 - 1994
  • [26] A novel fine-grained rumor detection algorithm with attention mechanism
    Zhang, Ke
    Cao, Jianjun
    Pi, Dechang
    NEUROCOMPUTING, 2024, 583
  • [27] Fine-grained facial landmark detection exploiting intermediate feature representations
    Yan, Yongzhe
    Duffner, Stefan
    Phutane, Priyanka
    Berthelier, Anthony
    Naturel, Xavier
    Blanc, Christophe
    Garcia, Christophe
    Chateau, Thierry
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 200 (200)
  • [28] Scene Uyghur Text Detection Based on Fine-Grained Feature Representation
    Wang, Yiwen
    Mamat, Hornisa
    Xu, Xuebin
    Aysa, Alimjan
    Ubul, Kurban
    SENSORS, 2022, 22 (12)
  • [29] Fine-Grained Feature Enhancement for Object Detection in Remote Sensing Images
    Zhou, Yong
    Wang, Sifan
    Zhao, Jiaqi
    Zhu, Hancheng
    Yao, Rui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [30] Target Object Detection from Unmanned Aerial Vehicle (UAV) Images Based on Improved YOLO Algorithm
    Jawaharlalnehru, Arunnehru
    Sambandham, Thalapathiraj
    Sekar, Vaijayanthi
    Ravikumar, Dhanasekar
    Loganathan, Vijayaraja
    Kannadasan, Raju
    Khan, Arfat Ahmad
    Wechtaisong, Chitapong
    Haq, Mohd Anul
    Alhussen, Ahmed
    Alzamil, Zamil S.
    ELECTRONICS, 2022, 11 (15)