Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm

被引：3

作者：

Liu, Shi ^{[1
]}

Zhu, Meng ^{[2
]}

Tao, Rui ^{[1
,3
]}

Ren, Honge ^{[1
,4
]}

机构：

[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China

[2] Harbin Univ, Coll Informat Engn, Harbin 150086, Peoples R China

[3] Hulunbuir Univ, Coll Artificial Intelligence & Big Data, Hulunbuir 021008, Peoples R China

[4] Heilongjiang Forestry Intelligent Equipment Engn R, Harbin 150040, Peoples R China

来源：

DRONES | 2024年 / 8卷 / 05期

关键词：

unmanned aerial vehicle; small object detection; Fine-Grained Feature; YOLOv8; NETWORK;

D O I：

10.3390/drones8050181

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Unmanned aerial vehicle (UAV) aerial images often present challenges such as small target sizes, high target density, varied shooting angles, and dynamic poses. Existing target detection algorithms exhibit a noticeable performance decline when confronted with UAV aerial images compared to general scenes. This paper proposes an outstanding small target detection algorithm for UAVs, named Fine-Grained Feature Perception YOLOv8s-P2 (FGFP-YOLOv8s-P2), based on YOLOv8s-P2 architecture. We specialize in improving inspection accuracy while meeting real-time inspection requirements. First, we enhance the targets' pixel information by utilizing slice-assisted training and inference techniques, thereby reducing missed detections. Then, we propose a feature extraction module with deformable convolutions. Decoupling the learning process of offset and modulation scalar enables better adaptation to variations in the size and shape of diverse targets. In addition, we introduce a large kernel spatial pyramid pooling module. By cascading convolutions, we leverage the advantages of large kernels to flexibly adjust the model's attention to various regions of high-level feature maps, better adapting to complex visual scenes and circumventing the cost drawbacks associated with large kernels. To match the excellent real-time detection performance of the baseline model, we propose an improved Random FasterNet Block. This block introduces randomness during convolution and captures spatial features of non-linear transformation channels, enriching feature representations and enhancing model efficiency. Extensive experiments and comprehensive evaluations on the VisDrone2019 and DOTA-v1.0 datasets demonstrate the effectiveness of FGFP-YOLOv8s-P2. This achievement provides robust technical support for efficient small target detection by UAVs in complex scenarios.

引用

页数：22

共 50 条

[21] A fast feature extraction and matching algorithm for unmanned aerial vehicle images
Yu H.
Yang W.
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2016, 38 (03): : 509 - 516
[22] Assessing fine-grained feature dependencies
Rodrigues, Iran
Ribeiro, Marcio
Medeiros, Flavio
Borba, Paulo
Fonseca, Baldoino
Gheyi, Rohit
INFORMATION AND SOFTWARE TECHNOLOGY, 2016, 78 : 27 - 52
[23] Target Position Compensation Algorithm for Unmanned Aerial Vehicle Radar Image
Yao, Xue
Liu, Yu
Cui, Guolong
Nie, Xiangfei
2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 808 - 811
[24] Motion Detection Algorithm for Unmanned Aerial Vehicle Nighttime Surveillance
Xiao, Huaxin
Liu, Yu
Wang, Wei
Zhang, Maojun
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (12): : 3248 - 3251
[25] A NOVEL PART FEATURE INTEGRATION AND FUSION METHOD FOR FINE-GRAINED VEHICLE RECOGNITION
Wang, Ping
Cao, Yijie
Lu, Lei
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1990 - 1994
[26] A novel fine-grained rumor detection algorithm with attention mechanism
Zhang, Ke
Cao, Jianjun
Pi, Dechang
NEUROCOMPUTING, 2024, 583
[27] Fine-grained facial landmark detection exploiting intermediate feature representations
Yan, Yongzhe
Duffner, Stefan
Phutane, Priyanka
Berthelier, Anthony
Naturel, Xavier
Blanc, Christophe
Garcia, Christophe
Chateau, Thierry
COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 200 (200)
[28] Scene Uyghur Text Detection Based on Fine-Grained Feature Representation
Wang, Yiwen
Mamat, Hornisa
Xu, Xuebin
Aysa, Alimjan
Ubul, Kurban
SENSORS, 2022, 22 (12)
[29] Fine-Grained Feature Enhancement for Object Detection in Remote Sensing Images
Zhou, Yong
Wang, Sifan
Zhao, Jiaqi
Zhu, Hancheng
Yao, Rui
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[30] Target Object Detection from Unmanned Aerial Vehicle (UAV) Images Based on Improved YOLO Algorithm
Jawaharlalnehru, Arunnehru
Sambandham, Thalapathiraj
Sekar, Vaijayanthi
Ravikumar, Dhanasekar
Loganathan, Vijayaraja
Kannadasan, Raju
Khan, Arfat Ahmad
Wechtaisong, Chitapong
Haq, Mohd Anul
Alhussen, Ahmed
Alzamil, Zamil S.
ELECTRONICS, 2022, 11 (15)

← 1 2 3 4 5 →