CPA-YOLOv7: Contextual and pyramid attention-based improvement of YOLOv7 for drones scene target detection

Cited by: 5

Authors:

Shi, Houwang [1]

Yang, Wenzhong [1]

Chen, Danni [1]

Wang, Min [1]

Affiliations:

[1] Xinjiang Univ, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830046, Xinjiang, Peoples R China

Source:

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2023 / Vol. 97

Funding:

National Natural Science Foundation of China;

Keywords:

Deep learning; Small target detection; Multi-scale feature fusion; Attention mechanism; Unmanned aerial vehicle view small object; Loss function;

DOI:

10.1016/j.jvcir.2023.103965

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Target detection in unmanned aerial vehicle (UAV) scenarios faces particular challenges, such as densely packed targets. Existing UAV target detection models have high computational complexity, which makes real-time detection difficult, and their accuracy on small targets is low. To address these problems, we propose CPA-YOLOv7, an improved YOLOv7 small target detection model based on contextual and pyramid attention that can cope with dense UAV scenes. First, the model embeds our proposed lightweight multi-scale attentional feature spatial pyramid pooling module, which better distinguishes small-target features from large-target features, reducing computation while improving detection accuracy. Second, we design a contextual dynamic fusion attention module that fuses global and local contextual information and dynamically assigns features to multiple groups of channels; during multi-scale fusion, it strengthens the representation of small-target features and helps the network focus on small-target information. Finally, we use an improved Wise Intersection-over-Union (WIoU) loss as the regression loss function: a moderation factor retains part of the weight of high- and low-quality samples to improve the regression accuracy of high-quality anchor boxes, and the dynamic non-monotonic focusing mechanism increases the model's focus on ordinary-quality anchor boxes, improving localization performance and robustness to low-quality samples. Experiments on the UAV datasets VisDrone2021-DET and AI-TOD show that the mAP of our model is 2.3% and 1.1% higher, respectively, than that of YOLOv7, while introducing fewer parameters, and that the computational speed reaches 146 frames per second (FPS), which meets the real-time requirements of UAV detection.
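
As a rough illustration of the regression-loss idea mentioned in the abstract, the sketch below computes a plain IoU loss and reweights it with a non-monotonic focusing gain of the form r = beta / (delta * alpha^(beta - delta)), as published for the original Wise-IoU loss. This is not the authors' implementation: the moderation factor described in the abstract is not specified in this record and is omitted, the distance-based attention term of Wise-IoU is left out, and the batch mean stands in for the running mean used in practice. The function names and the defaults `alpha=1.9`, `delta=3.0` are assumptions chosen for illustration.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def focusing_gain(beta, alpha=1.9, delta=3.0):
    """Non-monotonic gain: small for very low and very high outlier degree beta."""
    return beta / (delta * alpha ** (beta - delta))


def weighted_iou_losses(preds, targets, alpha=1.9, delta=3.0):
    """Per-box IoU losses scaled by the focusing gain (simplified sketch)."""
    losses = [1.0 - iou(p, t) for p, t in zip(preds, targets)]
    mean_loss = sum(losses) / len(losses)
    if mean_loss == 0.0:
        return losses  # every prediction is perfect, nothing to reweight
    # Outlier degree: each loss relative to the batch mean (assumption:
    # a running mean with momentum would be used during real training).
    betas = [loss / mean_loss for loss in losses]
    return [focusing_gain(b, alpha, delta) * loss
            for b, loss in zip(betas, losses)]
```

With these defaults the gain peaks at a moderate outlier degree, so boxes of ordinary quality receive the largest weight while very good and very poor boxes are down-weighted.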

Pages: 12

50 items in total

[1] YOLOv7-SN: Underwater Target Detection Algorithm Based on Improved YOLOv7
Zhao, Ming
Zhou, Huibo
Li, Xue
SYMMETRY-BASEL, 2024, 16 (05)
[2] MCA-YOLOv7: An Improved UAV Target Detection Algorithm Based on YOLOv7
Qin, Zhiyong
Chen, Dike
Wang, Hongyuan
IEEE ACCESS, 2024, 12 : 42642 - 42650
[3] An Underwater Target Detection Algorithm Based on Attention Mechanism and Improved YOLOv7
Ren, Liqiu
Li, Zhanying
He, Xueyu
Kong, Lingyan
Zhang, Yinghao
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02) : 2829 - 2845
[4] Underwater Target Detection Based on Improved YOLOv7
Liu, Kaiyue
Sun, Qi
Sun, Daming
Peng, Lin
Yang, Mengduo
Wang, Nizhuan
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
[5] Underwater Target Detection Based on Improved YOLOv7
Fu, Junshang
Tian, Ying
IAENG International Journal of Computer Science, 2024, 51 (04) : 422 - 429
[6] Crowded Scene PPE Detection Using Attention Based YOLOv7 and Alpha pose
Areerob, Punyapat
Matangkasombut, Tanawat
Monnikhof, Krishadawut Olde
Kumwilaisak, Wuttipong
2024 21ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, ECTI-CON 2024, 2024
[7] NAM-YOLOV7: An Improved YOLOv7 Based on Attention Model for Animal Death Detection
Sirisha, Uddagiri
Chandana, Bolem Sai
Harikiran, Jonnadula
TRAITEMENT DU SIGNAL, 2023, 40 (02) : 783 - 789
[8] Sign-YOLO: Traffic Sign Detection Using Attention-Based YOLOv7
Mahadshetti, Ruturaj
Kim, Jinsul
Um, Tai-Won
IEEE ACCESS, 2024, 12 : 132689 - 132700
[9] Automated Detection of Gastric Lesions in Endoscopic Images by Leveraging Attention-Based YOLOv7
Ahmad, Sheeraz
Kim, Jae-Seoung
Park, Dong Kyun
Whangbo, Taegkeun
IEEE ACCESS, 2023, 11 : 87166 - 87177
[10] Night target detection algorithm based on improved YOLOv7
Bowen, Zheng
Huacai, Lu
Shengbo, Zhu
Xinqiang, Chen
Hongwei, Xing
SCIENTIFIC REPORTS, 2024, 14 (01)