CPA-YOLOv7: Contextual and pyramid attention-based improvement of YOLOv7 for drones scene target detection

被引:5
|
作者
Shi, Houwang [1 ]
Yang, Wenzhong [1 ]
Chen, Danni [1 ]
Wang, Min [1 ]
机构
[1] Xinjiang Univ, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830046, Xinjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Small target detection; Multi-scale feature fusion; Attention mechanism; Unmanned aerial vehicle view small object; Loss function;
D O I
10.1016/j.jvcir.2023.103965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Target detection in unmanned aerial vehicle application scenarios has other problems, such as dense targets. The existing unmanned aerial vehicle target detection model with high computational complexity makes it difficult to meet real-time unmanned aerial vehicle target detection, and the detection accuracy of small targets is low. To address these problems, we propose an improved YOLOv7 small target detection model based on context and pyramidal attention that can cope with dense unmanned aerial vehicle scenarios CPA-YOLOv7. This model embeds our proposed lightweight multi-scale attentional feature spatial pyramid pooling module, which can better distinguish between small and large target features, reducing the computational effort while improving the detection accuracy of the model. Secondly, we design a contextual dynamic fusion attention module in the network to fuse global and local contextual information and dynamically assign features to multiple groups of channels; in the multi-scale fusion process, it effectively increases the characterization ability of small target features and enables the network to better focus on small target information. Finally, we improve Wise Intersection-over-Union loss as the regression loss function, add a moderation factor to retain some of the high and low-quality sample weights to improve the regression accuracy of high-quality anchor frames, and use the dynamic non-monotonic focusing mechanism to increase the model's focus on ordinary quality anchor frames to improve the model's localization performance and robustness to low-quality samples. Numerous experimental results show that on the unmanned aerial vehicle datasets VisDrone2021-DET and AI-TOD, the mAP values of our model are 2.3% and 1.1% higher than those of the YOLOv7 model with fewer parameters introduced, and the computational speed reaches 146 frames per second (FPS), which can meet the real-time requirements of unmanned aerial vehicle detection.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] YOLOv7-SN: Underwater Target Detection Algorithm Based on Improved YOLOv7
    Zhao, Ming
    Zhou, Huibo
    Li, Xue
    SYMMETRY-BASEL, 2024, 16 (05):
  • [2] MCA-YOLOv7: An Improved UAV Target Detection Algorithm Based on YOLOv7
    Qin, Zhiyong
    Chen, Dike
    Wang, Hongyuan
    IEEE ACCESS, 2024, 12 : 42642 - 42650
  • [3] An Underwater Target Detection Algorithm Based on Attention Mechanism and Improved YOLOv7
    Ren, Liqiu
    Li, Zhanying
    He, Xueyu
    Kong, Lingyan
    Zhang, Yinghao
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02): : 2829 - 2845
  • [4] Underwater Target Detection Based on Improved YOLOv7
    Liu, Kaiyue
    Sun, Qi
    Sun, Daming
    Peng, Lin
    Yang, Mengduo
    Wang, Nizhuan
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
  • [5] Underwater Target Detection Based on Improved YOLOv7
    Fu, Junshang
    Tian, Ying
    IAENG International Journal of Computer Science, 2024, 51 (04) : 422 - 429
  • [6] Crowded Scene PPE Detection Using Attention Based YOLOv7 and Alpha pose
    Areerob, Punyapat
    Matangkasombut, Tanawat
    Monnikhof, Krishadawut Olde
    Kumwilaisak, Wuttipong
    2024 21ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, ECTI-CON 2024, 2024,
  • [7] NAM-YOLOV7: An Improved YOLOv7 Based on Attention Model for Animal Death Detection
    Sirisha, Uddagiri
    Chandana, Bolem Sai
    Harikiran, Jonnadula
    TRAITEMENT DU SIGNAL, 2023, 40 (02) : 783 - 789
  • [8] Sign-YOLO: Traffic Sign Detection Using Attention-Based YOLOv7
    Mahadshetti, Ruturaj
    Kim, Jinsul
    Um, Tai-Won
    IEEE ACCESS, 2024, 12 : 132689 - 132700
  • [9] Automated Detection of Gastric Lesions in Endoscopic Images by Leveraging Attention-Based YOLOv7
    Ahmad, Sheeraz
    Kim, Jae-Seoung
    Park, Dong Kyun
    Whangbo, Taegkeun
    IEEE ACCESS, 2023, 11 : 87166 - 87177
  • [10] Night target detection algorithm based on improved YOLOv7
    Bowen, Zheng
    Huacai, Lu
    Shengbo, Zhu
    Xinqiang, Chen
    Hongwei, Xing
    SCIENTIFIC REPORTS, 2024, 14 (01):