CPA-YOLOv7: Contextual and pyramid attention-based improvement of YOLOv7 for drones scene target detection

被引:5
|
作者
Shi, Houwang [1 ]
Yang, Wenzhong [1 ]
Chen, Danni [1 ]
Wang, Min [1 ]
机构
[1] Xinjiang Univ, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830046, Xinjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Small target detection; Multi-scale feature fusion; Attention mechanism; Unmanned aerial vehicle view small object; Loss function;
D O I
10.1016/j.jvcir.2023.103965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Target detection in unmanned aerial vehicle application scenarios has other problems, such as dense targets. The existing unmanned aerial vehicle target detection model with high computational complexity makes it difficult to meet real-time unmanned aerial vehicle target detection, and the detection accuracy of small targets is low. To address these problems, we propose an improved YOLOv7 small target detection model based on context and pyramidal attention that can cope with dense unmanned aerial vehicle scenarios CPA-YOLOv7. This model embeds our proposed lightweight multi-scale attentional feature spatial pyramid pooling module, which can better distinguish between small and large target features, reducing the computational effort while improving the detection accuracy of the model. Secondly, we design a contextual dynamic fusion attention module in the network to fuse global and local contextual information and dynamically assign features to multiple groups of channels; in the multi-scale fusion process, it effectively increases the characterization ability of small target features and enables the network to better focus on small target information. Finally, we improve Wise Intersection-over-Union loss as the regression loss function, add a moderation factor to retain some of the high and low-quality sample weights to improve the regression accuracy of high-quality anchor frames, and use the dynamic non-monotonic focusing mechanism to increase the model's focus on ordinary quality anchor frames to improve the model's localization performance and robustness to low-quality samples. Numerous experimental results show that on the unmanned aerial vehicle datasets VisDrone2021-DET and AI-TOD, the mAP values of our model are 2.3% and 1.1% higher than those of the YOLOv7 model with fewer parameters introduced, and the computational speed reaches 146 frames per second (FPS), which can meet the real-time requirements of unmanned aerial vehicle detection.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Improved remote sensing image target detection based on YOLOv7
    XU Shuanglong
    CHEN Zhihong
    ZHANG Haiwei
    XUE Lifang
    SU Huijun
    Optoelectronics Letters, 2024, 20 (04) : 234 - 242
  • [22] Improved remote sensing image target detection based on YOLOv7
    Shuanglong Xu
    Zhihong Chen
    Haiwei Zhang
    Lifang Xue
    Huijun Su
    Optoelectronics Letters, 2024, 20 : 234 - 242
  • [23] Improved YOLOv7 based apple target detection in complex environment
    Mo, Henghui
    Wei, Linjing
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (12): : 2447 - 2458
  • [24] Novel Personal Protective Equipment Detection Technique with Attention-based YOLOv7 and Human Pose Estimation
    Monnikhof, Krishadawut Olde
    Areerob, Punyapat
    Wu, Zheng
    Tanasnitikul, Takrit
    Kumwilaisak, Wuttipong
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (01)
  • [25] Small target flame detection algorithm based on improved YOLOv7
    Niu, Shaoshan
    Zhu, Yun
    Wang, Jianyu
    Xu, Zhengxing
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [26] Improved remote sensing image target detection based on YOLOv7
    Xu, Shuanglong
    Chen, Zhihong
    Zhang, Haiwei
    Xue, Lifang
    Su, Huijun
    OPTOELECTRONICS LETTERS, 2024, 20 (04) : 234 - 242
  • [27] PBA-YOLOv7: An Object Detection Method Based on an Improved YOLOv7 Network
    Sun, Yang
    Li, Yi
    Li, Song
    Duan, Zehao
    Ning, Haonan
    Zhang, Yuhang
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [28] Improved Cherry Detection Method at Night Based on YOLOv7: YOLOv7-Cherry
    Gai, Rongli
    Kong, Xiangzhou
    Qin, Shan
    Wei, Kai
    Computer Engineering and Applications, 2024, 60 (21) : 315 - 323
  • [29] YOLOv7-PSAFP: Crop pest and disease detection based on improved YOLOv7
    Du, Lujia
    Zhu, Junlong
    Liu, Muhua
    Wang, Lin
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [30] CEAM-YOLOv7: Improved YOLOv7 Based on Channel Expansion and Attention Mechanism for Driver Distraction Behavior Detection
    Liu, Shugang
    Wang, Yujie
    Yu, Qiangguo
    Liu, Hongli
    Peng, Zhan
    IEEE ACCESS, 2022, 10 : 129116 - 129124