YOLO-SA: An Efficient Object Detection Model Based on Self-attention Mechanism

被引:0
|
作者
Li, Ang [1 ]
Song, Xiangyu [2 ]
Sun, ShiJie [1 ]
Zhang, Zhaoyang [1 ]
Cai, Taotao [3 ]
Song, Huansheng [1 ]
机构
[1] Changan Univ, Xian, Peoples R China
[2] Swinburne Univ Technol, Melbourne, Vic, Australia
[3] Macquarie Univ, Sydney, NSW, Australia
来源
关键词
Object detection; CNN architecture; Attention mechanism; Decoupled detection head;
D O I
10.1007/978-981-97-2421-5_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detector based on CNN structure has been widely used in object detection, object classification and other tasks. The traditional CNN module usually adopts complex multi-branch design, which reduces the reasoning speed and memory utilization. Moreover, in many works, attention mechanism is usually added to the object detector to extract rich features in spatial information, which are usually used as additional modules of convolution without fundamental improvement from the limitations of convolution operation. Finally, traditional object detectors often have coupled detection heads, which can compromise model performance. To solve the above problems, we propose a new object detection model, YOLO-SA, based on the current popular object detector model YOLOv5. We introduce a new reparameterized module RepVGG to replace the original DarkNet53 structure of YOLOv5 model, which greatly reduces the complexity of the model and improves the detection accuracy. We introduce a self-attention mechanism module in the feature fusion part of the model, which is independent from other convolutional layers and has higher performance than other mainstream attention mechanism modules. We replace the coupled detection head in YOLOv5 model with an anchor-based decoupled detection head, which greatly improved the convergence speed in the training process. Experiments show that the detection accuracy of the YOLO-SA model proposed by us reaches 71.2% and 75.8% on COCO2014 and VOC2012 dataset respectively, which is superior to the YOLOv5s model as the baseline and other mainstream object detection models, showing certain superiority.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Object Detection Algorithm Based on Context Information and Self-Attention Mechanism
    Liang, Hong
    Zhou, Hui
    Zhang, Qian
    Wu, Ting
    [J]. SYMMETRY-BASEL, 2022, 14 (05):
  • [2] SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving
    Tian, Daxin
    Lin, Chunmian
    Zhou, Jianshan
    Duan, Xuting
    Cao, Yue
    Zhao, Dezong
    Cao, Dongpu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4099 - 4110
  • [3] Small Object Detection in Remote Sensing Images Based on Window Self-Attention Mechanism
    Xu, Jiaxin
    Zhang, Qiao
    Liu, Yu
    Zheng, Mengting
    [J]. PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2023, 89 (08): : 489 - 497
  • [4] Lung Nodule Detection Based on Spike-Driven Self-Attention YOLO
    Wei, Xiaoqing
    Lv, Yuchao
    Wang, Hui
    Yang, Peiyin
    Dong, Zheng
    Liu, Ju
    Wu, Qiang
    [J]. ADVANCES IN SWARM INTELLIGENCE, PT II, ICSI 2024, 2024, 14789 : 187 - 196
  • [5] YOLO-DAW: Object detection model based on dual attention mechanism within windows
    Yin Z.
    Shao J.
    Zhang N.
    [J]. Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2023, 53 (04): : 718 - 724
  • [6] Object Detection Model Based on Scene-Level Region Proposal Self-Attention
    Quan, Yu
    Li, Zhixin
    Zhang, Canlong
    Ma, Huifang
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 954 - 961
  • [7] A Self-Attention Mechanism-Based Model for Early Detection of Fake News
    Jamshidi, Bahman
    Hakak, Saqib
    Lu, Rongxing
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5241 - 5252
  • [8] Rethinking Self-Attention for Multispectral Object Detection
    Hu, Sijie
    Bonardi, Fabien
    Bouchafa, Samia
    Prendinger, Helmut
    Sidibe, Desire
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 16300 - 16311
  • [9] Adaptive Sparse Self-attention for Object Detection
    Xu, Mingjie
    Song, Yonghong
    Xie, Kangkang
    Guo, Pengcheng
    Mu, Jiaxi
    Liu, Wen
    Wang, Zhengyang
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] SCVD-SA: A Smart Contract Vulnerability Detection Method based on Hybrid Deep Learning Model and Self-attention Mechanism
    Wang, Dongjie
    Chen, Jinfu
    Cai, Saihua
    Feng, Qiaowei
    Chen, Yuhao
    Hu, Xinyi
    [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING-COMPANION, SANER-C 2024, 2024, : 207 - 214