YOLO-SA: An Efficient Object Detection Model Based on Self-attention Mechanism

Cited by: 0

Authors:

Li, Ang [1]

Song, Xiangyu [2]

Sun, ShiJie [1]

Zhang, Zhaoyang [1]

Cai, Taotao [3]

Song, Huansheng [1]

Affiliations:

[1] Changan Univ, Xian, Peoples R China

[2] Swinburne Univ Technol, Melbourne, Vic, Australia

[3] Macquarie Univ, Sydney, NSW, Australia

Source:

WEB AND BIG DATA, PT IV, APWEB-WAIM 2023 | 2024 / Vol. 14334

Keywords:

Object detection; CNN architecture; Attention mechanism; Decoupled detection head;

DOI:

10.1007/978-981-97-2421-5_1

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

CNN-based object detectors are widely used for object detection, object classification, and related tasks. Traditional CNN modules usually adopt a complex multi-branch design, which reduces inference speed and memory utilization. Moreover, many works add an attention mechanism to the object detector to extract rich spatial features, but these mechanisms usually serve as add-on modules to convolution and do not fundamentally address the limitations of the convolution operation. Finally, traditional object detectors often use coupled detection heads, which can compromise model performance. To solve these problems, we propose YOLO-SA, a new object detection model based on the popular YOLOv5 detector. We introduce the reparameterized module RepVGG to replace the original DarkNet53 structure of YOLOv5, which greatly reduces model complexity and improves detection accuracy. We add a self-attention module to the feature fusion part of the model; it is independent of the other convolutional layers and outperforms other mainstream attention modules. We replace the coupled detection head of YOLOv5 with an anchor-based decoupled detection head, which greatly improves convergence speed during training. Experiments show that YOLO-SA reaches a detection accuracy of 71.2% and 75.8% on the COCO2014 and VOC2012 datasets, respectively, outperforming the YOLOv5s baseline and other mainstream object detection models.
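
The abstract credits the reparameterized RepVGG module with reducing model complexity. As a rough illustration of the general structural re-parameterization idea (not the authors' implementation), the sketch below merges a 3x3 branch, a 1x1 branch, and an identity branch into a single 3x3 kernel and checks that both forms give the same output. The helper names conv2d and fuse_branches are hypothetical, and the sketch assumes equal input and output channel counts, stride 1, and no batch normalization or bias, all of which a real RepVGG block would have to handle.

```python
import numpy as np


def conv2d(x, w, pad):
    """Naive stride-1 cross-correlation.

    x: (C_in, H, W) feature map; w: (C_out, C_in, k, k) kernel; pad: zero padding.
    """
    c_out, c_in, k, _ = w.shape
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
    h = xp.shape[1] - k + 1
    wd = xp.shape[2] - k + 1
    out = np.zeros((c_out, h, wd))
    for i in range(k):
        for j in range(k):
            # Add the contribution of kernel tap (i, j) over all channels.
            out += np.einsum("oc,chw->ohw", w[:, :, i, j], xp[:, i:i + h, j:j + wd])
    return out


def fuse_branches(w3, w1):
    """Fold a 1x1 branch and an identity branch into a 3x3 kernel.

    Assumption for this sketch: C_in == C_out, no batch norm, no bias.
    """
    c_out, c_in = w3.shape[:2]
    assert c_out == c_in, "the identity branch needs matching channel counts"
    fused = w3.copy()
    # A 1x1 kernel is a 3x3 kernel that is zero everywhere except the centre.
    fused[:, :, 1, 1] += w1[:, :, 0, 0]
    # The identity map is a 1x1 kernel equal to the identity matrix.
    idx = np.arange(c_out)
    fused[idx, idx, 1, 1] += 1.0
    return fused


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8, 8))
w3 = rng.standard_normal((4, 4, 3, 3))
w1 = rng.standard_normal((4, 4, 1, 1))

# Training-time form: three parallel branches summed together.
multi_branch = conv2d(x, w3, pad=1) + conv2d(x, w1, pad=0) + x
# Inference-time form: one 3x3 convolution with the fused kernel.
single_branch = conv2d(x, fuse_branches(w3, w1), pad=1)

print(np.allclose(multi_branch, single_branch))
```

Because convolution is linear, the three branches collapse into one kernel without changing the output, which is why a multi-branch block can be trained and then deployed as a plain single-path layer.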

Pages: 1-15

Page count: 15

50 items in total

[1] Object Detection Algorithm Based on Context Information and Self-Attention Mechanism
Liang, Hong
Zhou, Hui
Zhang, Qian
Wu, Ting
[J]. SYMMETRY-BASEL, 2022, 14 (05)
[2] SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving
Tian, Daxin
Lin, Chunmian
Zhou, Jianshan
Duan, Xuting
Cao, Yue
Zhao, Dezong
Cao, Dongpu
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4099 - 4110
[3] Small Object Detection in Remote Sensing Images Based on Window Self-Attention Mechanism
Xu, Jiaxin
Zhang, Qiao
Liu, Yu
Zheng, Mengting
[J]. PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2023, 89 (08) : 489 - 497
[4] Lung Nodule Detection Based on Spike-Driven Self-Attention YOLO
Wei, Xiaoqing
Lv, Yuchao
Wang, Hui
Yang, Peiyin
Dong, Zheng
Liu, Ju
Wu, Qiang
[J]. ADVANCES IN SWARM INTELLIGENCE, PT II, ICSI 2024, 2024, 14789 : 187 - 196
[5] YOLO-DAW: Object detection model based on dual attention mechanism within windows
Yin Z.
Shao J.
Zhang N.
[J]. Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2023, 53 (04) : 718 - 724
[6] Object Detection Model Based on Scene-Level Region Proposal Self-Attention
Quan, Yu
Li, Zhixin
Zhang, Canlong
Ma, Huifang
[J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021 : 954 - 961
[7] A Self-Attention Mechanism-Based Model for Early Detection of Fake News
Jamshidi, Bahman
Hakak, Saqib
Lu, Rongxing
[J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5241 - 5252
[8] Rethinking Self-Attention for Multispectral Object Detection
Hu, Sijie
Bonardi, Fabien
Bouchafa, Samia
Prendinger, Helmut
Sidibe, Desire
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024 : 16300 - 16311
[9] Adaptive Sparse Self-attention for Object Detection
Xu, Mingjie
Song, Yonghong
Xie, Kangkang
Guo, Pengcheng
Mu, Jiaxi
Liu, Wen
Wang, Zhengyang
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
[10] SCVD-SA: A Smart Contract Vulnerability Detection Method based on Hybrid Deep Learning Model and Self-attention Mechanism
Wang, Dongjie
Chen, Jinfu
Cai, Saihua
Feng, Qiaowei
Chen, Yuhao
Hu, Xinyi
[J]. 2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING-COMPANION, SANER-C 2024, 2024 : 207 - 214
