Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation

被引:2
|
作者
Chen, Qingjun [1 ]
Zheng, Hua [1 ,2 ,3 ,4 ]
Pan, Hao [1 ]
Liao, Xiaoqi [1 ]
Wang, Hongkai [1 ]
机构
[1] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350108, Peoples R China
[2] Fujian Normal Univ, Key Lab Optoelect Sci & Technol Med, Minist Educ, Fuzhou 350108, Peoples R China
[3] Fujian Normal Univ, Fujian Prov Key Lab Photon Technol, Fuzhou 350108, Peoples R China
[4] Fujian Prov Engn Res Ctr Optoelect Sensors & Inte, Fuzhou 350108, Peoples R China
关键词
Siamese network; visual object tracking; anchor-free regression strategy; multi-scale channel attention fusion; multi-scale depth-wise cross correlation; TRACKING;
D O I
10.1117/12.2680160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research takes the feature extraction network and depth-wise cross correlation learning method of the Siamese network as the starting point. Firstly, the regression strategy of the proposed framework is anchor-free, and the residual network ResNet50 is chosen as the backbone network, and add the channel attention mechanism SENet. The SE-MSCAM multi-scale channel attention model is proposed to make up for the lack of local feature extraction ability of the feature extraction network on the basis of SENet. On this basis, the attention fusion module AFFN is added to enhance the soft selection of attention. Combined with the SE-MSCAM multi-scale attention model and the attention fusion module AFFN, the ResNet50-AFFN multi-scale channel attention fusion network is proposed. Secondly, regarding the limitation of single-scale learning of SiamRPN++ depth-wise cross correlation, the MS-DWXCorr multi-scale depth-wise cross correlation is proposed which increases the diversity of learning feature scales to improve the efficiency of tracking network similarity learning. The experimental results show that, on the VOT2018 benchmark, the EAO of our method outperforms 4.0% of the mainstream algorithm SiamCAR, the tracking accuracy is improved by 3.4% and the tracking speed of our method maintains 40 FPS; the tracking success rate is improved by 2.0% and the tracking accuracy rate is improved by 3.2% compared to the mainstream algorithm SiamCAR. It has higher accuracy and robustness in dealing with occlusion, deformation, illumination variation, deformation, and other scenarios of visual tracking, and has better tracking performance.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [1] Siamese Network with Channel-wise Attention and Multi-scale Fusion for Robust Object Tracking
    Tang, Eryong
    Wang, Yusheng
    Liu, Ye
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6515 - 6520
  • [2] Object Tracking Algorithm for Multi-Scale Channel Attention and Siamese Network
    Wang, Shuxian
    Ge, Haibo
    Li, Wenhao
    Computer Engineering and Applications, 2023, 59 (14) : 142 - 150
  • [3] Multi-scale Refocusing Attention Siamese Network
    Liu, Guoqiang
    Chen, Zhe
    Shen, Guangze
    2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024, : 42 - 46
  • [4] Siamese Network Tracker Based on Multi-Scale Feature Fusion
    Zhao, Jiaxu
    Niu, Dapeng
    SYSTEMS, 2023, 11 (08):
  • [5] Binocular Depth Estimation Algorithm Based on Multi-Scale Attention Feature Fusion
    Yang Huitong
    Lei Lang
    Lin Yongchun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [6] Image fusion algorithm based on multi-scale detail siamese convolutional neural network
    Liu Bo
    Han Guang-liang
    Luo Hui-yuan
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (09) : 1283 - 1293
  • [7] Siamese Network with Multi-scale Feature Fusion and Dual Attention Mechanism for Template Matching
    Zhao, Kai
    He, Binbing
    Pan, Shiju
    Zhu, Yuan
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6588 - 6592
  • [8] Multi-Scale Bilateral Attention Fusion Network For Pansharpening
    Guo Z.
    Li J.
    Lei J.
    Liu J.
    Zhou S.
    Wang B.
    Kasabov N.K.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 1 - 15
  • [9] A Multi-Scale Channel Attention Network for Prostate Segmentation
    Ding, Meiwen
    Lin, Zhiping
    Lee, Chau Hung
    Tan, Cher Heng
    Huang, Weimin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (05) : 1754 - 1758
  • [10] Multi-focused image fusion algorithm based on multi-scale hybrid attention residual network
    Liu, Tingting
    Chen, Mingju
    Duan, Zhengxu
    Cui, Anle
    PLOS ONE, 2024, 19 (05):