Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation

被引:2
|
作者
Chen, Qingjun [1 ]
Zheng, Hua [1 ,2 ,3 ,4 ]
Pan, Hao [1 ]
Liao, Xiaoqi [1 ]
Wang, Hongkai [1 ]
机构
[1] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350108, Peoples R China
[2] Fujian Normal Univ, Key Lab Optoelect Sci & Technol Med, Minist Educ, Fuzhou 350108, Peoples R China
[3] Fujian Normal Univ, Fujian Prov Key Lab Photon Technol, Fuzhou 350108, Peoples R China
[4] Fujian Prov Engn Res Ctr Optoelect Sensors & Inte, Fuzhou 350108, Peoples R China
关键词
Siamese network; visual object tracking; anchor-free regression strategy; multi-scale channel attention fusion; multi-scale depth-wise cross correlation; TRACKING;
D O I
10.1117/12.2680160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research takes the feature extraction network and depth-wise cross correlation learning method of the Siamese network as the starting point. Firstly, the regression strategy of the proposed framework is anchor-free, and the residual network ResNet50 is chosen as the backbone network, and add the channel attention mechanism SENet. The SE-MSCAM multi-scale channel attention model is proposed to make up for the lack of local feature extraction ability of the feature extraction network on the basis of SENet. On this basis, the attention fusion module AFFN is added to enhance the soft selection of attention. Combined with the SE-MSCAM multi-scale attention model and the attention fusion module AFFN, the ResNet50-AFFN multi-scale channel attention fusion network is proposed. Secondly, regarding the limitation of single-scale learning of SiamRPN++ depth-wise cross correlation, the MS-DWXCorr multi-scale depth-wise cross correlation is proposed which increases the diversity of learning feature scales to improve the efficiency of tracking network similarity learning. The experimental results show that, on the VOT2018 benchmark, the EAO of our method outperforms 4.0% of the mainstream algorithm SiamCAR, the tracking accuracy is improved by 3.4% and the tracking speed of our method maintains 40 FPS; the tracking success rate is improved by 2.0% and the tracking accuracy rate is improved by 3.2% compared to the mainstream algorithm SiamCAR. It has higher accuracy and robustness in dealing with occlusion, deformation, illumination variation, deformation, and other scenarios of visual tracking, and has better tracking performance.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [21] Multi-Scale Cross-Attention Fusion Network Based on Image Super-Resolution
    Ma, Yimin
    Xu, Yi
    Liu, Yunqing
    Yan, Fei
    Zhang, Qiong
    Li, Qi
    Liu, Quanyang
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [22] Monocular Image Depth Estimation Based on Multi-Scale Attention Oriented Network
    Liu J.
    Wen J.
    Liang Y.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (12): : 52 - 62
  • [23] Multi-scale siamese networks for multi-focus image fusion
    Wu, Pan
    Hua, Zhen
    Li, Jinjiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 15651 - 15672
  • [24] A multi-scale patch-wise algorithm for multi-exposure image fusion
    Jia, Jinquan
    Sun, Jian
    Zhu, Zhiqin
    OPTIK, 2021, 248
  • [25] Multi-scale siamese networks for multi-focus image fusion
    Pan Wu
    Zhen Hua
    Jinjiang Li
    Multimedia Tools and Applications, 2023, 82 : 15651 - 15672
  • [26] A Channel Multi-Scale Fusion Network for Scene Depth Map Super-Resolution
    Yongwei M.
    Xinjie Z.
    Hanshi R.
    Jiajing Z.
    Shusen S.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (01): : 37 - 47
  • [27] Multi-Scale Aggregation Residual Channel Attention Fusion Network for Single Image Deraining
    Wang, Jyun-Guo
    Wu, Cheng-Shiuan
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [28] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [29] Person Re-identification Based on Multi-scale Network Attention Fusion
    Wang Fenhua
    Zhao Bo
    Huang Chao
    Yan Youqi
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (12) : 3045 - 3052
  • [30] Monocular Depth Estimation Based on Multi-Scale Depth Map Fusion
    Yang, Xin
    Chang, Qingling
    Liu, Xinglin
    He, Siyuan
    Cui, Yan
    IEEE ACCESS, 2021, 9 : 67696 - 67705