Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation

Cited by: 2
Authors
Chen, Qingjun [1]
Zheng, Hua [1, 2, 3, 4]
Pan, Hao [1]
Liao, Xiaoqi [1]
Wang, Hongkai [1]
Affiliations
[1] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350108, Peoples R China
[2] Fujian Normal Univ, Key Lab Optoelect Sci & Technol Med, Minist Educ, Fuzhou 350108, Peoples R China
[3] Fujian Normal Univ, Fujian Prov Key Lab Photon Technol, Fuzhou 350108, Peoples R China
[4] Fujian Prov Engn Res Ctr Optoelect Sensors & Inte, Fuzhou 350108, Peoples R China
Keywords
Siamese network; visual object tracking; anchor-free regression strategy; multi-scale channel attention fusion; multi-scale depth-wise cross correlation; TRACKING;
DOI
10.1117/12.2680160
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This research takes the feature extraction network and the depth-wise cross correlation learning method of the Siamese network as its starting point. First, the proposed framework uses an anchor-free regression strategy, with the residual network ResNet50 as the backbone and the channel attention mechanism SENet added to it. Building on SENet, the SE-MSCAM multi-scale channel attention model is proposed to compensate for the feature extraction network's weak local feature extraction. The attention fusion module AFFN is then added to enhance the soft selection of attention. Combining the SE-MSCAM multi-scale attention model with the AFFN attention fusion module yields the proposed ResNet50-AFFN multi-scale channel attention fusion network. Second, to address the single-scale learning limitation of the depth-wise cross correlation in SiamRPN++, the MS-DWXCorr multi-scale depth-wise cross correlation is proposed; it increases the diversity of learned feature scales to improve the efficiency of the tracking network's similarity learning. Experimental results show that, on the VOT2018 benchmark, the EAO of our method exceeds that of the mainstream algorithm SiamCAR by 4.0%, tracking accuracy is improved by 3.4%, and the tracking speed remains at 40 FPS; the tracking success rate is improved by 2.0% and the tracking accuracy rate by 3.2% compared with SiamCAR. The method achieves higher accuracy and robustness under occlusion, deformation, illumination variation, and other visual tracking scenarios, and delivers better overall tracking performance.
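For readers unfamiliar with the operation the abstract builds on, the following is a minimal sketch of plain single-scale depth-wise cross correlation, in which each channel of the template features is slid over the matching channel of the search-region features. It is an illustration only: the function name `depthwise_xcorr`, the nested-list data layout, and the toy values are assumptions made here, and the multi-scale variant MS-DWXCorr is not reproduced because the abstract does not specify its design.

```python
def depthwise_xcorr(search, kernel):
    """Depth-wise cross correlation (illustrative, pure Python).

    search: C x Hs x Ws nested lists (search-region features)
    kernel: C x Hk x Wk nested lists (template features)
    Returns C x (Hs-Hk+1) x (Ws-Wk+1) response maps, one per channel.
    """
    if len(search) != len(kernel):
        raise ValueError("search and kernel must have the same channel count")
    out = []
    # Each channel is correlated independently; channels are never mixed.
    for s_ch, k_ch in zip(search, kernel):
        hs, ws = len(s_ch), len(s_ch[0])
        hk, wk = len(k_ch), len(k_ch[0])
        resp = [
            [
                sum(
                    s_ch[i + u][j + v] * k_ch[u][v]
                    for u in range(hk)
                    for v in range(wk)
                )
                for j in range(ws - wk + 1)
            ]
            for i in range(hs - hk + 1)
        ]
        out.append(resp)
    return out


# Toy example (hypothetical values): 2 channels, 3x3 search, 2x2 template.
search = [
    [[1, 2, 3], [4, 5, 6], [7, 8, 9]],
    [[1, 0, 1], [0, 1, 0], [1, 0, 1]],
]
kernel = [
    [[1, 0], [0, 1]],
    [[1, 1], [1, 1]],
]
response = depthwise_xcorr(search, kernel)
print(response)
```

The output keeps one response map per channel, which is what distinguishes the depth-wise form from an ordinary cross correlation that sums over channels.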
Pages: 12
Related papers
50 in total
  • [41] QRS multi-scale fusion detection algorithm
    Sun, Tao
    Zhang, Hong-Jian
    Zhou, Li
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2002, 36 (01): 26 - 28
  • [42] Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network
    Liu, Zhenguo
    Li, Zhao
    Ao, Wengang
    Zhang, Shaoshuang
    Liu, Wenlong
    He, Yizhi
    ELECTRONICS, 2023, 12 (07)
  • [43] Multi-scale feature fusion network with local attention for lung segmentation
    Xie, Yinghua
    Zhou, Yuntong
    Wang, Chen
    Ma, Yanshan
    Yang, Ming
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 119
  • [44] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [45] MAFormer: A transformer network with multi-scale attention fusion for visual recognition
    Sun, Huixin
    Wang, Yunhao
    Wang, Xiaodi
    Zhang, Bin
    Xin, Ying
    Zhang, Baochang
    Cao, Xianbin
    Ding, Errui
    Han, Shumin
    NEUROCOMPUTING, 2024, 595
  • [46] Multi-Scale Mixed Attention Network for CT and MRI Image Fusion
    Liu, Yang
    Yan, Binyu
    Zhang, Rongzhu
    Liu, Kai
    Jeon, Gwanggil
    Yang, Xiaoming
    ENTROPY, 2022, 24 (06)
  • [47] Multi-Scale Feature Fusion Network with Attention for Single Image Dehazing
    Pattern Recognition and Image Analysis, 2021, 31 : 608 - 615
  • [48] Multi-Scale Feature Fusion Network with Attention for Single Image Dehazing
    Hu, Bin
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (04) : 608 - 615
  • [49] Drone Target Detection Algorithm Based on Multi-scale Fusion and Lightweight Network
    Xue S.
    Lu T.
    Lü Q.
    Cao G.
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2023, 50 (08): 82 - 93
  • [50] MsRAN: a multi-scale residual attention network for multi-model image fusion
    Wang, Jing
    Yu, Long
    Tian, Shengwei
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (12) : 3615 - 3634