Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation

Cited by: 2
Authors
Chen, Qingjun [1]
Zheng, Hua [1, 2, 3, 4]
Pan, Hao [1]
Liao, Xiaoqi [1]
Wang, Hongkai [1]
Affiliations
[1] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350108, Peoples R China
[2] Fujian Normal Univ, Key Lab Optoelect Sci & Technol Med, Minist Educ, Fuzhou 350108, Peoples R China
[3] Fujian Normal Univ, Fujian Prov Key Lab Photon Technol, Fuzhou 350108, Peoples R China
[4] Fujian Prov Engn Res Ctr Optoelect Sensors & Inte, Fuzhou 350108, Peoples R China
Keywords
Siamese network; visual object tracking; anchor-free regression strategy; multi-scale channel attention fusion; multi-scale depth-wise cross correlation; TRACKING;
DOI
10.1117/12.2680160
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This research takes the feature extraction network and the depth-wise cross correlation learning method of the Siamese network as its starting point. First, the proposed framework uses an anchor-free regression strategy, with the residual network ResNet50 as the backbone and the SENet channel attention mechanism added to it. Building on SENet, the SE-MSCAM multi-scale channel attention model is proposed to compensate for the feature extraction network's weak local feature extraction. The attention fusion module AFFN is then added to enhance the soft selection of attention. Combining the SE-MSCAM model with AFFN yields the proposed ResNet50-AFFN multi-scale channel attention fusion network. Second, to address the single-scale learning limitation of the depth-wise cross correlation in SiamRPN++, the MS-DWXCorr multi-scale depth-wise cross correlation is proposed, which increases the diversity of learned feature scales and improves the efficiency of similarity learning in the tracking network. Experimental results show that, on the VOT2018 benchmark, the EAO of our method is 4.0% higher than that of the mainstream algorithm SiamCAR, the tracking accuracy is improved by 3.4%, and the tracking speed stays at 40 FPS; the tracking success rate is improved by 2.0% and the tracking accuracy rate by 3.2% compared with SiamCAR. The method achieves higher accuracy and robustness under occlusion, deformation, illumination variation, and other challenging visual tracking scenarios, and delivers better overall tracking performance.
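To make the depth-wise cross correlation mentioned in the abstract concrete, the following is a minimal pure-Python sketch of the standard single-scale operation: each channel of the search-region features is correlated only with the matching channel of the template features. The function name `depthwise_xcorr`, the nested-list layout, and the toy values are assumptions made for illustration. The abstract does not describe how MS-DWXCorr combines several scales, so that part is deliberately not shown.

```python
def depthwise_xcorr(search, template):
    """Correlate each channel of `search` with the same channel of `template`.

    search:   C x H x W nested lists (search-region features)
    template: C x h x w nested lists (template features, used as kernels)
    returns:  C x (H-h+1) x (W-w+1) response maps, one per channel
    """
    if len(search) != len(template):
        raise ValueError("search and template must have the same channel count")
    response = []
    for s_ch, t_ch in zip(search, template):
        H, W = len(s_ch), len(s_ch[0])
        h, w = len(t_ch), len(t_ch[0])
        # Slide the template channel over the search channel (no padding, stride 1).
        out = [
            [
                sum(
                    s_ch[i + di][j + dj] * t_ch[di][dj]
                    for di in range(h)
                    for dj in range(w)
                )
                for j in range(W - w + 1)
            ]
            for i in range(H - h + 1)
        ]
        response.append(out)
    return response


# Toy example (hypothetical values): 2 channels, 3x3 search, 2x2 template.
search = [
    [[1, 2, 3], [4, 5, 6], [7, 8, 9]],
    [[1, 0, 1], [0, 1, 0], [1, 0, 1]],
]
template = [
    [[1, 0], [0, 1]],
    [[1, 1], [1, 1]],
]
response = depthwise_xcorr(search, template)
print(response)  # [[[6, 8], [12, 14]], [[2, 2], [2, 2]]]
```

Because channels are never mixed, the output keeps the same channel count as the inputs, which is what distinguishes the depth-wise form from a full cross correlation that sums over channels.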
Pages: 12
Related papers
50 results in total
  • [31] Skin disease migration segmentation network based on multi-scale channel attention
    Yu, Bin
    Yu, Long
    Tian, Shengwei
    Wu, Weidong
    Zhang, Dezhi
    Kang, Xiaojing
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (03): 730 - 739
  • [32] A Multi-scale Fusion-based Dark Channel Prior Dehazing Algorithm
    Zeng, Yujun
    Liu, Xiaolin
    SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
  • [33] Multi-scale Residual Pyramid Attention Network for Monocular Depth Estimation
    Liu, Jing
    Zhang, Xiaona
    Li, Zhaoxin
    Mao, Tianlu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021: 5137 - 5144
  • [34] Multi-scale network with shared cross-attention for audio–visual correlation learning
    Jiwei Zhang
    Yi Yu
    Suhua Tang
    Wei Li
    Jianming Wu
    Neural Computing and Applications, 2023, 35 : 20173 - 20187
  • [35] Multi-scale Gated Inpainting Network with Patch-Wise Spacial Attention
    Hu, Xinrong
    Jin, Junjie
    Xiong, Mingfu
    Liu, Junping
    Peng, Tao
    Zhang, Zili
    Chen, Jia
    He, Ruhan
    Qin, Xiao
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS: DASFAA 2021 INTERNATIONAL WORKSHOPS, 2021, 12680 : 169 - 184
  • [36] Multi-Scale Attention Network for Image Cropping
    Lian, Tianpei
    Xian, Ke
    Pan, Zhiyu
    Hong, Chaoyi
    Cao, Zhiguo
    Zhong, Weicai
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020: 2640 - 2645
  • [37] Lightweight Seizure Detection Based on Multi-Scale Channel Attention
    Wang, Ziwei
    Hou, Sujuan
    Xiao, Tiantian
    Zhang, Yongfeng
    Lv, Hongbin
    Li, Jiacheng
    Zhao, Shanshan
    Zhao, Yanna
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (12)
  • [38] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [39] Cross Attention-Based Multi-Scale Convolutional Fusion Network for Hyperspectral and LiDAR Joint Classification
    Ge, Haimiao
    Wang, Liguo
    Pan, Haizhu
    Liu, Yanzhong
    Li, Cheng
    Lv, Dan
    Ma, Huiyu
    Remote Sensing, 2024, 16 (21)
  • [40] JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Shangguan, Zixuan
    Yuan, Xiaoyan
    Li, Yutong
    Hu, Bin
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023, 2023-August : 3417 - 3421