An Improved Siamese Tracking Network Based On Self-Attention And Cross-Attention

Cited by: 0
Authors
Lai Yijun [1]
Song Jianmei [1]
She Haoping [1]
Affiliations
[1] Beijing Inst Technol, Sch Aerosp Engn, Beijing, Peoples R China
Keywords
object tracking; Siamese network; self-attention; cross-attention
DOI
10.1109/CCDC58219.2023.10326870
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The deep Siamese visual tracking network SiamRPN++ shows unsatisfactory success rate and robustness in complex scenes such as occlusion, large deformation, interference from similar objects, and long-term tracking. To address this, we propose an improvement strategy based on self-attention and cross-attention mechanisms. In the backbone, we use channel and spatial self-attention modules; between the template features and the search features in each of the three RPN modules, we use different cross-channel attention modules; finally, we apply a special self-attention to the similarity feature maps. These techniques effectively suppress interference, improve feature quality, and increase robustness. Compared with the original SiamRPN++ using the parameters from the official open-source framework PySOT, our network improves robustness by 3% on VOT2018, and accuracy by 2% and success rate by 3% on OTB100.
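As a rough illustration of the cross-channel attention idea named in the abstract, the sketch below gates each channel of the search features with a weight derived from the matching channel of the template features. It is not the authors' module: the function names, the nested-list feature layout, and the plain sigmoid gate (with no learned layers) are assumptions made only for this example.

```python
import math


def global_avg_pool(feat):
    """feat is a list of C channels, each an H x W nested list; returns C channel means."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feat]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def cross_channel_attention(template, search):
    """Scale each search channel by a gate computed from the same template channel.

    Hypothetical simplification: a real module would normally pass the pooled
    template descriptor through learned layers before the sigmoid.
    """
    if len(template) != len(search):
        raise ValueError("template and search must have the same channel count")
    gates = [sigmoid(m) for m in global_avg_pool(template)]
    return [[[g * v for v in row] for row in ch] for g, ch in zip(gates, search)]
```

In this sketch, a template channel with a strong average response leaves the corresponding search channel nearly unchanged, while a template channel with a zero average response halves it.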
Pages: 466-470
Page count: 5
Related papers
50 in total
  • [1] Multi-level Cross-attention Siamese Network For Visual Object Tracking
    Zhang, Jianwei
    Wang, Jingchao
    Zhang, Huanlong
    Miao, Mengen
    Cai, Zengyu
    Chen, Fuguo
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (12) : 3976 - 3990
  • [2] Self-Attention based Siamese Neural Network recognition Model
    Liu, Yuxing
    Chang, Geng
    Fu, Guofeng
    Wei, Yingchao
    Lan, Jie
    Liu, Jiarui
    [J]. 2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022 : 721 - 724
  • [3] IS CROSS-ATTENTION PREFERABLE TO SELF-ATTENTION FOR MULTI-MODAL EMOTION RECOGNITION?
    Rajan, Vandana
    Brutti, Alessio
    Cavallaro, Andrea
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022 : 4693 - 4697
  • [4] Siamese visual tracking based on criss-cross attention and improved head network
    Zhang, Jianming
    Huang, Haitao
    Jin, Xiaokang
    Kuang, Li-Dan
    Zhang, Jin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 1589 - 1615
  • [5] Siamese visual tracking based on criss-cross attention and improved head network
    Jianming Zhang
    Haitao Huang
    Xiaokang Jin
    Li-Dan Kuang
    Jin Zhang
    [J]. Multimedia Tools and Applications, 2024, 83 : 1589 - 1615
  • [6] A cross-attention and Siamese network based model for off-topic detection
    Fan, Cong
    Guo, Shen
    Wumaier, Aishan
    Liu, Jiajun
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023 : 770 - 777
  • [7] CASNet: A Cross-Attention Siamese Network for Video Salient Object Detection
    Ji, Yuzhu
    Zhang, Haijun
    Jie, Zequn
    Ma, Lin
    Wu, Q. M. Jonathan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2676 - 2690
  • [8] A fine-grained classification method based on self-attention Siamese network
    He Can
    Yuan Guowu
    Wu Hao
    [J]. 2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021 : 148 - 154
  • [9] Adaptive Multi-Feature Fusion Visual Target Tracking Based on Siamese Neural Network with Cross-Attention Mechanism
    Zhou, Qian
    Xia, Haoran
    Yan, Hongzheng
    Yang, Ming
    Chen, Shidong
    [J]. 2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022 : 307 - 316
  • [10] SCAD: A Siamese Cross-Attention Discrimination Network for Bitemporal Building Change Detection
    Xu, Chuan
    Ye, Zhaoyi
    Mei, Liye
    Shen, Sen
    Zhang, Qi
    Sui, Haigang
    Yang, Wei
    Sun, Shaohua
    [J]. REMOTE SENSING, 2022, 14 (24)