Siamese visual tracking based on criss-cross attention and improved head network

被引:0
|
作者
Jianming Zhang
Haitao Huang
Xiaokang Jin
Li-Dan Kuang
Jin Zhang
机构
[1] Ministry of Education (Changsha University of Science and Technology),Key Laboratory of Safety Control of Bridge Engineering
[2] Changsha University of Science and Technology,School of Computer and Communication Engineering
[3] Jinhua Advanced Research Institute,undefined
来源
关键词
Visual tracking; Siamese network; Deep learning; Attention mechanism; Anchor-free; Center-ness;
D O I
暂无
中图分类号
学科分类号
摘要
The efficient Siamese anchor-free tracker has fewer parameters, but it produces a large number of low-quality bounding boxes which are located far away from the center of the object. Moreover, a plenty of background information or distractors also interfere with the tracking process, resulting in the inaccurate results of classification and regression. As such, we propose a novel Siamese anchor-free network based on criss-cross attention and an improved head network. We apply ResNet-50 to extract the features of the template image and search region, then feed the feature maps into a recurrent criss-cross attention module to make it more discriminative. The enhanced feature maps are inputted into our improved head network, which include the center-ness branch based on the original classification and regression branches to filter out low-quality bounding boxes. Our proposed tracker reduces the impact of background information or distractors and can obtain high-quality bounding boxes, generating more accurate and robust tracking results. Extensive experiments and comparisons with state-of-the-art trackers are conducted on many challenging benchmarks such as VOT2016, VOT2018, GOT-10k, UAV123 and OTB2015. Our tracker achieves excellent performance with a considerable real-time speed.
引用
收藏
页码:1589 / 1615
页数:26
相关论文
共 50 条
  • [1] Siamese visual tracking based on criss-cross attention and improved head network
    Zhang, Jianming
    Huang, Haitao
    Jin, Xiaokang
    Kuang, Li-Dan
    Zhang, Jin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 1589 - 1615
  • [2] Criss-Cross Attentional Siamese Networks for Object Tracking
    Wang, Zhangdong
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    Xiong, Neal N.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2931 - 2946
  • [3] ATCC: Accurate tracking by criss-cross location attention
    Wu, Yong
    Liu, Zhi
    Zhou, Xiaofei
    Ye, Linwei
    Wang, Yang
    [J]. IMAGE AND VISION COMPUTING, 2021, 111
  • [4] Siamese Network Based on MLP and Multi-head Cross Attention for Visual Object Tracking
    Li, Piaoyang
    Lan, Shiyong
    Sun, Shipeng
    Wang, Wenwu
    Gao, Yongyang
    Yang, Yongyu
    Yu, Guangyu
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 420 - 431
  • [5] An Improved Siamese Tracking Network Based On Self-Attention And Cross-Attention
    Lai Yijun
    Song Jianmei
    She Haoping
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 466 - 470
  • [6] Visual Tracking With Siamese Network Based on Fast Attention Network
    Qin, Lin
    Yang, Yang
    Huang, Dandan
    Zhu, Naibo
    Yang, Han
    Xu, Zhisong
    [J]. IEEE ACCESS, 2022, 10 : 35632 - 35642
  • [7] CCNet: Criss-Cross Attention for Semantic Segmentation
    Huang, Zilong
    Wang, Xinggang
    Wei, Yunchao
    Huang, Lichao
    Shi, Humphrey
    Liu, Wenyu
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6896 - 6908
  • [8] CCNet: Criss-Cross Attention for Semantic Segmentation
    Huang, Zilong
    Wang, Xinggang
    Huang, Lichao
    Huang, Chang
    Wei, Yunchao
    Liu, Wenyu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 603 - 612
  • [9] Siamese-Based Twin Attention Network for Visual Tracking
    Bao, Hua
    Shu, Ping
    Zhang, Hongchao
    Liu, Xiaobai
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) : 847 - 860
  • [10] SiamAtt: Siamese attention network for visual tracking
    Yang, Kai
    He, Zhenyu
    Zhou, Zikun
    Fan, Nana
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 203