High-Performance Transformer Tracking

被引:17
|
作者
Chen, Xin [1 ,2 ]
Yan, Bin [1 ,2 ]
Zhu, Jiawen [1 ,2 ]
Lu, Huchuan [1 ,2 ]
Ruan, Xiang [3 ]
Wang, Dong [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Liaoning, Peoples R China
[2] Dalian Univ Technol, Ningbo Inst, Ningbo 315016, Zhejiang, Peoples R China
[3] Tiwaki Co Ltd, Kusatsu, Shiga 5258577, Japan
基金
中国国家自然科学基金;
关键词
Transformers; Target tracking; Correlation; Magnetic heads; Feature extraction; Semantics; Head; Cross-attention; object tracking; self-attention; siamese tracking; transformer; VISUAL TRACKING;
D O I
10.1109/TPAMI.2022.3232535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Correlation has a critical role in the tracking field, especially in recent popular Siamese-based trackers. The correlation operation is a simple fusion method that considers the similarity between the template and the search region. However, the correlation operation is a local linear matching process, losing semantic information and easily falling into a local optimum, which may be the bottleneck in designing high-accuracy tracking algorithms. In this work, to determine whether a better feature fusion method exists than correlation, a novel attention-based feature fusion network, inspired by the transformer, is presented. This network effectively combines the template and search region features using attention mechanism. Specifically, the proposed method includes an ego-context augment module based on self-attention and a cross-feature augment module based on cross-attention. First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression heads. Based on the TransT baseline, we also design a segmentation branch to generate the accurate mask. Finally, we propose a stronger version of TransT by extending it with a multi-template scheme and an IoU prediction head, named TransT-M. Experiments show that our TransT and TransT-M methods achieve promising results on seven popular benchmarks. Code and models are available at https://github.com/chenxin-dlut/TransT-M.
引用
收藏
页码:8507 / 8523
页数:17
相关论文
共 50 条
  • [31] High-Performance Visual Tracking Based on High-Order Pooling Network
    Feng, Xinxi
    Pu, Lei
    IEEE ACCESS, 2022, 10 : 102957 - 102967
  • [32] A Fully Symmetric High-Performance Transformer Balun Based on TSV for RF Applications
    Wang, Fengjuan
    Zhang, Dingxi
    Yin, Xiangkun
    Yu, Ningmei
    Yang, Yuan
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2023, 13 (07): : 1074 - 1077
  • [33] A High-Performance Hybrid Current Transformer Based on a Fast Variable Optical Attenuator
    Wei, Pu
    Cheng, Cheng
    Wang, Xuefeng
    Shan, Xuekang
    Sun, Xiaohan
    IEEE TRANSACTIONS ON POWER DELIVERY, 2014, 29 (06) : 2656 - 2663
  • [34] Tracking Your Browser with High-Performance Browser Fingerprint Recognition Model
    Wei Jiang
    Xiaoxi Wang
    Xinfang Song
    Qixu Liu
    Xiaofeng Liu
    中国通信, 2020, 17 (03) : 168 - 175
  • [35] High-performance UAVs visual tracking using deep convolutional feature
    Shuaidong Yang
    Jin Xu
    Haiyun Chen
    Min Wang
    Neural Computing and Applications, 2022, 34 : 13539 - 13558
  • [36] High-Performance Robotic Contour Tracking based on the Dynamic Compensation Concept
    Huang, Shouren
    Bergstroem, Niklas
    Yamakawa, Yuji
    Senoo, Taku
    Ishikawa, Masatoshi
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 3886 - 3893
  • [37] Tracking Your Browser with High-Performance Browser Fingerprint Recognition Model
    Jiang, Wei
    Wang, Xiaoxi
    Song, Xinfang
    Liu, Qixu
    Liu, Xiaofeng
    CHINA COMMUNICATIONS, 2020, 17 (03) : 168 - 175
  • [38] High-Performance Discriminative Tracking with Target-Aware Feature Embeddings
    Yu, Bin
    Tang, Ming
    Zheng, Linyu
    Zhu, Guibo
    Wang, Jinqiao
    Lu, Hanqing
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 3 - 15
  • [39] Design and implementation of a high-performance technique for tracking PV peak power
    Belkaid, Abdelhakim
    Gaubert, Jean-Paul
    Gherbi, Ahmed
    IET RENEWABLE POWER GENERATION, 2017, 11 (01) : 92 - 99
  • [40] Simple structure for a high-performance three-dimensional tracking filter
    Weiss, H
    Hexner, G
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2004, 27 (03) : 491 - 493