High-Performance Transformer Tracking

被引:17
|
作者
Chen, Xin [1 ,2 ]
Yan, Bin [1 ,2 ]
Zhu, Jiawen [1 ,2 ]
Lu, Huchuan [1 ,2 ]
Ruan, Xiang [3 ]
Wang, Dong [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Liaoning, Peoples R China
[2] Dalian Univ Technol, Ningbo Inst, Ningbo 315016, Zhejiang, Peoples R China
[3] Tiwaki Co Ltd, Kusatsu, Shiga 5258577, Japan
基金
中国国家自然科学基金;
关键词
Transformers; Target tracking; Correlation; Magnetic heads; Feature extraction; Semantics; Head; Cross-attention; object tracking; self-attention; siamese tracking; transformer; VISUAL TRACKING;
D O I
10.1109/TPAMI.2022.3232535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Correlation has a critical role in the tracking field, especially in recent popular Siamese-based trackers. The correlation operation is a simple fusion method that considers the similarity between the template and the search region. However, the correlation operation is a local linear matching process, losing semantic information and easily falling into a local optimum, which may be the bottleneck in designing high-accuracy tracking algorithms. In this work, to determine whether a better feature fusion method exists than correlation, a novel attention-based feature fusion network, inspired by the transformer, is presented. This network effectively combines the template and search region features using attention mechanism. Specifically, the proposed method includes an ego-context augment module based on self-attention and a cross-feature augment module based on cross-attention. First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression heads. Based on the TransT baseline, we also design a segmentation branch to generate the accurate mask. Finally, we propose a stronger version of TransT by extending it with a multi-template scheme and an IoU prediction head, named TransT-M. Experiments show that our TransT and TransT-M methods achieve promising results on seven popular benchmarks. Code and models are available at https://github.com/chenxin-dlut/TransT-M.
引用
收藏
页码:8507 / 8523
页数:17
相关论文
共 50 条
  • [21] A High-Performance Signal Processing System for Monopulse Tracking Radar
    Hong, Yong-bin
    Xu, Cheng-fa
    Gao, Mei-guo
    Zhao, Li-zhi
    MANUFACTURING SCIENCE AND TECHNOLOGY, PTS 1-8, 2012, 383-390 : 471 - 475
  • [22] High-performance autostereoscopic display based on the lenticular tracking method
    Huang, Tianqi
    Han, Boxuan
    Zhang, Xinran
    Liao, Hongen
    OPTICS EXPRESS, 2019, 27 (15) : 20421 - 20434
  • [23] Stator Flux Trajectory Tracking Control for High-Performance Drives
    Oikonomou, Nikolaos
    Holtz, Joachim
    CONFERENCE RECORD OF THE 2006 IEEE INDUSTRY APPLICATIONS CONFERENCE, FORTY-FIRST IAS ANNUAL MEETING, VOL 1-5, 2006, : 1268 - 1275
  • [24] DYNAMIC CONTROL OF A ROBOT MANIPULATOR FOR HIGH-PERFORMANCE CONVEYOR TRACKING
    PARK, TH
    LEE, BH
    JOURNAL OF ROBOTIC SYSTEMS, 1995, 12 (10): : 647 - 660
  • [25] High-Performance Siamese Network for Real-Time Tracking
    Du, Guocai
    Zhou, Peiyong
    Abudurexiti, Ruxianguli
    Mahpirat
    Aysa, Alimjan
    Ubul, Kurban
    SENSORS, 2022, 22 (22)
  • [26] High-performance trajectory tracking control of a quadrotor with disturbance observer
    Dong, Wei
    Gu, Guo-Ying
    Zhu, Xiangyang
    Ding, Han
    SENSORS AND ACTUATORS A-PHYSICAL, 2014, 211 : 67 - 77
  • [27] High-performance UAVs visual tracking based on siamese network
    Shuaidong Yang
    Haiyun Chen
    Fancheng Xu
    Yang Li
    Jiemin Yuan
    The Visual Computer, 2022, 38 : 2107 - 2123
  • [28] High-Performance Visual Tracking With Extreme Learning Machine Framework
    Deng, Chenwei
    Han, Yuqi
    Zhao, Baojun
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2781 - 2792
  • [29] VARIABLE STRUCTURE TRACKING OF DC MOTOR FOR HIGH-PERFORMANCE APPLICATIONS
    ELSHARKAWI, MA
    HUANG, CH
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 1989, 4 (04) : 643 - 650
  • [30] High-Performance Visual Tracking Based on High-Order Pooling Network
    Feng, Xinxi
    Pu, Lei
    IEEE Access, 2022, 10 : 102957 - 102967