Towards Real-World Visual Tracking With Temporal Contexts

Cited by: 23
Authors
Cao, Ziang [1]
Huang, Ziyuan [2]
Pan, Liang [1]
Zhang, Shiwei [3]
Liu, Ziwei [1]
Fu, Changhong [4]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Natl Univ Singapore, Dept Mech Engn, Singapore 119077, Singapore
[3] DAMO Acad, Alibaba Grp, Hangzhou 310052, Zhejiang, Peoples R China
[4] Tongji Univ, Sch Mech Engn, Shanghai 201804, Peoples R China
Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
Latency-aware evaluations; real-world tests; temporal contexts; two-level framework; visual tracking; PLUS PLUS; NETWORK;
DOI
10.1109/TPAMI.2023.3307174
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Visual tracking has made significant improvements in the past few decades. Most existing state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking the real-world conditions; 2) adopt the tracking-by-detection paradigm, neglecting rich temporal contexts; 3) only integrate the temporal information into the template, where temporal contexts among consecutive frames are far from being fully utilized. To handle those problems, we propose a two-level framework (TCTrack) that can exploit temporal contexts efficiently. Based on it, we propose a stronger version for real-world visual tracking, i.e., TCTrack++. It boils down to two levels: features and similarity maps. Specifically, for feature extraction, we propose an attention-based temporally adaptive convolution to enhance the spatial features using temporal information, which is achieved by dynamically calibrating the convolution weights. For similarity map refinement, we introduce an adaptive temporal transformer to encode the temporal knowledge efficiently and decode it for the accurate refinement of the similarity map. To further improve the performance, we additionally introduce a curriculum learning strategy. Also, we adopt online evaluation to measure performance in real-world conditions. Exhaustive experiments on 8 well-known benchmarks demonstrate the superiority of TCTrack++. Real-world tests directly verify that TCTrack++ can be readily used in real-world applications.
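To make the phrase "dynamically calibrating the convolution weights" concrete, the following is a minimal sketch of the general idea only, not the TCTrack++ implementation. Every function name, the 1D setting, the dot-product attention over pooled frame descriptors, and the sigmoid-bounded scaling are assumptions chosen for illustration; the abstract does not specify any of these details.

```python
import math


def frame_descriptor(feature_map):
    """Global average pooling: one scalar per channel (hypothetical helper)."""
    return [sum(channel) / len(channel) for channel in feature_map]


def calibration_factors(descriptors, temperature=1.0):
    """Turn a window of frame descriptors into per-channel scale factors.

    Assumption: dot-product attention between the latest descriptor and
    every descriptor in the window, followed by a sigmoid mapped to (0, 2)
    so that a zero context leaves the weights unchanged (factor 1.0).
    """
    current = descriptors[-1]
    scores = [
        sum(c * d for c, d in zip(current, desc)) / temperature
        for desc in descriptors
    ]
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]  # numerically stable softmax
    total = sum(exps)
    attention = [e / total for e in exps]
    context = [
        sum(a * desc[k] for a, desc in zip(attention, descriptors))
        for k in range(len(current))
    ]
    return [2.0 / (1.0 + math.exp(-c)) for c in context]


def temporally_adaptive_conv1d(feature_map, base_weights, factors):
    """Scale each input channel's kernels by its factor, then cross-correlate.

    feature_map:  [in_channels][length]
    base_weights: [out_channels][in_channels][kernel_size]
    factors:      [in_channels]
    """
    calibrated = [
        [[w * factors[i] for w in kernel] for i, kernel in enumerate(per_out)]
        for per_out in base_weights
    ]
    kernel_size = len(base_weights[0][0])
    out_length = len(feature_map[0]) - kernel_size + 1
    output = []
    for per_out in calibrated:
        row = []
        for t in range(out_length):
            acc = 0.0
            for i, kernel in enumerate(per_out):
                for j, w in enumerate(kernel):
                    acc += w * feature_map[i][t + j]
            row.append(acc)
        output.append(row)
    return output
```

In this sketch the base kernel stays fixed and only a small per-channel factor changes from frame to frame, which is one inexpensive way to let temporal context influence spatial feature extraction.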
Pages: 15834-15849
Page count: 16