Temporal-Aware Siamese Tracker: Integrate Temporal Context for 3D Object Tracking

Authors
Lan, Kaihao [1]
Jiang, Haobo [1]
Xie, Jin [1]
Affiliation
[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
DOI
10.1007/978-3-031-26319-4_2
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning discriminative, target-specific feature representations for object localization is the core of 3D Siamese object tracking algorithms. Current Siamese trackers focus on aggregating target information from the latest template into the search area to construct target-specific features, which limits performance when the object is occluded or missing. To address this, we propose a novel temporal-aware Siamese tracking framework in which the rich target clues contained in a set of historical templates are integrated into the search area for reliable target-specific feature aggregation. Specifically, our method consists of three modules: a template set sampling module, a temporal feature enhancement module, and a temporal-aware feature aggregation module. In the template set sampling module, a scoring network evaluates the tracking quality of each template so that high-quality templates are collected to form the historical template set. Then, given the initial feature embeddings of the historical templates, the temporal feature enhancement module concatenates all template embeddings and feeds them into a linear attention module for cross-template feature enhancement. Next, the temporal-aware feature aggregation module aggregates the target clues in each template into the search area to construct multiple historical target-specific search-area features. In particular, we fuse the generated target-specific features in the order in which the templates were collected, using an RNN-based module, so that the fusion weight of earlier template information is discounted to better fit the current tracking state. Finally, we feed the temporally fused target-specific feature into a modified CenterPoint detection head for target position regression. Extensive experiments on the KITTI, nuScenes, and Waymo Open datasets show the effectiveness of the proposed method.
Source code is available at https://github.com/tqsdyy/TAT.
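
The two ordering-dependent steps in the abstract, score-based template selection and order-aware fusion, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the score threshold, and the fixed `gate` value are all hypothetical, and a fixed-gate recurrence stands in for the learned RNN-based module purely to show how earlier templates end up with smaller fusion weights.

```python
from typing import List, Sequence


def select_templates(scores: Sequence[float], k: int,
                     threshold: float = 0.5) -> List[int]:
    """Pick up to k template indices by quality score (hypothetical helper).

    Templates scoring below the threshold are dropped; the survivors are
    returned in their original collection order.
    """
    passing = [i for i, s in enumerate(scores) if s >= threshold]
    # Keep the k highest-scoring candidates, then restore chronological order.
    best = sorted(passing, key=lambda i: scores[i], reverse=True)[:k]
    return sorted(best)


def fuse_in_order(features: Sequence[Sequence[float]],
                  gate: float = 0.5) -> List[float]:
    """Fuse per-template features oldest-first.

    Uses the fixed-gate recurrence h_t = (1 - gate) * h_{t-1} + gate * x_t,
    starting from h_0 = 0. This is an assumed stand-in for a learned RNN.
    """
    if not features:
        raise ValueError("need at least one feature vector")
    hidden = [0.0] * len(features[0])
    for x in features:
        hidden = [(1.0 - gate) * h + gate * v for h, v in zip(hidden, x)]
    return hidden


def effective_weights(n: int, gate: float = 0.5) -> List[float]:
    """Weight each of the n inputs carries in the output of fuse_in_order."""
    return [gate * (1.0 - gate) ** (n - 1 - i) for i in range(n)]


if __name__ == "__main__":
    scores = [0.9, 0.2, 0.7, 0.8, 0.6]      # made-up quality scores
    print(select_templates(scores, k=3))     # indices in collection order
    print(effective_weights(3))              # oldest template first
    print(fuse_in_order([[1.0], [2.0], [4.0]]))
```

With this recurrence, the weight of a template shrinks by a factor of `1 - gate` for every newer template fused after it, which mirrors the discounting behaviour the abstract describes; a learned RNN would instead compute data-dependent gates.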
Pages: 20-35
Number of pages: 16