Temporal-Aware Siamese Tracker: Integrate Temporal Context for 3D Object Tracking

被引:0
|
作者
Lan, Kaihao [1 ]
Jiang, Haobo [1 ]
Xie, Jin [1 ]
机构
[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
来源
关键词
D O I
10.1007/978-3-031-26319-4_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning discriminative target-specific feature representation for object localization is the core of the 3D Siamese object tracking algorithms. Current Siamese trackers focus on aggregating the target information from the latest template into the search area for target-specific feature construction, which presents the limited performance in the case of object occlusion or object missing. To this end, in this paper, we propose a novel temporal-aware Siamese tracking framework, where the rich target clue lying in a set of historical templates is integrated into the search area for reliable target-specific feature aggregation. Specifically, our method consists of three modules, including a template set sampling module, a temporal feature enhancement module and a temporal-aware feature aggregation module. In the template set sampling module, an effective scoring network is proposed to evaluate the tracking quality of the template so that the high-quality templates are collected to form the historical template set. Then, with the initial feature embeddings of the historical templates, the temporal feature enhancement module concatenates all template embeddings as a whole and then feeds them into a linear attention module for cross-template feature enhancement. Furthermore, the temporal-aware feature aggregation module aggregates the target clue lying in each template into the search area to construct multiple historical target-specific search-area features. Particularly, we follow the collection orders of the templates to fuse all generated target-specific features via an RNN-based module so that the fusion weight of the previous template information can be discounted to better fit the current tracking state. Finally, we feed the temporal fused target-specific feature into a modified CenterPoint detection head for target position regression. Extensive experiments on KITTI, NuScenes and waymo open datasets show the effectiveness of our proposed method. Source code is available at https://github.com/tqsdyy/TAT.
引用
收藏
页码:20 / 35
页数:16
相关论文
共 50 条
  • [1] STTracker: Spatio-Temporal Tracker for 3D Single Object Tracking
    Cui, Yubo
    Li, Zhiheng
    Fang, Zheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4967 - 4974
  • [2] Leveraging temporal-aware fine-grained features for robust multiple object tracking
    Wu, Han
    Nie, Jiahao
    Zhu, Ziming
    He, Zhiwei
    Gao, Mingyu
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (03): : 2910 - 2931
  • [3] Learning Adaptive Spatial Regularization and Temporal-Aware Correlation Filters for Visual Object Tracking
    Liu, Liqiang
    Feng, Tiantian
    Fu, Yanfang
    Shen, Chao
    Hu, Zhijuan
    Qin, Maoyuan
    Bai, Xiaojun
    Zhao, Shifeng
    MATHEMATICS, 2022, 10 (22)
  • [4] Leveraging temporal-aware fine-grained features for robust multiple object tracking
    Han Wu
    Jiahao Nie
    Ziming Zhu
    Zhiwei He
    Mingyu Gao
    The Journal of Supercomputing, 2023, 79 : 2910 - 2931
  • [5] Context-aware Siamese network for object tracking
    Zhang, Jianwei
    Wang, Jingchao
    Zhang, Huanlong
    Miao, Mengen
    Wu, Di
    IET IMAGE PROCESSING, 2023, 17 (01) : 215 - 226
  • [6] F-Siamese Tracker: A Frustum-based Double Siamese Network for 3D Single Object Tracking
    Zou, Hao
    Cui, Jinhao
    Kong, Xin
    Zhang, Chujuan
    Liu, Yong
    Wen, Feng
    Li, Wanlong
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8133 - 8139
  • [7] Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos
    Chen, Liangjian
    Lin, Shih-Yao
    Xie, Yusheng
    Lin, Yen-Yu
    Xie, Xiaohui
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1049 - 1058
  • [8] TaCoTrack: Tracking Object With Temporal Context
    Wang, Zhixuan
    Wang, Bo
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4068 - 4073
  • [9] Implicit learning in 3D object recognition: The importance of temporal context
    Becker, S
    NEURAL COMPUTATION, 1999, 11 (02) : 347 - 374
  • [10] Temporal-Aware Lightweight Visual Tracking Method for Dynamic Traffic Scenes
    Cen, Xuming
    Hu, Nan
    Wang, Haozhe
    Liu, Shiyi
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 612 - 619