3D Siamese Transformer Network for Single Object Tracking on Point Clouds

Cited by: 19
Authors
Hui, Le [1]
Wang, Lingpeng [1]
Tang, Linghua [1]
Lan, Kaihao [1]
Xie, Jin [1]
Yang, Jian [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Key Lab Intelligent Percept & Syst HighDimens Inf, Nanjing, Peoples R China
Source
Keywords
3D single object tracking; Siamese network; Transformer; Point clouds;
DOI
10.1007/978-3-031-20086-1_17
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Siamese network-based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area. Because the appearance of the target can vary greatly between the template and the search area during tracking, learning a robust cross-correlation between them to identify the potential target in the search area remains a challenging problem. In this paper, we explicitly use a Transformer to form a 3D Siamese Transformer network that learns a robust cross-correlation between the template and the search area of point clouds. Specifically, we develop a Siamese point Transformer network to learn shape context information of the target. Its encoder uses self-attention to capture non-local information of point clouds and characterize the shape of the object, and its decoder uses cross-attention to upsample discriminative point features. We then develop an iterative coarse-to-fine correlation network to learn the robust cross-correlation between the template and the search area. It formulates a cross-feature augmentation that associates the template with the potential target in the search area via cross-attention. To further enhance the potential target, it employs an ego-feature augmentation that applies self-attention to the local k-NN graph of the feature space to aggregate target features. Experiments on the KITTI, nuScenes, and Waymo datasets show that our method achieves state-of-the-art performance on the 3D single object tracking task. Source code is available at https://github.com/fpthink/STNet.
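The two attention steps named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the repository linked above for that): the function names `cross_attention` and `knn_self_attention`, the residual connections, the absence of learned query/key/value projections, the feature sizes, and the random input features are all assumptions made for illustration only.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(search, template):
    """Hypothetical cross-feature augmentation.

    search:   (N, C) point features of the search area (queries)
    template: (M, C) point features of the template (keys and values)
    Returns (N, C) search features augmented with template information.
    """
    d = search.shape[1]
    weights = softmax(search @ template.T / np.sqrt(d), axis=-1)  # (N, M)
    return search + weights @ template  # residual connection (assumed)


def knn_self_attention(feats, k):
    """Hypothetical ego-feature augmentation.

    Each point attends only to its k nearest neighbours, where the
    neighbours are found in feature space rather than in 3D space.
    """
    d = feats.shape[1]
    sq = (feats ** 2).sum(axis=1)
    dist = sq[:, None] + sq[None, :] - 2.0 * feats @ feats.T  # (N, N)
    idx = np.argsort(dist, axis=1)[:, :k]  # (N, k), normally includes the point itself
    neigh = feats[idx]  # (N, k, C)
    scores = np.einsum("nc,nkc->nk", feats, neigh) / np.sqrt(d)
    weights = softmax(scores, axis=-1)  # (N, k)
    return feats + np.einsum("nk,nkc->nc", weights, neigh)


# Random stand-ins for backbone features (assumed sizes).
rng = np.random.default_rng(0)
template = rng.normal(size=(64, 32))
search = rng.normal(size=(128, 32))

fused = cross_attention(search, template)
refined = knn_self_attention(fused, k=8)
print(fused.shape, refined.shape)
```

In a trained model the queries, keys, and values would pass through learned linear layers, and the abstract describes the two steps as being applied iteratively in a coarse-to-fine manner; the sketch applies each step once to keep the data flow visible.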
Pages: 293-310
Page count: 18