3D Siamese Transformer Network for Single Object Tracking on Point Clouds

被引：19

作者：

Hui, Le ^{[1
]}

Wang, Lingpeng ^{[1
]}

Tang, Linghua ^{[1
]}

Lan, Kaihao ^{[1
]}

Xie, Jin ^{[1
]}

Yang, Jian ^{[1
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Key Lab Intelligent Percept & Syst HighDimens Inf, Nanjing, Peoples R China

来源：

COMPUTER VISION - ECCV 2022, PT II | 2022年 / 13662卷

关键词：

3D single object tracking; Siamese network; Transformer; Point clouds;

D O I：

10.1007/978-3-031-20086-1_17

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area. Due to the large appearance variation between the template and search area during tracking, how to learn the robust cross correlation between them for identifying the potential target in the search area is still a challenging problem. In this paper, we explicitly use Transformer to form a 3D Siamese Transformer network for learning robust cross correlation between the template and the search area of point clouds. Specifically, we develop a Siamese point Transformer network to learn shape context information of the target. Its encoder uses self-attention to capture non-local information of point clouds to characterize the shape information of the object, and the decoder utilizes cross-attention to upsample discriminative point features. After that, we develop an iterative coarse-to-fine correlation network to learn the robust cross correlation between the template and the search area. It formulates the cross-feature augmentation to associate the template with the potential target in the search area via cross attention. To further enhance the potential target, it employs the ego-feature augmentation that applies self-attention to the local k-NN graph of the feature space to aggregate target features. Experiments on the KITTI, nuScenes, and Waymo datasets show that our method achieves state-of-the-art performance on the 3D single object tracking task. Source code is available at https://github.com/fpthink/STNet.

引用

页码：293 / 310

页数：18

共 50 条

[1] Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds
Zheng, Chaoda
Yan, Xu
Zhang, Haiming
Wang, Baoyuan
Cheng, Shenghui
Cui, Shuguang
Li, Zhen
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8101 - 8110
[2] Point Siamese Network for Person Tracking Using 3D Point Clouds
Cui, Yubo
Fang, Zheng
Zhou, Sifan
[J]. SENSORS, 2020, 20 (01)
[3] PTT: Point-Track-Transformer Module for 3D Single Object Tracking in Point Clouds
Shan, Jiayao
Zhou, Sifan
Fang, Zheng
Cui, Yubo
[J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1310 - 1316
[4] Object-Preserving Siamese Network for Single-Object Tracking on Point Clouds
Zhao, Kaijie
Zhao, Haitao
Wang, Zhongze
Peng, Jingchao
Hu, Zhengwei
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3007 - 3017
[5] Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking
Feng, Shihao
Liang, Pengpeng
Gao, Jin
Cheng, Erkang
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8066 - 8073
[6] SPAN: siampillars attention network for 3D object tracking in point clouds
Zhuang, Yi
Zhao, Haitao
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) : 2105 - 2117
[7] SPAN: siampillars attention network for 3D object tracking in point clouds
Yi Zhuang
Haitao Zhao
[J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 2105 - 2117
[8] TM2B: Transformer-Based Motion-to-Box Network for 3D Single Object Tracking on Point Clouds
Xu, Anqi
Nie, Jiahao
He, Zhiwei
Lv, Xudong
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7078 - 7085
[9] GLT-T: Global-Local Transformer Voting for 3D Single Object Tracking in Point Clouds
Nie, Jiahao
He, Zhiwei
Yang, Yuxiang
Gao, Mingyu
Zhang, Jing
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1957 - 1965
[10] Weakly Supervised Point Clouds Transformer for 3D Object Detection
Tang, Zuojin
Sun, Bo
Ma, Tongwei
Li, Daosheng
Xu, Zhenhui
[J]. 2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3948 - 3955

← 1 2 3 4 5 →