3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds

被引:0
|
作者
Hui, Le [1 ,2 ]
Wang, Lingpeng [1 ,2 ]
Cheng, Mingmei [1 ,2 ]
Xie, Jin [1 ,2 ]
Yang, Jian [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ, Nanjing, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing, Peoples R China
关键词
OBJECT; ROBUST;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object tracking in point clouds is still a challenging problem due to the sparsity of LiDAR points in dynamic environments. In this work, we propose a Siamese voxel-to-BEV tracker, which can significantly improve the tracking performance in sparse 3D point clouds. Specifically, it consists of a Siamese shape-aware feature learning network and a voxel-to-BEV target localization network. The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified. To this end, we first perform template feature embedding to embed the template's feature into the potential target and then generate a dense 3D shape to characterize the shape information of the potential target. For localizing the tracked target, the voxel-toBEV target localization network regresses the target's 2D center and the z-axis center from the dense bird's eye view (BEV) feature map in an anchor-free manner. Concretely, we compress the voxelized point cloud along z-axis through max pooling to obtain a dense BEV feature map, where the regression of the 2D center and the z -axis center can be performed more effectively. Extensive evaluation on the KITTI and nuScenes datasets shows that our method significantly outperforms the current state-of-the-art methods by a large margin. Code is available at https: //github.com/fpthink/V2B.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] PointSiamRCNN: Target-aware Voxel-based Siamese Tracker for Point Clouds
    Zou, Hao
    Zhang, Chujuan
    Liu, Yong
    Li, Wanlong
    Wen, Feng
    Zhang, Hongbo
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 7029 - 7035
  • [2] Point Siamese Network for Person Tracking Using 3D Point Clouds
    Cui, Yubo
    Fang, Zheng
    Zhou, Sifan
    [J]. SENSORS, 2020, 20 (01)
  • [3] Permuted Sparse Representation for 3D Point Clouds
    Hou, Junhui
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (12) : 1847 - 1851
  • [4] Graph-Based Point Tracker for 3D Object Tracking in Point Clouds
    Park, Minseong
    Seong, Hongje
    Jang, Wonje
    Kim, Euntai
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2053 - 2061
  • [5] 3D Siamese Transformer Network for Single Object Tracking on Point Clouds
    Hui, Le
    Wang, Lingpeng
    Tang, Linghua
    Lan, Kaihao
    Xie, Jin
    Yang, Jian
    [J]. COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 293 - 310
  • [6] SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds
    He, Qingdong
    Wang, Zhengning
    Zeng, Hao
    Zeng, Yi
    Liu, Yijun
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 870 - 878
  • [7] A Lightweight and Detector-Free 3D Single Object Tracker on Point Clouds
    Xia, Yan
    Wu, Qiangqiang
    Li, Wei
    Chan, Antoni B. B.
    Stilla, Uwe
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5543 - 5554
  • [8] SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
    Sun, Pei
    Tan, Mingxing
    Wang, Weiyue
    Liu, Chenxi
    Xia, Fei
    Leng, Zhaoqi
    Anguelov, Dragomir
    [J]. COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 426 - 442
  • [9] Fast and Robust 3D Feature Extraction from Sparse Point Clouds
    Serafin, Jacopo
    Olson, Edwin
    Grisetti, Giorgio
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4105 - 4112
  • [10] Automated Reconstruction of 3D Open Surfaces from Sparse Point Clouds
    Arshad, Mohammad Samiul
    Beksi, William J.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, : 216 - 221