3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds

被引:0
|
作者
Hui, Le [1 ,2 ]
Wang, Lingpeng [1 ,2 ]
Cheng, Mingmei [1 ,2 ]
Xie, Jin [1 ,2 ]
Yang, Jian [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ, Nanjing, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing, Peoples R China
关键词
OBJECT; ROBUST;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object tracking in point clouds is still a challenging problem due to the sparsity of LiDAR points in dynamic environments. In this work, we propose a Siamese voxel-to-BEV tracker, which can significantly improve the tracking performance in sparse 3D point clouds. Specifically, it consists of a Siamese shape-aware feature learning network and a voxel-to-BEV target localization network. The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified. To this end, we first perform template feature embedding to embed the template's feature into the potential target and then generate a dense 3D shape to characterize the shape information of the potential target. For localizing the tracked target, the voxel-toBEV target localization network regresses the target's 2D center and the z-axis center from the dense bird's eye view (BEV) feature map in an anchor-free manner. Concretely, we compress the voxelized point cloud along z-axis through max pooling to obtain a dense BEV feature map, where the regression of the 2D center and the z -axis center can be performed more effectively. Extensive evaluation on the KITTI and nuScenes datasets shows that our method significantly outperforms the current state-of-the-art methods by a large margin. Code is available at https: //github.com/fpthink/V2B.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Meshfree Thinning of 3D Point Clouds
    Nira Dyn
    Armin Iske
    Holger Wendland
    [J]. Foundations of Computational Mathematics, 2008, 8 : 409 - 425
  • [42] Structure Perception in 3D Point Clouds
    Gruchalla, Kenny
    Raghupathi, Sunand
    Brunhart-Lupo, Nicholas
    [J]. ACM SYMPOSIUM ON APPLIED PERCEPTION (SAP 2021), 2021,
  • [43] P2V-RCNN: Point to Voxel Feature Learning for 3D Object Detection From Point Clouds
    Li, Jiale
    Sun, Yu
    Luo, Shujie
    Zhu, Ziqi
    Dai, Hang
    Krylov, Andrey S.
    Ding, Yong
    Shao, Ling
    [J]. IEEE ACCESS, 2021, 9 : 98249 - 98260
  • [44] NORMAL CLASSIFICATION OF 3D OCCUPANCY GRIDS FOR VOXEL-BASED INDOOR RECONSTRUCTION FROM POINT CLOUDS
    Huebner, P.
    Wursthorn, S.
    Weinmann, M.
    [J]. XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION IV, 2022, 5-4 : 121 - 128
  • [45] Persistent Point Feature Histograms for 3D Point Clouds
    Rusu, Radu Bogdan
    Marton, Zoltan Csaba
    Blodow, Nico
    Beetz, Michael
    [J]. IAS-10: INTELLIGENT AUTONOMOUS SYSTEMS 10, 2008, : 119 - 128
  • [46] Monitoring of urban forests using 3D spatial indices based on LiDAR point clouds and voxel approach
    Zieba-Kulawik, Karolina
    Skoczylas, Konrad
    Wezyk, Piotr
    Teller, Jacques
    Mustafa, Ahmed
    Omrani, Hichem
    [J]. URBAN FORESTRY & URBAN GREENING, 2021, 65
  • [47] Segmentation Based Classification of 3D Urban Point Clouds: A Super-Voxel Based Approach with Evaluation
    Aijazi, Ahmad Kamal
    Checchin, Paul
    Trassoudaine, Laurent
    [J]. REMOTE SENSING, 2013, 5 (04) : 1624 - 1650
  • [48] Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds
    He, Chenhang
    Li, Ruihuang
    Li, Shuai
    Zhang, Lei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8407 - 8417
  • [49] CasFormer: Cascaded Transformer Based on Dynamic Voxel Pyramid for 3D Object Detection from Point Clouds
    Li, Xinglong
    Zhang, Xiaowei
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 299 - 311
  • [50] Novel 3D local feature descriptor of point clouds based on spatial voxel homogenization for feature matching
    Jiong Yang
    Jian Zhang
    Zhengyang Cai
    Dongyang Fang
    [J]. Visual Computing for Industry, Biomedicine, and Art, 6