PVSA : A general and elegant sampling algorithm for Voxel-based 3D object detection

被引:0
|
作者
Gong, Diancheng [1 ]
Li, Junru [1 ]
Wang, Chunchun [2 ]
Wang, Zhiling [1 ]
机构
[1] Univ Sci & Technol China, Chinese Acad Sci, Heifei Inst Phys Sci, Inst Intelligent Machines, Hefei, Peoples R China
[2] Anhui Univ Sci & Technol, Chinese Acad Sci, Heifei Inst Phys Sci, Inst Intelligent Machines, Hefei, Peoples R China
关键词
Sampling Algorithm; Voxelization; Small Object Detection; Autonomous Vehicle;
D O I
10.1109/ICCAR61844.2024.10569566
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Perceiving the environment is vital for autonomous vehicles as it serves as the foundation for decision making and path planning. LiDAR is a widely employed sensor, which produces a voluminous and sparsely populated point cloud. For voxel-based 3D object detection methods, the initial step involves the division of the raw point cloud into voxels, the process known as voxelization. Nevertheless, once the number of point clouds contained within a voxel reaches the certain threshold, the allocation of additional point clouds to that voxel ceases. This leads to a greater degree of information loss. Scholars primarily focus on the subsequent stages following voxelization, such as feature extraction and utilization. We first focus on the sampling issue during the voxelization. In the paper, we propose a general and elegant Points in Voxel Sampling Algorithm module named PVSA. During the voxelization, the assignment of all points into their respective voxels continues even after the maximum number of points in a voxel has been reached. For voxels in which the number of internal point clouds exceeds the certain threshold, the farthest distance sampling method is utilized as it ensures a genuine and uniform distribution of the point cloud within the voxel. We conducted an evaluation of the proposed module using the Kitti dataset. Experimental findings suggest that the incorporation of the PVSA module enhances the object detection capabilities of the voxel-based model, particularly in the identification of samll targets like pedestrians. The incorporation of PVSA modules significantly enhances Pillarnet's capacity to recognize pedestrians, resulting in a 46.2% pt improvement in performance at a distance of 20 meters. On average, there is an enhancement of 1.43% pt.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 50 条
  • [1] Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
    Deng, Jiajun
    Shi, Shaoshuai
    Li, Peiwei
    Zhou, Wengang
    Zhang, Yanyong
    Li, Houqiang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1201 - 1209
  • [2] A Voxel-Based 3D Building Detection Algorithm for Airborne LIDAR Point Clouds
    Liying Wang
    Yan Xu
    Yu Li
    [J]. Journal of the Indian Society of Remote Sensing, 2019, 47 : 349 - 358
  • [3] A Voxel-Based 3D Building Detection Algorithm for Airborne LIDAR Point Clouds
    Wang, Liying
    Xu, Yan
    Li, Yu
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2019, 47 (02) : 349 - 358
  • [4] NormalNet: A voxel-based CNN for 3D object classification and retrieval
    Wang, Cheng
    Cheng, Ming
    Sohel, Ferdous
    Bennamoun, Mohammed
    Li, Jonathan
    [J]. NEUROCOMPUTING, 2019, 323 : 139 - 147
  • [5] 3D VOXEL-BASED GRAPHICS - INTRODUCTION
    KAUFMAN, A
    [J]. COMPUTERS & GRAPHICS, 1989, 13 (02) : 133 - 134
  • [6] Voxel Transformer for 3D Object Detection
    Mao, Jiageng
    Xue, Yujing
    Niu, Minzhe
    Bai, Haoyue
    Feng, Jiashi
    Liang, Xiaodan
    Xu, Hang
    Xu, Chunjing
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3144 - 3153
  • [7] Efficient flexible voxel-based two-stage network for 3D object detection in autonomous driving
    Sun, Fanyue
    Tong, Guoxiang
    Song, Yan
    [J]. Applied Soft Computing, 2024, 162
  • [8] A Convolutional Neural Networks Oriented Approach for Voxel-Based 3D Object Classification
    Sirma, Ridvan
    Dinar, Berkan
    Sahin, Yusuf Huseyin
    Unal, Gozde
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [9] Voxel-based 3D face representations for recognition
    Moreno, AB
    Sánchez, A
    Vélez, JF
    [J]. IWSSIP 2005: PROCEEDINGS OF THE 12TH INTERNATIONAL WORSHOP ON SYSTEMS, SIGNALS & IMAGE PROCESSING, 2005, : 283 - 287
  • [10] Voxel-Based Assessment of Printability of 3D Shapes
    Telea, Alexandru
    Jalba, Andrei
    [J]. MATHEMATICAL MORPHOLOGY AND ITS APPLICATIONS TO IMAGE AND SIGNAL PROCESSING, (ISMM 2011), 2011, 6671 : 393 - 404