PVSA : A general and elegant sampling algorithm for Voxel-based 3D object detection

被引：0

作者：

Gong, Diancheng ^{[1
]}

Li, Junru ^{[1
]}

Wang, Chunchun ^{[2
]}

Wang, Zhiling ^{[1
]}

机构：

[1] Univ Sci & Technol China, Chinese Acad Sci, Heifei Inst Phys Sci, Inst Intelligent Machines, Hefei, Peoples R China

[2] Anhui Univ Sci & Technol, Chinese Acad Sci, Heifei Inst Phys Sci, Inst Intelligent Machines, Hefei, Peoples R China

来源：

2024 10TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTIC, ICCAR 2024 | 2024年

关键词：

Sampling Algorithm; Voxelization; Small Object Detection; Autonomous Vehicle;

D O I：

10.1109/ICCAR61844.2024.10569566

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Perceiving the environment is vital for autonomous vehicles as it serves as the foundation for decision making and path planning. LiDAR is a widely employed sensor, which produces a voluminous and sparsely populated point cloud. For voxel-based 3D object detection methods, the initial step involves the division of the raw point cloud into voxels, the process known as voxelization. Nevertheless, once the number of point clouds contained within a voxel reaches the certain threshold, the allocation of additional point clouds to that voxel ceases. This leads to a greater degree of information loss. Scholars primarily focus on the subsequent stages following voxelization, such as feature extraction and utilization. We first focus on the sampling issue during the voxelization. In the paper, we propose a general and elegant Points in Voxel Sampling Algorithm module named PVSA. During the voxelization, the assignment of all points into their respective voxels continues even after the maximum number of points in a voxel has been reached. For voxels in which the number of internal point clouds exceeds the certain threshold, the farthest distance sampling method is utilized as it ensures a genuine and uniform distribution of the point cloud within the voxel. We conducted an evaluation of the proposed module using the Kitti dataset. Experimental findings suggest that the incorporation of the PVSA module enhances the object detection capabilities of the voxel-based model, particularly in the identification of samll targets like pedestrians. The incorporation of PVSA modules significantly enhances Pillarnet's capacity to recognize pedestrians, resulting in a 46.2% pt improvement in performance at a distance of 20 meters. On average, there is an enhancement of 1.43% pt.

引用

页码：59 / 65

页数：7

共 50 条

[1] Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
Deng, Jiajun
Shi, Shaoshuai
Li, Peiwei
Zhou, Wengang
Zhang, Yanyong
Li, Houqiang
[J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1201 - 1209
[2] A Voxel-Based 3D Building Detection Algorithm for Airborne LIDAR Point Clouds
Liying Wang
Yan Xu
Yu Li
[J]. Journal of the Indian Society of Remote Sensing, 2019, 47 : 349 - 358
[3] A Voxel-Based 3D Building Detection Algorithm for Airborne LIDAR Point Clouds
Wang, Liying
Xu, Yan
Li, Yu
[J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2019, 47 (02) : 349 - 358
[4] NormalNet: A voxel-based CNN for 3D object classification and retrieval
Wang, Cheng
Cheng, Ming
Sohel, Ferdous
Bennamoun, Mohammed
Li, Jonathan
[J]. NEUROCOMPUTING, 2019, 323 : 139 - 147
[5] 3D VOXEL-BASED GRAPHICS - INTRODUCTION
KAUFMAN, A
[J]. COMPUTERS & GRAPHICS, 1989, 13 (02) : 133 - 134
[6] Voxel Transformer for 3D Object Detection
Mao, Jiageng
Xue, Yujing
Niu, Minzhe
Bai, Haoyue
Feng, Jiashi
Liang, Xiaodan
Xu, Hang
Xu, Chunjing
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3144 - 3153
[7] Efficient flexible voxel-based two-stage network for 3D object detection in autonomous driving
Sun, Fanyue
Tong, Guoxiang
Song, Yan
[J]. Applied Soft Computing, 2024, 162
[8] A Convolutional Neural Networks Oriented Approach for Voxel-Based 3D Object Classification
Sirma, Ridvan
Dinar, Berkan
Sahin, Yusuf Huseyin
Unal, Gozde
[J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
[9] Voxel-based 3D face representations for recognition
Moreno, AB
Sánchez, A
Vélez, JF
[J]. IWSSIP 2005: PROCEEDINGS OF THE 12TH INTERNATIONAL WORSHOP ON SYSTEMS, SIGNALS & IMAGE PROCESSING, 2005, : 283 - 287
[10] Voxel-Based Assessment of Printability of 3D Shapes
Telea, Alexandru
Jalba, Andrei
[J]. MATHEMATICAL MORPHOLOGY AND ITS APPLICATIONS TO IMAGE AND SIGNAL PROCESSING, (ISMM 2011), 2011, 6671 : 393 - 404

← 1 2 3 4 5 →