P2V-RCNN: Point to Voxel Feature Learning for 3D Object Detection From Point Clouds

被引:14
|
作者
Li, Jiale [1 ]
Sun, Yu [2 ]
Luo, Shujie [1 ]
Zhu, Ziqi [1 ]
Dai, Hang [3 ]
Krylov, Andrey S. [4 ]
Ding, Yong [1 ]
Shao, Ling [5 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Sch Micronano Elect, Hangzhou 311200, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence, Comp Vis Dept, Abu Dhabi, U Arab Emirates
[4] Lomonosov Moscow State Univ, Lab Math Methods Image Proc, Moscow 119991, Russia
[5] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
关键词
Three-dimensional displays; Feature extraction; Proposals; Semantics; Object detection; Cameras; Licenses; 3D object detection; point clouds; attention mechanism; autonomous driving;
D O I
10.1109/ACCESS.2021.3094562
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The most recent 3D object detectors for point clouds rely on the coarse voxel-based representation rather than the accurate point-based representation due to a higher box recall in the voxel-based Region Proposal Network (RPN). However, the detection accuracy is severely restricted by the information loss of pose details in the voxels. Different from considering the point cloud as voxel or point representation only, we propose a point-to-voxel feature learning approach to voxelize the point cloud with both the point-wise semantic and local spatial features, which maintains the voxel-wise features to build the high-recall voxel-based RPN and also provides the accurate point-wise features for refining the detection results. Another difficulty in object detection for point cloud is that the visible part varies a lot against the full view of object because of the perspective issues in data acquisition. To address this, we propose an attentive corner aggregation module to attentively aggregate the features of local point cloud surrounding a 3D proposal from the perspectives of eight corners in the proposal 3D bounding box. The experimental results on the competitive KITTI 3D object detection benchmark show that the proposed method achieves state-of-the-art performance.
引用
收藏
页码:98249 / 98260
页数:12
相关论文
共 50 条
  • [1] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
    Peng Wu
    Lipeng Gu
    Xuefeng Yan
    Haoran Xie
    Fu Lee Wang
    Gary Cheng
    Mingqiang Wei
    [J]. The Visual Computer, 2023, 39 (6) : 2425 - 2440
  • [2] 3D Cascade RCNN: High Quality Object Detection in Point Clouds
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Mei, Tao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5706 - 5719
  • [3] Mask-SL RCNN: Feature-Enhanced 3D Object Detection Network for Point Clouds
    Zhong, Yuanhong
    Yang, Guangxia
    Deng, Dihang
    Tang, Panliang
    Ren, Fan
    [J]. IEEE PHOTONICS JOURNAL, 2023, 15 (05):
  • [4] Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds
    Kuang, Hongwu
    Wang, Bei
    An, Jianping
    Zhang, Ming
    Zhang, Zehan
    [J]. SENSORS, 2020, 20 (03)
  • [5] PV-RCNN plus plus : semantical point-voxel feature interaction for 3D object detection
    Wu, Peng
    Gu, Lipeng
    Yan, Xuefeng
    Xie, Haoran
    Wang, Fu Lee
    Cheng, Gary
    Wei, Mingqiang
    [J]. VISUAL COMPUTER, 2023, 39 (06): : 2425 - 2440
  • [6] Planar object detection from 3D point clouds based on pyramid voxel representation
    Hu, Zhaozheng
    Bai, Dongfang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (22) : 24343 - 24357
  • [7] Planar object detection from 3D point clouds based on pyramid voxel representation
    Zhaozheng Hu
    Dongfang Bai
    [J]. Multimedia Tools and Applications, 2017, 76 : 24343 - 24357
  • [8] DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds
    Ning, Yaqian
    Cao, Jie
    Bao, Chun
    Hao, Qun
    [J]. REMOTE SENSING, 2023, 15 (23)
  • [9] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shaoshuai Shi
    Li Jiang
    Jiajun Deng
    Zhe Wang
    Chaoxu Guo
    Jianping Shi
    Xiaogang Wang
    Hongsheng Li
    [J]. International Journal of Computer Vision, 2023, 131 : 531 - 551
  • [10] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
    Weile Huo
    Tao Jing
    Shuang Ren
    [J]. Neural Processing Letters, 2023, 55 : 6925 - 6945