Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds

被引:123
|
作者
Kuang, Hongwu [1 ]
Wang, Bei [1 ]
An, Jianping [1 ]
Zhang, Ming [1 ]
Zhang, Zehan [1 ]
机构
[1] Hangzhou Hikvis Digital Technol Co Ltd, Hangzhou 310052, Peoples R China
关键词
3D object detection; multi-scale voxel feature aggregation; LIDAR; autonomous driving;
D O I
10.3390/s20030704
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Object detection in point cloud data is one of the key components in computer vision systems, especially for autonomous driving applications. In this work, we present Voxel-Feature Pyramid Network, a novel one-stage 3D object detector that utilizes raw data from LIDAR sensors only. The core framework consists of an encoder network and a corresponding decoder followed by a region proposal network. Encoder extracts and fuses multi-scale voxel information in a bottom-up manner, whereas decoder fuses multiple feature maps from various scales by Feature Pyramid Network in a top-down way. Extensive experiments show that the proposed method has better performance on extracting features from point data and demonstrates its superiority over some baselines on the challenging KITTI-3D benchmark, obtaining good performance on both speed and accuracy in real-world scenarios.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] SMS-Net: Sparse multi-scale voxel feature aggregation network for LiDAR-based 3D object detection
    Liu, Sheng
    Huang, Wenhao
    Cao, Yifeng
    Li, Dingda
    Chen, Shengyong
    [J]. NEUROCOMPUTING, 2022, 501 : 555 - 565
  • [2] Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection
    Lin, Baowei
    Wang, Fasheng
    Zhao, Fangda
    Sun, Yi
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05): : 1209 - 1224
  • [3] Multi-Scale Keypoints Feature Fusion Network for 3D Object Detection from Point Clouds
    Zhang, Xu
    Bai, Linjuan
    Zhang, Zuyu
    Li, Yan
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12
  • [4] Retraction Note: Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection
    Baowei Lin
    Fasheng Wang
    Fangda Zhao
    Yi Sun
    [J]. Neural Computing and Applications, 2024, 36 (18) : 11065 - 11065
  • [5] RETRACTED ARTICLE: Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection
    Baowei Lin
    Fasheng Wang
    Fangda Zhao
    Yi Sun
    [J]. Neural Computing and Applications, 2018, 29 : 1209 - 1224
  • [6] P2V-RCNN: Point to Voxel Feature Learning for 3D Object Detection From Point Clouds
    Li, Jiale
    Sun, Yu
    Luo, Shujie
    Zhu, Ziqi
    Dai, Hang
    Krylov, Andrey S.
    Ding, Yong
    Shao, Ling
    [J]. IEEE ACCESS, 2021, 9 : 98249 - 98260
  • [7] MSG-Voxel-GAN: multi-scale gradient voxel GAN for 3D object generation
    Wang, Bingxu
    Lan, Jinhui
    Li, Feifan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023,
  • [8] Planar object detection from 3D point clouds based on pyramid voxel representation
    Hu, Zhaozheng
    Bai, Dongfang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (22) : 24343 - 24357
  • [9] Planar object detection from 3D point clouds based on pyramid voxel representation
    Zhaozheng Hu
    Dongfang Bai
    [J]. Multimedia Tools and Applications, 2017, 76 : 24343 - 24357
  • [10] DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds
    Ning, Yaqian
    Cao, Jie
    Bao, Chun
    Hao, Qun
    [J]. REMOTE SENSING, 2023, 15 (23)