Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds

被引:122
|
作者
Kuang, Hongwu [1 ]
Wang, Bei [1 ]
An, Jianping [1 ]
Zhang, Ming [1 ]
Zhang, Zehan [1 ]
机构
[1] Hangzhou Hikvis Digital Technol Co Ltd, Hangzhou 310052, Peoples R China
关键词
3D object detection; multi-scale voxel feature aggregation; LIDAR; autonomous driving;
D O I
10.3390/s20030704
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Object detection in point cloud data is one of the key components in computer vision systems, especially for autonomous driving applications. In this work, we present Voxel-Feature Pyramid Network, a novel one-stage 3D object detector that utilizes raw data from LIDAR sensors only. The core framework consists of an encoder network and a corresponding decoder followed by a region proposal network. Encoder extracts and fuses multi-scale voxel information in a bottom-up manner, whereas decoder fuses multiple feature maps from various scales by Feature Pyramid Network in a top-down way. Extensive experiments show that the proposed method has better performance on extracting features from point data and demonstrates its superiority over some baselines on the challenging KITTI-3D benchmark, obtaining good performance on both speed and accuracy in real-world scenarios.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection
    Ye, Maosheng
    Xu, Shuangjie
    Cao, Tongyi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1628 - 1637
  • [22] EFNet: enhancing feature information for 3D object detection in LiDAR point clouds
    Meng, Xin
    Zhou, Yuan
    Du, Kaiyue
    Ma, Jun
    Meng, Jin
    Kumar, Aakash
    Lv, Jiahang
    Kim, Jonghyuk
    Wang, Shifeng
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2024, 41 (04) : 739 - 748
  • [23] Reinforced Voxel-RCNN: An Efficient 3D Object Detection Method Based on Feature Aggregation*
    Jiang, Jia-ji
    Wan, Hai-bin
    Sun, Hong-min
    Qin, Tuan-fa
    Wang, Zheng-qiang
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (09) : 1228 - 1238
  • [24] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
    Peng Wu
    Lipeng Gu
    Xuefeng Yan
    Haoran Xie
    Fu Lee Wang
    Gary Cheng
    Mingqiang Wei
    [J]. The Visual Computer, 2023, 39 (6) : 2425 - 2440
  • [25] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
    Huo, Weile
    Jing, Tao
    Ren, Shuang
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 6925 - 6945
  • [26] Vehicle Detection Algorithm Based on Voxel Pillars from LiDAR Point Clouds
    Cheng X.
    Wang H.-F.
    Zhou J.-M.
    Zhang W.
    Zhao X.-M.
    [J]. Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2023, 36 (03): : 247 - 260
  • [27] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
    Weile Huo
    Tao Jing
    Shuang Ren
    [J]. Neural Processing Letters, 2023, 55 : 6925 - 6945
  • [28] SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds
    He, Qingdong
    Wang, Zhengning
    Zeng, Hao
    Zeng, Yi
    Liu, Yijun
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 870 - 878
  • [29] Multi-Scale Point-Wise Convolutional Neural Networks for 3D Object Segmentation From LiDAR Point Clouds in Large-Scale Environments
    Ma, Lingfei
    Li, Ying
    Li, Jonathan
    Tan, Weikai
    Yu, Yongtao
    Chapman, Michael A.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) : 821 - 836
  • [30] Dense Voxel Fusion for 3D Object Detection
    Mahmoud, Anas
    Hu, Jordan S. K.
    Waslander, Steven L.
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 663 - 672