VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition

Authors
Maturana, Daniel [1]
Scherer, Sebastian [1]
Affiliations
[1] Carnegie Mellon Univ, Inst Robot, Forbes Ave 5000, Pittsburgh, PA 15201 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Robust object recognition is a crucial skill for robots operating autonomously in real world environments. Range sensors such as LiDAR and RGBD cameras are increasingly found in modern robotic systems, providing a rich source of 3D information that can aid in this task. However, many current systems do not fully utilize this information and have trouble efficiently dealing with large amounts of point cloud data. In this paper, we propose VoxNet, an architecture to tackle this problem by integrating a volumetric Occupancy Grid representation with a supervised 3D Convolutional Neural Network (3D CNN). We evaluate our approach on publicly available benchmarks using LiDAR, RGBD, and CAD data. VoxNet achieves accuracy beyond the state of the art while labeling hundreds of instances per second.
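As a rough illustration of the volumetric representation the abstract mentions, the sketch below turns a point cloud into a binary occupancy grid. It is not the authors' implementation: the function name `voxelize`, the default `grid_size` of 32, and the simple "any point in the cell means occupied" rule are all assumptions made for this example, and the abstract does not say which occupancy model the paper uses.

```python
from math import floor


def voxelize(points, grid_size=32):
    """Map (x, y, z) points to a binary occupancy grid (hypothetical helper).

    Returns a nested list of shape grid_size x grid_size x grid_size where a
    cell is 1 if at least one point falls inside it and 0 otherwise.
    """
    if not points:
        raise ValueError("points must be non-empty")

    # Axis-aligned bounding box of the cloud.
    lo = [min(p[a] for p in points) for a in range(3)]
    hi = [max(p[a] for p in points) for a in range(3)]

    # Use one cubic voxel size for all axes so object proportions are kept.
    # Fall back to 1.0 when every point is identical (zero extent).
    size = max(h - l for h, l in zip(hi, lo)) or 1.0

    grid = [[[0] * grid_size for _ in range(grid_size)]
            for _ in range(grid_size)]

    for p in points:
        # Scale each coordinate into [0, grid_size] and clamp the upper edge
        # so points on the far boundary land in the last cell.
        i, j, k = (
            min(grid_size - 1, floor((p[a] - lo[a]) / size * grid_size))
            for a in range(3)
        )
        grid[i][j][k] = 1

    return grid


if __name__ == "__main__":
    cloud = [(0.0, 0.0, 0.0), (0.5, 0.5, 0.5), (1.0, 1.0, 1.0)]
    grid = voxelize(cloud, grid_size=4)
    occupied = sum(v for plane in grid for row in plane for v in row)
    print("occupied cells:", occupied)
```

A dense grid of this kind is the sort of fixed-size input a 3D CNN can consume; a practical pipeline would likely use an array library rather than nested lists.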
Pages: 922-928
Page count: 7