Multi-View Classification and 3D Bounding Box Regression Networks

被引:0
|
作者
Pramerdorfer, Christopher [1 ,2 ]
Kampel, Martin [2 ]
Van Loock, Mark [3 ]
机构
[1] Cogvis, Vienna, Austria
[2] TU Wien, Vienna, Austria
[3] Toyota Motor Europe, Brussels, Belgium
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method for jointly classifying objects in depth maps and regressing amodal (extending beyond occluded parts) 3D bounding boxes in a way that is highly robust to occlusions. Our method is based on a novel multi-view convolutional neural network architecture with shared layers for both tasks, improving efficiency. The network processes views that encode object geometry and occlusion information and outputs class scores and bounding box coordinates in world coordinates, requiring no post-processing steps. We demonstrate the effectiveness of our method by example of fall detection, presenting a new dataset of 40k samples rendered from 3D models. On this dataset, our method achieves an average classification accuracy above 97% and a regression error below 10 cm at occlusion ratios of up to 90%. The dataset and trained models are publicly available.
引用
收藏
页码:734 / 739
页数:6
相关论文
共 50 条
  • [1] Deformable convolutional networks for multi-view 3D shape classification
    Ma, Pengfei
    Ma, Jie
    Wang, Xujiao
    Yang, Lichuang
    Wang, Nannan
    [J]. ELECTRONICS LETTERS, 2018, 54 (24) : 1373 - 1374
  • [2] 3D multi-view convolutional neural networks for lung nodule classification
    Kang, Guixia
    Liu, Kui
    Hou, Beibei
    Zhang, Ningbo
    [J]. PLOS ONE, 2017, 12 (11):
  • [3] Multi-view SoftPool attention convolutional networks for 3D model classification
    Wang, Wenju
    Wang, Xiaolin
    Chen, Gang
    Zhou, Haoran
    [J]. FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [4] Multi-view expressive graph neural networks for 3D CAD model classification
    Li, Shuang
    Corney, Jonathan
    [J]. COMPUTERS IN INDUSTRY, 2023, 151
  • [5] 3D object Classification using Bounding box
    Malwe, Gauri
    Kshirsagar, Deepak
    Madkaikar, Ashish
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [6] Joint Multi-view 2D Convolutional Neural Networks for 3D Object Classification
    Xu, Jinglin
    Zhang, Xiangsen
    Li, Wenbin
    Liu, Xinwang
    Han, Junwei
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3202 - 3208
  • [7] A generalizable approach for multi-view 3D human pose regression
    Kadkhodamohammadi, Abdolrahim
    Padoy, Nicolas
    [J]. MACHINE VISION AND APPLICATIONS, 2020, 32 (01)
  • [8] A generalizable approach for multi-view 3D human pose regression
    Abdolrahim Kadkhodamohammadi
    Nicolas Padoy
    [J]. Machine Vision and Applications, 2021, 32
  • [9] Volumetric and Multi-View CNNs for Object Classification on 3D Data
    Qi, Charles R.
    Su, Hao
    Niessner, Matthias
    Dai, Angela
    Yan, Mengyuan
    Guibas, Leonidas J.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5648 - 5656
  • [10] MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image
    Wang, Jinglu
    Sun, Bo
    Lu, Yan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8949 - 8956