Multi-view PointNet for 3D Scene Understanding

被引:77
|
作者
Jaritz, Maximilian [1 ]
Gu, Jiayuan [2 ]
Su, Hao [2 ]
机构
[1] INRIA, Valeo, Rocquencourt, France
[2] Univ Calif San Diego, San Diego, CA USA
关键词
D O I
10.1109/ICCVW.2019.00494
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds. However, fusion is challenging because 2D and 3D data live in different spaces. In this work, we propose MVPNet (Multi-View PointNet), where we aggregate 2D multi-view image features into 3D point clouds, and then use a point based network to fuse the features in 3D canonical space to predict 3D semantic labels. To this end, we introduce view selection along with a 2D-3D feature aggregation module. Extensive experiments show the benefit of leveraging features from dense images and reveal superior robustness to varying point cloud density compared to 3D-only methods. On the ScanNetV2 [4] benchmark, our MVPNet significantly outperforms prior point cloud based approaches on the task of 3D Semantic Segmentation. It is much faster to train than the large networks of the sparse voxel approach [6]. We provide solid ablation studies to ease the future design of 2D-3D fusion methods and their extension to other tasks, as we showcase for 3D instance segmentation.
引用
收藏
页码:3995 / 4003
页数:9
相关论文
共 50 条
  • [21] A COMPACT 3D REPRESENTATION FOR MULTI-VIEW VIDEO
    Salvador, Jordi
    Casas, Josep R.
    [J]. INTERNATIONAL CONFERENCE ON 3D IMAGING 2011 (IC3D 2011), 2011,
  • [22] Multi-view 3D display using waveguides
    Lee, Byoungho
    Lee, Chang-Kun
    [J]. INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING (ICOPEN 2015), 2015, 9524
  • [23] A method of multi-view intraoral 3D measurement
    Zhao, Huijie
    Wang, Zhen
    Jiang, Hongzhi
    Xu, Yang
    Lv, Peijun
    Sun, Yunchun
    [J]. INTERNATIONAL CONFERENCE ON PHOTONICS AND OPTICAL ENGINEERING (ICPOE 2014), 2015, 9449
  • [24] Registration of arbitrary multi-view 3D acquisitions
    Chane, Camille Simon
    Schuetze, Rainer
    Boochs, Frank
    Marzani, Franck S.
    [J]. COMPUTERS IN INDUSTRY, 2013, 64 (09) : 1082 - 1089
  • [25] Multi-View and 3D Deformable Part Models
    Pepik, Bojan
    Stark, Michael
    Gehler, Peter
    Schiele, Bernt
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (11) : 2232 - 2245
  • [26] Efficient Caching for Multi-View 3D Videos
    Lee, Ji-Tang
    Yang, De-Nian
    Liao, Wanjiun
    [J]. 2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [27] Multi-view 3D shape style transformation
    Dalian University of Technology, Dalian, China
    [J]. Visual Comput, 1600, 2 (669-684):
  • [28] 3D Reconstruction with Multi-view Texture Mapping
    Ye, Xiaodan
    Wang, Lianghao
    Li, Dongxiao
    Zhang, Ming
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 198 - 207
  • [29] Multi-View Stereo 3D Edge Reconstruction
    Bignoli, Andrea
    Romanoni, Andrea
    Matteucci, Matteo
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 867 - 875
  • [30] Multi-view 3D shape style transformation
    Liu, Xiuping
    Huang, Hua
    Wang, Weiming
    Zhou, Jun
    [J]. VISUAL COMPUTER, 2022, 38 (02): : 669 - 684