Multi-view PointNet for 3D Scene Understanding

Cited by: 77

Authors:

Jaritz, Maximilian [1]

Gu, Jiayuan [2]

Su, Hao [2]

Affiliations:

[1] INRIA, Valeo, Rocquencourt, France

[2] Univ Calif San Diego, San Diego, CA USA

Source:

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019

DOI:

10.1109/ICCVW.2019.00494

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds. However, fusion is challenging because 2D and 3D data live in different spaces. In this work, we propose MVPNet (Multi-View PointNet), where we aggregate 2D multi-view image features into 3D point clouds, and then use a point based network to fuse the features in 3D canonical space to predict 3D semantic labels. To this end, we introduce view selection along with a 2D-3D feature aggregation module. Extensive experiments show the benefit of leveraging features from dense images and reveal superior robustness to varying point cloud density compared to 3D-only methods. On the ScanNetV2 [4] benchmark, our MVPNet significantly outperforms prior point cloud based approaches on the task of 3D Semantic Segmentation. It is much faster to train than the large networks of the sparse voxel approach [6]. We provide solid ablation studies to ease the future design of 2D-3D fusion methods and their extension to other tasks, as we showcase for 3D instance segmentation.

Pages: 3995-4003

Page count: 9
