NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

Cited by: 119

Authors:

Sun, Jiaming [1,2,3]

Xie, Yiming [1,3]

Chen, Linghao [1,3]

Zhou, Xiaowei [1,3]

Bao, Hujun [1,3]

Affiliations:

[1] Zhejiang University, Hangzhou, People's Republic of China

[2] SenseTime Research, Hong Kong, People's Republic of China

[3] State Key Lab of CAD & CG and ZJU-SenseTime Joint Lab, Hangzhou, People's Republic of China

Source:

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021 | 2021

Keywords:

DOI:

10.1109/CVPR46437.2021.01534

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction. The experiments on ScanNet and 7-Scenes datasets show that our system outperforms state-of-the-art methods in terms of both accuracy and speed. To the best of our knowledge, this is the first learning-based system that is able to reconstruct dense coherent 3D geometry in real-time. Code is available at the project page: https://zju3dv.github.io/neuralrecon/.


Pages: 15593-15602

Page count: 10

