NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

被引：119

作者：

Sun, Jiaming ^{[1
,2
,3
]}

Xie, Yiming ^{[1
,3
]}

Chen, Linghao ^{[1
,3
]}

Zhou, Xiaowei ^{[1
,3
]}

Bao, Hujun ^{[1
,3
]}

机构：

[1] Zhejiang Univ, Hangzhou, Peoples R China

[2] SenseTime Res, Hong Kong, Peoples R China

[3] State Key Lab CAD & CG & ZJU SenseTime Joint Lab, Hangzhou, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.01534

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction. The experiments on ScanNet and 7-Scenes datasets show that our system outperforms state-of-the-art methods in terms of both accuracy and speed. To the best of our knowledge, this is the first learning-based system that is able to reconstruct dense coherent 3D geometry in real-time. Code is available at the project page: https://zju3dv.github.io/neuralrecon/.

引用

页码：15593 / 15602

页数：10

共 50 条

[1] Real-Time 3D Pose Reconstruction of Human Body from Monocular Video Sequences
Zhu, LiangJia
Hwang, Jenq-Neng
Chen, Chih-Chang
Lin, Ming-Hui
Yen, Chen-Lan
[J]. ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 717 - +
[2] Detailed Real-Time Urban 3D Reconstruction from Video
M. Pollefeys
D. Nistér
J.-M. Frahm
A. Akbarzadeh
P. Mordohai
B. Clipp
C. Engels
D. Gallup
S.-J. Kim
P. Merrell
C. Salmi
S. Sinha
B. Talton
L. Wang
Q. Yang
H. Stewénius
R. Yang
G. Welch
H. Towles
[J]. International Journal of Computer Vision, 2008, 78 : 143 - 167
[3] Detailed real-time urban 3D reconstruction from video
Pollefeys, M.
Nister, D.
Frahm, J. -M.
Akbarzadeh, A.
Mordohai, P.
Clipp, B.
Engels, C.
Gallup, D.
Kim, S. -J.
Merrell, P.
Salmi, C.
Sinha, S.
Talton, B.
Wang, L.
Yang, Q.
Stewenius, H.
Yang, R.
Welch, G.
Towles, H.
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 78 (2-3) : 143 - 167
[4] Real-time 3D features reconstruction through monocular vision
Liverani, Alfredo
Leali, Francesco
Pellicciari, Marcello
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2010, 4 (02): : 103 - 112
[5] Real-Time 3D Reconstruction Method Based on Monocular Vision
Jia, Qingyu
Chang, Liang
Qiang, Baohua
Zhang, Shihao
Xie, Wu
Yang, Xianyi
Sun, Yangchang
Yang, Minghao
[J]. SENSORS, 2021, 21 (17)
[6] Real-time active 3D shape reconstruction for 3D video
Wu, X
Matsuyama, T
[J]. ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 186 - 191
[7] Near real-time 3D reconstruction from InIm video stream
Chaikalis, D.
Passalis, G.
Sgouros, N.
Maroulis, D.
Theoharis, T.
[J]. IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2008, 5112 : 336 - 347
[8] Parallel processing for real-time 3D reconstruction from video streams
Duckworth, Tobias
Roberts, David J.
[J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2014, 9 (03) : 427 - 445
[9] Parallel processing for real-time 3D reconstruction from video streams
Tobias Duckworth
David J. Roberts
[J]. Journal of Real-Time Image Processing, 2014, 9 : 427 - 445
[10] Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction
Wang, Likang
Gong, Yue
Wang, Qirui
Zhou, Kaixuan
Chen, Lei
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2599 - 2607

← 1 2 3 4 5 →