NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

被引:119
|
作者
Sun, Jiaming [1 ,2 ,3 ]
Xie, Yiming [1 ,3 ]
Chen, Linghao [1 ,3 ]
Zhou, Xiaowei [1 ,3 ]
Bao, Hujun [1 ,3 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] State Key Lab CAD & CG & ZJU SenseTime Joint Lab, Hangzhou, Peoples R China
关键词
D O I
10.1109/CVPR46437.2021.01534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction. The experiments on ScanNet and 7-Scenes datasets show that our system outperforms state-of-the-art methods in terms of both accuracy and speed. To the best of our knowledge, this is the first learning-based system that is able to reconstruct dense coherent 3D geometry in real-time. Code is available at the project page: https://zju3dv.github.io/neuralrecon/.
引用
收藏
页码:15593 / 15602
页数:10
相关论文
共 50 条
  • [21] Time-coherent 3D animation reconstruction from RGB-D video
    Naveed Ahmed
    Salam Khalifa
    [J]. Signal, Image and Video Processing, 2016, 10 : 783 - 790
  • [22] Reconstruction of Personalized 3D Face Rigs from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Casas, Dan
    Valgaerts, Levi
    Varanasi, Kiran
    Perez, Patrick
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (03):
  • [23] Joint Semantic Segmentation and 3D Reconstruction from Monocular Video
    Kundu, Abhijit
    Li, Yin
    Dellaert, Frank
    Li, Fuxin
    Rehg, James M.
    [J]. COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 703 - 718
  • [24] RGB2Hands: Real-Time Tracking of 3D Hand Interactions from Monocular RGB Video
    Wang, Jiayi
    Mueller, Franziska
    Bernard, Florian
    Sorli, Suzanne
    Sotnychenko, Oleksandr
    Qian, Neng
    Otaduy, Miguel A.
    Casas, Dan
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (06):
  • [25] 3D fractal compression for real-time video
    Chabarchine, A
    Creutzburg, R
    [J]. ISPA 2001: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2001, : 570 - 573
  • [26] GANerated Hands for Real-Time 3D Hand Tracking from Monocular RGB
    Mueller, Franziska
    Bernard, Florian
    Sotnychenko, Oleksandr
    Mehta, Dushyant
    Sridhar, Srinath
    Casas, Dan
    Theobalt, Christian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 49 - 59
  • [27] StereoScan: Dense 3d Reconstruction in Real-time
    Geiger, Andreas
    Ziegler, Julius
    Stiller, Christoph
    [J]. 2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 963 - 968
  • [28] DenseMatch: a dataset for real-time 3D reconstruction
    Lombardi, Marco
    Savardi, Mattia
    Signoroni, Alberto
    [J]. DATA IN BRIEF, 2021, 39
  • [29] Real-Time Active Multiview 3D Reconstruction
    Ide, Kai
    Sikora, Thomas
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER VISION IN REMOTE SENSING, 2012, : 203 - 208
  • [30] REAL-TIME 3D RECONSTRUCTION FROM IMAGES TAKEN FROM AN UAV
    Zingoni, A.
    Diani, M.
    Corsini, G.
    Masini, A.
    [J]. ISPRS GEOSPATIAL WEEK 2015, 2015, 40-3 (W3): : 313 - 319