NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

被引:119
|
作者
Sun, Jiaming [1 ,2 ,3 ]
Xie, Yiming [1 ,3 ]
Chen, Linghao [1 ,3 ]
Zhou, Xiaowei [1 ,3 ]
Bao, Hujun [1 ,3 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] State Key Lab CAD & CG & ZJU SenseTime Joint Lab, Hangzhou, Peoples R China
关键词
D O I
10.1109/CVPR46437.2021.01534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction. The experiments on ScanNet and 7-Scenes datasets show that our system outperforms state-of-the-art methods in terms of both accuracy and speed. To the best of our knowledge, this is the first learning-based system that is able to reconstruct dense coherent 3D geometry in real-time. Code is available at the project page: https://zju3dv.github.io/neuralrecon/.
引用
收藏
页码:15593 / 15602
页数:10
相关论文
共 50 条
  • [1] Real-Time 3D Pose Reconstruction of Human Body from Monocular Video Sequences
    Zhu, LiangJia
    Hwang, Jenq-Neng
    Chen, Chih-Chang
    Lin, Ming-Hui
    Yen, Chen-Lan
    [J]. ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 717 - +
  • [2] Detailed Real-Time Urban 3D Reconstruction from Video
    M. Pollefeys
    D. Nistér
    J.-M. Frahm
    A. Akbarzadeh
    P. Mordohai
    B. Clipp
    C. Engels
    D. Gallup
    S.-J. Kim
    P. Merrell
    C. Salmi
    S. Sinha
    B. Talton
    L. Wang
    Q. Yang
    H. Stewénius
    R. Yang
    G. Welch
    H. Towles
    [J]. International Journal of Computer Vision, 2008, 78 : 143 - 167
  • [3] Detailed real-time urban 3D reconstruction from video
    Pollefeys, M.
    Nister, D.
    Frahm, J. -M.
    Akbarzadeh, A.
    Mordohai, P.
    Clipp, B.
    Engels, C.
    Gallup, D.
    Kim, S. -J.
    Merrell, P.
    Salmi, C.
    Sinha, S.
    Talton, B.
    Wang, L.
    Yang, Q.
    Stewenius, H.
    Yang, R.
    Welch, G.
    Towles, H.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 78 (2-3) : 143 - 167
  • [4] Real-time 3D features reconstruction through monocular vision
    Liverani, Alfredo
    Leali, Francesco
    Pellicciari, Marcello
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2010, 4 (02): : 103 - 112
  • [5] Real-Time 3D Reconstruction Method Based on Monocular Vision
    Jia, Qingyu
    Chang, Liang
    Qiang, Baohua
    Zhang, Shihao
    Xie, Wu
    Yang, Xianyi
    Sun, Yangchang
    Yang, Minghao
    [J]. SENSORS, 2021, 21 (17)
  • [6] Real-time active 3D shape reconstruction for 3D video
    Wu, X
    Matsuyama, T
    [J]. ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 186 - 191
  • [7] Near real-time 3D reconstruction from InIm video stream
    Chaikalis, D.
    Passalis, G.
    Sgouros, N.
    Maroulis, D.
    Theoharis, T.
    [J]. IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2008, 5112 : 336 - 347
  • [8] Parallel processing for real-time 3D reconstruction from video streams
    Duckworth, Tobias
    Roberts, David J.
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2014, 9 (03) : 427 - 445
  • [9] Parallel processing for real-time 3D reconstruction from video streams
    Tobias Duckworth
    David J. Roberts
    [J]. Journal of Real-Time Image Processing, 2014, 9 : 427 - 445
  • [10] Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction
    Wang, Likang
    Gong, Yue
    Wang, Qirui
    Zhou, Kaixuan
    Chen, Lei
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2599 - 2607