Neural 3D Video Synthesis from Multi-view Video

被引:42
|
作者
Li, Tianye [1 ,2 ]
Slavcheva, Mira [2 ]
Zollhoefer, Michael [2 ]
Green, Simon [2 ]
Lassner, Christoph [2 ]
Kim, Changil [3 ]
Schmidt, Tanner [2 ]
Lovegrove, Steven [2 ]
Goesele, Michael [2 ]
Newcombe, Richard [2 ]
Lv, Zhaoyang [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
[2] Real Labs Res, Redmond, WA 98052 USA
[3] Meta, Menlo Pk, CA USA
关键词
SILHOUETTE;
D O I
10.1109/CVPR52688.2022.00544
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion interpolation. Our approach takes the high quality and compactness of static neural radiance fields in a new direction: to a model-free, dynamic setting. At the core of our approach is a novel time-conditioned neural radiance field that represents scene dynamics using a set of compact latent codes. We are able to significantly boost the training speed and perceptual quality of the generated imagery by a novel hierarchical training scheme in combination with ray importance sampling. Our learned representation is highly compact and able to represent a 10 second 30 FPS multiview video recording by 18 cameras with a model size of only 28MB. We demonstrate that our method can render high-fidelity wide-angle novel views at over 1K resolution, even for complex and dynamic scenes. We perform an extensive qualitative and quantitative evaluation that shows that our approach outperforms the state of the art. Project website: https://neural- 3d-video.github.io/.
引用
收藏
页码:5511 / 5521
页数:11
相关论文
共 50 条
  • [1] STEREOSCOPIC 3D VIEW SYNTHESIS FROM UNSYNCHRONIZED MULTI-VIEW VIDEO
    Klose, Felix
    Ruhl, Kai
    Lipski, Christian
    Linz, Christian
    Magnor, Markus
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1904 - 1908
  • [2] MULTI-VIEW 3D RECONSTRUCTION FROM VIDEO WITH TRANSFORMER
    Zhong, Yijie
    Sun, Zhengxing
    Sun, Yunhan
    Luo, Shoutong
    Wang, Yi
    Zhang, Wei
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1661 - 1665
  • [3] A COMPACT 3D REPRESENTATION FOR MULTI-VIEW VIDEO
    Salvador, Jordi
    Casas, Josep R.
    [J]. INTERNATIONAL CONFERENCE ON 3D IMAGING 2011 (IC3D 2011), 2011,
  • [4] Multi-view video compression for 3D displays
    Zwicker, Matthias
    Yea, Sehoon
    Vetro, Anthony
    Forlines, Clifton
    Matusik, Wojciech
    Pfister, Hanspeter
    [J]. CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1506 - +
  • [5] IMPROVED MULTI-VIEW DEPTH ESTIMATION FOR VIEW SYNTHESIS IN 3D VIDEO CODING
    Zhang, Qiuwen
    An, Ping
    Zhang, Yan
    Shen, Liquan
    Zhang, Zhaoyang
    [J]. 2011 3DTV CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2011,
  • [6] Virtual View Adaptation for 3D Multi-View Video Streaming
    Petrovic, Goran
    Do, Luat
    Zinger, Sveta
    de With, Peter H. N.
    [J]. STEREOSCOPIC DISPLAYS AND APPLICATIONS XXI, 2010, 7524
  • [7] A Multi-view Projection 3D Video Processing Technology
    Dou, Yuchao
    Dai, Taotao
    Fu, Lei
    Huang, Ziqiang
    [J]. 2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 1162 - 1165
  • [8] Color calibration of multi-view video plus depth for advanced 3D video
    Fezza, Sid Ahmed
    Larabi, Mohamed-Chaker
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 : 177 - 191
  • [9] Color calibration of multi-view video plus depth for advanced 3D video
    Sid Ahmed Fezza
    Mohamed-Chaker Larabi
    [J]. Signal, Image and Video Processing, 2015, 9 : 177 - 191
  • [10] 3D High-Efficiency Video Coding for Multi-View Video and Depth Data
    Mueller, Karsten
    Schwarz, Heiko
    Marpe, Detlev
    Bartnik, Christian
    Bosse, Sebastian
    Brust, Heribert
    Hinz, Tobias
    Lakshman, Haricharan
    Merkle, Philipp
    Rhee, Franz Hunn
    Tech, Gerhard
    Winken, Martin
    Wiegand, Thomas
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) : 3366 - 3378