Neural 3D Video Synthesis from Multi-view Video

被引:42
|
作者
Li, Tianye [1 ,2 ]
Slavcheva, Mira [2 ]
Zollhoefer, Michael [2 ]
Green, Simon [2 ]
Lassner, Christoph [2 ]
Kim, Changil [3 ]
Schmidt, Tanner [2 ]
Lovegrove, Steven [2 ]
Goesele, Michael [2 ]
Newcombe, Richard [2 ]
Lv, Zhaoyang [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
[2] Real Labs Res, Redmond, WA 98052 USA
[3] Meta, Menlo Pk, CA USA
关键词
SILHOUETTE;
D O I
10.1109/CVPR52688.2022.00544
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion interpolation. Our approach takes the high quality and compactness of static neural radiance fields in a new direction: to a model-free, dynamic setting. At the core of our approach is a novel time-conditioned neural radiance field that represents scene dynamics using a set of compact latent codes. We are able to significantly boost the training speed and perceptual quality of the generated imagery by a novel hierarchical training scheme in combination with ray importance sampling. Our learned representation is highly compact and able to represent a 10 second 30 FPS multiview video recording by 18 cameras with a model size of only 28MB. We demonstrate that our method can render high-fidelity wide-angle novel views at over 1K resolution, even for complex and dynamic scenes. We perform an extensive qualitative and quantitative evaluation that shows that our approach outperforms the state of the art. Project website: https://neural- 3d-video.github.io/.
引用
收藏
页码:5511 / 5521
页数:11
相关论文
共 50 条
  • [31] CONVERSION OF FREE-VIEWPOINT 3D MULTI-VIEW VIDEO FOR STEREOSCOPIC DISPLAYS
    Do, Luat
    Zinger, Svitlana
    de With, Peter H. N.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1730 - 1734
  • [32] A DASH-Based 3D Multi-view Video Rate Control System
    Su, Tianyu
    Javadtalab, Abbas
    Yassine, Abdulsalam
    Shirmohammadi, Shervin
    [J]. 2014 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2014,
  • [33] Realistic 3D facial animation parameters from mirror-reflected multi-view video
    Lin, IC
    Yeh, JS
    Ouhyoung, M
    [J]. COMPUTER ANIMATION 2001, PROCEEDINGS, 2001, : 2 - +
  • [34] Joint Space-Time-View Error Concealment Algorithms for 3D Multi-View Video
    El Shafai, Walid
    Hrusovsky, Branislav
    El-Khamy, Mostafa
    El-Sharkawy, Mohamed
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [36] Joint Bit Allocation and Rate Control for Coding Multi-View Video Plus Depth Based 3D Video
    Shao, Feng
    Jiang, Gangyi
    Lin, Weishi
    Yu, Mei
    Dai, Qionghai
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) : 1843 - 1854
  • [37] Asymmetric Coding of Multi-View Video Plus Depth Based 3-D Video for View Rendering
    Shao, Feng
    Jiang, Gangyi
    Yu, Mei
    Chen, Ken
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) : 157 - 167
  • [38] VIRTUAL VIEW SYNTHESIS USING MULTI-VIEW VIDEO SEQUENCES
    Jung, Il-Lyong
    Chung, Taeyoung
    Song, Kwanwoong
    Kim, Chang-Su
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 2341 - 2344
  • [39] EFFICIENT VIEW SYNTHESIS FOR MULTI-VIEW VIDEO PLUS DEPTH
    Vijayanagar, Krishna Rao
    Kim, Joohee
    Lee, Yunsik
    Kim, Jong-bok
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2197 - 2201
  • [40] Compression and interpolation of 3D-stereoscopic and multi-view video
    Siegel, M
    Sethuraman, S
    McVeigh, JS
    Jordan, A
    [J]. STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS IV, 1997, 3012 : 227 - 238