Neural 3D Video Synthesis from Multi-view Video

被引：42

作者：

Li, Tianye ^{[1
,2
]}

Slavcheva, Mira ^{[2
]}

Zollhoefer, Michael ^{[2
]}

Green, Simon ^{[2
]}

Lassner, Christoph ^{[2
]}

Kim, Changil ^{[3
]}

Schmidt, Tanner ^{[2
]}

Lovegrove, Steven ^{[2
]}

Goesele, Michael ^{[2
]}

Newcombe, Richard ^{[2
]}

Lv, Zhaoyang ^{[2
]}

机构：

[1] Univ Southern Calif, Los Angeles, CA 90089 USA

[2] Real Labs Res, Redmond, WA 98052 USA

[3] Meta, Menlo Pk, CA USA

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

SILHOUETTE;

D O I：

10.1109/CVPR52688.2022.00544

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion interpolation. Our approach takes the high quality and compactness of static neural radiance fields in a new direction: to a model-free, dynamic setting. At the core of our approach is a novel time-conditioned neural radiance field that represents scene dynamics using a set of compact latent codes. We are able to significantly boost the training speed and perceptual quality of the generated imagery by a novel hierarchical training scheme in combination with ray importance sampling. Our learned representation is highly compact and able to represent a 10 second 30 FPS multiview video recording by 18 cameras with a model size of only 28MB. We demonstrate that our method can render high-fidelity wide-angle novel views at over 1K resolution, even for complex and dynamic scenes. We perform an extensive qualitative and quantitative evaluation that shows that our approach outperforms the state of the art. Project website: https://neural- 3d-video.github.io/.

引用

页码：5511 / 5521

页数：11

共 50 条

[31] CONVERSION OF FREE-VIEWPOINT 3D MULTI-VIEW VIDEO FOR STEREOSCOPIC DISPLAYS
Do, Luat
Zinger, Svitlana
de With, Peter H. N.
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1730 - 1734
[32] A DASH-Based 3D Multi-view Video Rate Control System
Su, Tianyu
Javadtalab, Abbas
Yassine, Abdulsalam
Shirmohammadi, Shervin
[J]. 2014 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2014,
[33] Realistic 3D facial animation parameters from mirror-reflected multi-view video
Lin, IC
Yeh, JS
Ouhyoung, M
[J]. COMPUTER ANIMATION 2001, PROCEEDINGS, 2001, : 2 - +
[34] Joint Space-Time-View Error Concealment Algorithms for 3D Multi-View Video
El Shafai, Walid
Hrusovsky, Branislav
El-Khamy, Mostafa
El-Sharkawy, Mohamed
[J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
[35] New network bandwidth-limited multi-view video plus depth coding method for 3D video
[J]. Yu, M. (jianggangyi@126.com), 1600, Academy Publisher (08)
[36] Joint Bit Allocation and Rate Control for Coding Multi-View Video Plus Depth Based 3D Video
Shao, Feng
Jiang, Gangyi
Lin, Weishi
Yu, Mei
Dai, Qionghai
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) : 1843 - 1854
[37] Asymmetric Coding of Multi-View Video Plus Depth Based 3-D Video for View Rendering
Shao, Feng
Jiang, Gangyi
Yu, Mei
Chen, Ken
Ho, Yo-Sung
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) : 157 - 167
[38] VIRTUAL VIEW SYNTHESIS USING MULTI-VIEW VIDEO SEQUENCES
Jung, Il-Lyong
Chung, Taeyoung
Song, Kwanwoong
Kim, Chang-Su
[J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 2341 - 2344
[39] EFFICIENT VIEW SYNTHESIS FOR MULTI-VIEW VIDEO PLUS DEPTH
Vijayanagar, Krishna Rao
Kim, Joohee
Lee, Yunsik
Kim, Jong-bok
[J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2197 - 2201
[40] Compression and interpolation of 3D-stereoscopic and multi-view video
Siegel, M
Sethuraman, S
McVeigh, JS
Jordan, A
[J]. STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS IV, 1997, 3012 : 227 - 238

← 1 2 3 4 5 →