Neural 3D Video Synthesis from Multi-view Video

Cited by: 42

Authors:

Li, Tianye [1,2]

Slavcheva, Mira [2]

Zollhoefer, Michael [2]

Green, Simon [2]

Lassner, Christoph [2]

Kim, Changil [3]

Schmidt, Tanner [2]

Lovegrove, Steven [2]

Goesele, Michael [2]

Newcombe, Richard [2]

Lv, Zhaoyang [2]

Affiliations:

[1] Univ Southern Calif, Los Angeles, CA 90089 USA

[2] Real Labs Res, Redmond, WA 98052 USA

[3] Meta, Menlo Pk, CA USA

Source:

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022

Keywords:

SILHOUETTE;

DOI:

10.1109/CVPR52688.2022.00544

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion interpolation. Our approach takes the high quality and compactness of static neural radiance fields in a new direction: to a model-free, dynamic setting. At the core of our approach is a novel time-conditioned neural radiance field that represents scene dynamics using a set of compact latent codes. We are able to significantly boost the training speed and perceptual quality of the generated imagery by a novel hierarchical training scheme in combination with ray importance sampling. Our learned representation is highly compact and able to represent a 10 second 30 FPS multiview video recording by 18 cameras with a model size of only 28MB. We demonstrate that our method can render high-fidelity wide-angle novel views at over 1K resolution, even for complex and dynamic scenes. We perform an extensive qualitative and quantitative evaluation that shows that our approach outperforms the state of the art. Project website: https://neural-3d-video.github.io/.
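To put the compactness figure in the abstract into perspective, the following is a minimal back-of-the-envelope sketch, not part of the paper. It uses only the numbers stated in the abstract (10 seconds, 30 FPS, 18 cameras, 28MB). The function names and the convention that 1 MB equals 1,000,000 bytes are assumptions made for illustration.

```python
def total_frames(duration_s: int, fps: int, num_cameras: int) -> int:
    """Number of captured images across all cameras."""
    return duration_s * fps * num_cameras


def average_bytes_per_frame(
    model_size_mb: float,
    duration_s: int,
    fps: int,
    num_cameras: int,
    bytes_per_mb: int = 1_000_000,  # assumption: decimal megabytes
) -> float:
    """Model size divided evenly over every captured image."""
    frames = total_frames(duration_s, fps, num_cameras)
    return model_size_mb * bytes_per_mb / frames


if __name__ == "__main__":
    # Figures taken from the abstract: 10 s, 30 FPS, 18 cameras, 28 MB.
    frames = total_frames(10, 30, 18)
    per_frame = average_bytes_per_frame(28, 10, 30, 18)
    print(f"captured images: {frames}")
    print(f"average model bytes per captured image: {per_frame:.0f}")
```

Under these assumptions the recording contains 5,400 captured images, so the 28MB model corresponds to roughly 5 KB per captured image.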


Pages: 5511-5521

Number of pages: 11
