Generative View Synthesis: From Single-view Semantics to Novel-view Images

Cited by: 0

Authors:

Habtegebrial, Tewodros [1]

Jampani, Varun [2]

Gallo, Orazio [3]

Stricker, Didier [1,4]

Affiliations:

[1] TU Kaiserslautern, Kaiserslautern, Germany

[2] Google Research, Mountain View, CA, USA

[3] NVIDIA, Santa Clara, CA, USA

[4] DFKI, Kaiserslautern, Germany

Source:

Advances in Neural Information Processing Systems 33 (NeurIPS 2020) | 2020, Vol. 33

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Content creation, central to applications such as virtual reality, can be tedious and time-consuming. Recent image synthesis methods simplify this task by offering tools to generate new views from as little as a single input image, or by converting a semantic map into a photorealistic image. We propose to push the envelope further, and introduce Generative View Synthesis (GVS) that can synthesize multiple photorealistic views of a scene given a single semantic map. We show that the sequential application of existing techniques, e.g., semantics-to-image translation followed by monocular view synthesis, fails at capturing the scene's structure. In contrast, we solve the semantics-to-image translation in concert with the estimation of the 3D layout of the scene, thus producing geometrically consistent novel views that preserve semantic structures. We first lift the input 2D semantic map onto a 3D layered representation of the scene in feature space, thereby preserving the semantic labels of 3D geometric structures. We then project the layered features onto the target views to generate the final novel-view images. We verify the strengths of our method and compare it with several advanced baselines on three different datasets. Our approach also allows for style manipulation and image editing operations, such as the addition or removal of objects, with simple manipulations of the input style images and semantic maps, respectively. For code and additional results, visit the project page at https://gvsnet.github.io
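
The abstract does not say how the layered features are combined once they have been projected onto a target view. As a rough illustration only, the sketch below assumes a back-to-front "over" compositing rule of the kind commonly used with multiplane images; this is an assumption, not a description of the authors' implementation. The function name `composite_layers` is hypothetical, scalar values stand in for per-pixel feature vectors, and the warp of each layer into the target camera is omitted.

```python
from typing import List

Grid = List[List[float]]  # one layer: rows of per-pixel values


def composite_layers(features: List[Grid], alphas: List[Grid]) -> Grid:
    """Blend layers with the 'over' operator (hypothetical helper).

    Layers are ordered from farthest to nearest. Each alpha value in [0, 1]
    says how much a layer covers what lies behind it at that pixel.
    """
    if len(features) != len(alphas) or not features:
        raise ValueError("need the same non-zero number of feature and alpha layers")
    height, width = len(features[0]), len(features[0][0])
    out = [[0.0] * width for _ in range(height)]
    for feat, alpha in zip(features, alphas):  # far to near
        for y in range(height):
            for x in range(width):
                a = alpha[y][x]
                # The nearer layer covers the running result in proportion to a.
                out[y][x] = a * feat[y][x] + (1.0 - a) * out[y][x]
    return out


# Two 1x2 layers: the near layer is transparent at pixel 0, opaque at pixel 1.
far, far_alpha = [[1.0, 1.0]], [[1.0, 1.0]]
near, near_alpha = [[5.0, 5.0]], [[0.0, 1.0]]
print(composite_layers([far, near], [far_alpha, near_alpha]))  # [[1.0, 5.0]]
```

In this toy case the far layer shows through wherever the near layer is transparent, which is the property that lets a layered representation handle occlusion when the viewpoint changes.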

Pages: 11
