Generative View Synthesis: From Single-view Semantics to Novel-view Images

Cited by: 0

Authors:

Habtegebrial, Tewodros [1]

Jampani, Varun [2]

Gallo, Orazio [3]

Stricker, Didier [1,4]

Affiliations:

[1] TU Kaiserslautern, Kaiserslautern, Germany

[2] Google Research, Mountain View, CA, USA

[3] NVIDIA, Santa Clara, CA, USA

[4] DFKI, Kaiserslautern, Germany

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020 / Vol. 33

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Content creation, central to applications such as virtual reality, can be tedious and time-consuming. Recent image synthesis methods simplify this task by offering tools that generate new views from as little as a single input image, or that convert a semantic map into a photorealistic image. We propose to push the envelope further and introduce Generative View Synthesis (GVS), which can synthesize multiple photorealistic views of a scene given a single semantic map. We show that the sequential application of existing techniques, e.g., semantics-to-image translation followed by monocular view synthesis, fails to capture the scene's structure. In contrast, we solve the semantics-to-image translation in concert with the estimation of the 3D layout of the scene, thus producing geometrically consistent novel views that preserve semantic structures. We first lift the input 2D semantic map onto a 3D layered representation of the scene in feature space, thereby preserving the semantic labels of 3D geometric structures. We then project the layered features onto the target views to generate the final novel-view images. We verify the strengths of our method and compare it with several advanced baselines on three different datasets. Our approach also allows for style manipulation and image editing operations, such as the addition or removal of objects, with simple manipulations of the input style images and semantic maps, respectively. For code and additional results, visit the project page at https://gvsnet.github.io
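
The "project the layered features onto the target views" step can be illustrated with a toy sketch. This is not the authors' implementation: it assumes that each layer carries a per-pixel alpha (opacity) map, that layers are ordered from far to near, and that a camera translation can be approximated by a horizontal shift proportional to each layer's disparity. The names `shift_layer`, `render_view`, `disparities`, and `baseline` are hypothetical and chosen only for illustration.

```python
from typing import List

Layer = List[List[float]]  # H x W grid of feature (or alpha) values


def shift_layer(layer: Layer, shift: int, fill: float = 0.0) -> Layer:
    """Shift a layer horizontally by `shift` pixels.

    Assumption: this integer shift stands in for the per-layer warp
    that a real system would apply when moving the camera.
    """
    width = len(layer[0])
    shifted = []
    for row in layer:
        new_row = [fill] * width
        for x, value in enumerate(row):
            target_x = x + shift
            if 0 <= target_x < width:
                new_row[target_x] = value
        shifted.append(new_row)
    return shifted


def render_view(features: List[Layer], alphas: List[Layer],
                disparities: List[float], baseline: float) -> Layer:
    """Warp every layer, then alpha-composite the layers back to front.

    Layers are assumed to be ordered from far to near, so nearer layers
    overwrite farther ones wherever their alpha is high.
    """
    height, width = len(features[0]), len(features[0][0])
    canvas = [[0.0] * width for _ in range(height)]
    for feat, alpha, disparity in zip(features, alphas, disparities):
        shift = round(baseline * disparity)  # nearer layers move more
        warped_feat = shift_layer(feat, shift)
        warped_alpha = shift_layer(alpha, shift)
        for y in range(height):
            for x in range(width):
                a = warped_alpha[y][x]
                canvas[y][x] = a * warped_feat[y][x] + (1.0 - a) * canvas[y][x]
    return canvas


# Toy scene: an opaque background plus one near object in the first column.
far_feat, far_alpha = [[1.0] * 4], [[1.0] * 4]
near_feat, near_alpha = [[5.0] * 4], [[1.0, 0.0, 0.0, 0.0]]

source_view = render_view([far_feat, near_feat], [far_alpha, near_alpha],
                          [0.0, 1.0], baseline=0.0)
novel_view = render_view([far_feat, near_feat], [far_alpha, near_alpha],
                         [0.0, 1.0], baseline=2.0)
print(source_view)
print(novel_view)
```

In this toy scene the near object moves with the camera while the background stays fixed, which is the parallax effect a layered representation is meant to capture. A full system would operate on learned feature maps and decode the composited features into an image, which this sketch does not attempt.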

Pages: 11
