Generative View Synthesis: From Single-view Semantics to Novel-view Images

被引:0
|
作者
Habtegebrial, Tewodros [1 ]
Jampani, Varun [2 ]
Gallo, Orazio [3 ]
Stricker, Didier [1 ,4 ]
机构
[1] TU Kaiserslautern, Kaiserslautern, Germany
[2] Google Res, Mountain View, CA USA
[3] NVIDIA, Santa Clara, CA USA
[4] DFKI, Kaiserslautern, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Content creation, central to applications such as virtual reality, can be tedious and time-consuming. Recent image synthesis methods simplify this task by offering tools to generate new views from as little as a single input image, or by converting a semantic map into a photorealistic image. We propose to push the envelope further, and introduce Generative View Synthesis (GVS) that can synthesize multiple photorealistic views of a scene given a single semantic map. We show that the sequential application of existing techniques, e.g., semantics-to-image translation followed by monocular view synthesis, fail at capturing the scene's structure. In contrast, we solve the semantics-to-image translation in concert with the estimation of the 3D layout of the scene, thus producing geometrically consistent novel views that preserve semantic structures. We first lift the input 2D semantic map onto a 3D layered representation of the scene in feature space, thereby preserving the semantic labels of 3D geometric structures. We then project the layered features onto the target views to generate the final novel-view images. We verify the strengths of our method and compare it with several advanced baselines on three different datasets. Our approach also allows for style manipulation and image editing operations, such as the addition or removal of objects, with simple manipulations of the input style images and semantic maps respectively. For code and additional results, visit the project page at https://gvsnet.github.io
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Single-View View Synthesis with Multiplane Images
    Tucker, Richard
    Snavely, Noah
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 548 - 557
  • [2] Novel-View Acoustic Synthesis
    Chen, Changan
    Richard, Alexander
    Shapovalov, Roman
    Ithapu, Vamsi Krishna
    Neverova, Natalia
    Grauman, Kristen
    Vedaldi, Andrea
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6409 - 6419
  • [3] Two-View Mammogram Synthesis from Single-View Data Using Generative Adversarial Networks
    Yamazaki, Asumi
    Ishida, Takayuki
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [4] View-LSTM: Novel-View Video Synthesis Through View Decomposition
    Lakhal, Mohamed Ilyes
    Lanz, Oswald
    Cavallaro, Andrea
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7576 - 7586
  • [5] Generating full-view face images from a single-view image
    Zhong, Lei
    Bai, ChangMin
    Li, Jianfeng
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] Generating full-view face images from a single-view image
    Zhong, Lei
    Bai, ChangMin
    Li, Jianfeng
    [J]. Proceedings of the International Joint Conference on Neural Networks, 2021, 2021-July
  • [7] Inovis: Instant Novel-View Synthesis
    Harrer, Mathias
    Franke, Linus
    Fink, Laura
    [J]. PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
  • [8] A novel multi-view learning developed from single-view patterns
    Wang, Zhe
    Chen, Songcan
    Gao, Daqi
    [J]. PATTERN RECOGNITION, 2011, 44 (10-11) : 2395 - 2413
  • [9] ViT-MPI: Vision Transformer Multiplane Images for Surgical Single-View View Synthesis
    Han, Chenming
    Shao, Ruizhi
    Wu, Gaochang
    Shao, Hang
    Liu, Yebin
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 28 - 40
  • [10] Novel-View Synthesis of Human Tourist Photos
    Freer, Jonathan
    Yi, Kwang Moo
    Jiang, Wei
    Choi, Jongwon
    Chang, Hyung Jin
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 857 - 864