Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis

被引:5
|
作者
Zhang, Xuanmeng [1 ,2 ]
Zheng, Zhedong [3 ]
Gao, Daiheng [2 ]
Zhang, Bang [2 ]
Yang, Yi [4 ]
Chua, Tat-Seng [3 ]
机构
[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia
[2] DAMO Acad, Alibaba Grp, Hangzhou, Peoples R China
[3] Natl Univ Singapore, Sea NExT Joint Lab, Singapore, Singapore
[4] Zhejiang Univ, Hangzhou, Peoples R China
关键词
Generative adversarial networks; Neural radiance fields; 3D-aware image synthesis;
D O I
10.1007/s11263-023-01805-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies compositional 3D-aware image synthesis for both single-object and multi-object scenes. We observe that two challenges remain in this field: existing approaches (1) lack geometry constraints and thus compromise the multi-view consistency of the single object, and (2) can not scale to multi-object scenes with complex backgrounds. To address these challenges coherently, we propose multi-view consistent generative adversarial networks (MVCGAN) for compositional 3D-aware image synthesis. First, we build the geometry constraints on the single object by leveraging the underlying 3D information. Specifically, we enforce the photometric consistency between pairs of views, encouraging the model to learn the inherent 3D shape. Second, we adapt MVCGAN to multi-object scenarios. In particular, we formulate the multi-object scene generation as a "decompose and compose" process. During training, we adopt the top-down strategy to decompose training images into objects and backgrounds. When rendering, we deploy a reverse bottom-up manner by composing the generated objects and background into the holistic scene. Extensive experiments on both single-object and multi-object datasets show that the proposed method achieves competitive performance for 3D-aware image synthesis.
引用
收藏
页码:2219 / 2242
页数:24
相关论文
共 50 条
  • [31] Semi- and Self-supervised Multi-view Fusion of 3D Microscopy Images Using Generative Adversarial Networks
    Yang, Canyu
    Eschweiler, Dennis
    Stegmaier, Johannes
    MACHINE LEARNING FOR MEDICAL IMAGE RECONSTRUCTION (MLMIR 2021), 2021, 12964 : 130 - 139
  • [32] MEGAN: A Generative Adversarial Network for Multi-View Network Embedding
    Sun, Yiwei
    Wang, Suhang
    Hsieh, Tsung-Yu
    Tang, Xianfeng
    Honavar, Vasant
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3527 - 3533
  • [33] Multi-View Gait Recognition Based on Generative Adversarial Network
    Wen, Jiamin
    Shen, Yongliang
    Yang, Jun
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1855 - 1877
  • [34] Textured Mesh Generation Using Multi-View and Multi-Source Supervision and Generative Adversarial Networks
    Wen, Mingyun
    Park, Jisun
    Cho, Kyungeun
    REMOTE SENSING, 2021, 13 (21)
  • [35] Multi-View Gait Recognition Based on Generative Adversarial Network
    Jiamin Wen
    Yongliang Shen
    Jun Yang
    Neural Processing Letters, 2022, 54 : 1855 - 1877
  • [36] Multi-Contrast MRI Image Synthesis Using Switchable Cycle-Consistent Generative Adversarial Networks
    Zhang, Huixian
    Li, Hailong
    Dillman, Jonathan R.
    Parikh, Nehal A.
    He, Lili
    DIAGNOSTICS, 2022, 12 (04)
  • [37] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
    Gurbuz, Sabri
    Yano, Sumio
    Iwasawa, Shoichiro
    Ando, Hiroshi
    IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
  • [38] Photoacoustic image synthesis with generative adversarial networks
    Schellenberg, Melanie
    Groehl, Janek
    Dreher, Kris K.
    Noelke, Jan-Hinrich
    Holzwarth, Niklas
    Tizabi, Minu D.
    Seitel, Alexander
    Maier-Hein, Lena
    PHOTOACOUSTICS, 2022, 28
  • [39] Enhanced Magnetic Resonance Image Synthesis with Contrast-Aware Generative Adversarial Networks
    Denck, Jonas
    Guehring, Jens
    Maier, Andreas
    Rothgang, Eva
    JOURNAL OF IMAGING, 2021, 7 (08)
  • [40] VQ3D: Learning a 3D-Aware Generative Model on ImageNet
    Sargent, Kyle
    Koh, Jing Yu
    Zhang, Han
    Chang, Huiwen
    Herrmann, Charles
    Srinivasan, Pratul
    Wu, Jiajun
    Sun, Deqing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4217 - 4227