Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis

被引:5
|
作者
Zhang, Xuanmeng [1 ,2 ]
Zheng, Zhedong [3 ]
Gao, Daiheng [2 ]
Zhang, Bang [2 ]
Yang, Yi [4 ]
Chua, Tat-Seng [3 ]
机构
[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia
[2] DAMO Acad, Alibaba Grp, Hangzhou, Peoples R China
[3] Natl Univ Singapore, Sea NExT Joint Lab, Singapore, Singapore
[4] Zhejiang Univ, Hangzhou, Peoples R China
关键词
Generative adversarial networks; Neural radiance fields; 3D-aware image synthesis;
D O I
10.1007/s11263-023-01805-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies compositional 3D-aware image synthesis for both single-object and multi-object scenes. We observe that two challenges remain in this field: existing approaches (1) lack geometry constraints and thus compromise the multi-view consistency of the single object, and (2) can not scale to multi-object scenes with complex backgrounds. To address these challenges coherently, we propose multi-view consistent generative adversarial networks (MVCGAN) for compositional 3D-aware image synthesis. First, we build the geometry constraints on the single object by leveraging the underlying 3D information. Specifically, we enforce the photometric consistency between pairs of views, encouraging the model to learn the inherent 3D shape. Second, we adapt MVCGAN to multi-object scenarios. In particular, we formulate the multi-object scene generation as a "decompose and compose" process. During training, we adopt the top-down strategy to decompose training images into objects and backgrounds. When rendering, we deploy a reverse bottom-up manner by composing the generated objects and background into the holistic scene. Extensive experiments on both single-object and multi-object datasets show that the proposed method achieves competitive performance for 3D-aware image synthesis.
引用
收藏
页码:2219 / 2242
页数:24
相关论文
共 50 条
  • [21] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
    Shu, Dong Wook
    Jang, Wonbeom
    Yoo, Heebin
    Shin, Hong-Chang
    Kwon, Junseok
    MACHINE VISION AND APPLICATIONS, 2022, 33 (01)
  • [22] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
    Dong Wook Shu
    Wonbeom Jang
    Heebin Yoo
    Hong-Chang Shin
    Junseok Kwon
    Machine Vision and Applications, 2022, 33
  • [23] 3D-Aware Semantic-Guided Generative Model for Human Synthesis
    Zhang, Jichao
    Sangineto, Enver
    Tang, Hao
    Siarohin, Aliaksandr
    Zhong, Zhun
    Sebe, Nicu
    Wang, Wei
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 339 - 356
  • [24] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
    Shi, Zifan
    Xu, Yinghao
    Shen, Yujun
    Zhao, Deli
    Chen, Qifeng
    Yeung, Dit-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [25] Learning 3D-aware Image Synthesis with Unknown Pose Distribution
    Shi, Zifan
    Shen, Yujun
    Xu, Yinghao
    Peng, Sida
    Liao, Yiyi
    Guo, Sheng
    Chen, Qifeng
    Yeung, Dit-Yan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13062 - 13071
  • [26] A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis
    Pan, Xingang
    Xu, Xudong
    Loy, Chen Change
    Theobalt, Christian
    Dai, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] A Co-Attention Method Based on Generative Adversarial Networks for Multi-view Images
    Huang, Qi-Xian
    Shi, Shu-Pei
    Lin, Guo-Shiang
    Shen, Day-Fann
    Sun, Hung-Min
    22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 171 - 173
  • [28] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
    Do, Hoseok
    Yoo, EunKyung
    Kim, Taehyeong
    Lee, Chul
    Choi, Tin Young
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8529 - 8538
  • [29] 3D-Aware Multi-Class Image-to-Image Translation with NeRFs
    Li, Senmao
    van de Weijer, Joost
    Wang, Yaxing
    Khan, Fahad Shahbaz
    Liu, Meiqin
    Yang, Jian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12652 - 12662
  • [30] Adversarial Multi-view Networks for Activity Recognition
    Bai, Lei
    Yao, Lina
    Wang, Xianzhi
    Kanhere, Salil S.
    Bin Guo
    Yu, Zhiwen
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (02):