Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis

Cited by: 5

Authors:

Zhang, Xuanmeng ^{[1,2]}

Zheng, Zhedong ^{[3]}

Gao, Daiheng ^{[2]}

Zhang, Bang ^{[2]}

Yang, Yi ^{[4]}

Chua, Tat-Seng ^{[3]}

Affiliations:

[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia

[2] DAMO Acad, Alibaba Grp, Hangzhou, Peoples R China

[3] Natl Univ Singapore, Sea NExT Joint Lab, Singapore, Singapore

[4] Zhejiang Univ, Hangzhou, Peoples R China

Source:

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2023 / Vol. 131 / No. 08

Keywords:

Generative adversarial networks; Neural radiance fields; 3D-aware image synthesis;

DOI:

10.1007/s11263-023-01805-x

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper studies compositional 3D-aware image synthesis for both single-object and multi-object scenes. We observe that two challenges remain in this field: existing approaches (1) lack geometry constraints and thus compromise the multi-view consistency of the single object, and (2) cannot scale to multi-object scenes with complex backgrounds. To address these challenges coherently, we propose multi-view consistent generative adversarial networks (MVCGAN) for compositional 3D-aware image synthesis. First, we build the geometry constraints on the single object by leveraging the underlying 3D information. Specifically, we enforce the photometric consistency between pairs of views, encouraging the model to learn the inherent 3D shape. Second, we adapt MVCGAN to multi-object scenarios. In particular, we formulate the multi-object scene generation as a "decompose and compose" process. During training, we adopt the top-down strategy to decompose training images into objects and backgrounds. When rendering, we deploy a reverse bottom-up manner by composing the generated objects and background into the holistic scene. Extensive experiments on both single-object and multi-object datasets show that the proposed method achieves competitive performance for 3D-aware image synthesis.


Pages: 2219-2242

Page count: 24

50 items in total

[21] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
Shu, Dong Wook
Jang, Wonbeom
Yoo, Heebin
Shin, Hong-Chang
Kwon, Junseok
MACHINE VISION AND APPLICATIONS, 2022, 33 (01)
[22] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
Dong Wook Shu
Wonbeom Jang
Heebin Yoo
Hong-Chang Shin
Junseok Kwon
Machine Vision and Applications, 2022, 33
[23] 3D-Aware Semantic-Guided Generative Model for Human Synthesis
Zhang, Jichao
Sangineto, Enver
Tang, Hao
Siarohin, Aliaksandr
Zhong, Zhun
Sebe, Nicu
Wang, Wei
COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 339 - 356
[24] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
Shi, Zifan
Xu, Yinghao
Shen, Yujun
Zhao, Deli
Chen, Qifeng
Yeung, Dit-Yan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[25] Learning 3D-aware Image Synthesis with Unknown Pose Distribution
Shi, Zifan
Shen, Yujun
Xu, Yinghao
Peng, Sida
Liao, Yiyi
Guo, Sheng
Chen, Qifeng
Yeung, Dit-Yan
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13062 - 13071
[26] A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis
Pan, Xingang
Xu, Xudong
Loy, Chen Change
Theobalt, Christian
Dai, Bo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[27] A Co-Attention Method Based on Generative Adversarial Networks for Multi-view Images
Huang, Qi-Xian
Shi, Shu-Pei
Lin, Guo-Shiang
Shen, Day-Fann
Sun, Hung-Min
22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 171 - 173
[28] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
Do, Hoseok
Yoo, EunKyung
Kim, Taehyeong
Lee, Chul
Choi, Tin Young
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8529 - 8538
[29] 3D-Aware Multi-Class Image-to-Image Translation with NeRFs
Li, Senmao
van de Weijer, Joost
Wang, Yaxing
Khan, Fahad Shahbaz
Liu, Meiqin
Yang, Jian
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12652 - 12662
[30] Adversarial Multi-view Networks for Activity Recognition
Bai, Lei
Yao, Lina
Wang, Xianzhi
Kanhere, Salil S.
Guo, Bin
Yu, Zhiwen
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (02):
