Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis

被引：5

作者：

Zhang, Xuanmeng ^{[1
,2
]}

Zheng, Zhedong ^{[3
]}

Gao, Daiheng ^{[2
]}

Zhang, Bang ^{[2
]}

Yang, Yi ^{[4
]}

Chua, Tat-Seng ^{[3
]}

机构：

[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia

[2] DAMO Acad, Alibaba Grp, Hangzhou, Peoples R China

[3] Natl Univ Singapore, Sea NExT Joint Lab, Singapore, Singapore

[4] Zhejiang Univ, Hangzhou, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2023年 / 131卷 / 08期

关键词：

Generative adversarial networks; Neural radiance fields; 3D-aware image synthesis;

D O I：

10.1007/s11263-023-01805-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper studies compositional 3D-aware image synthesis for both single-object and multi-object scenes. We observe that two challenges remain in this field: existing approaches (1) lack geometry constraints and thus compromise the multi-view consistency of the single object, and (2) can not scale to multi-object scenes with complex backgrounds. To address these challenges coherently, we propose multi-view consistent generative adversarial networks (MVCGAN) for compositional 3D-aware image synthesis. First, we build the geometry constraints on the single object by leveraging the underlying 3D information. Specifically, we enforce the photometric consistency between pairs of views, encouraging the model to learn the inherent 3D shape. Second, we adapt MVCGAN to multi-object scenarios. In particular, we formulate the multi-object scene generation as a "decompose and compose" process. During training, we adopt the top-down strategy to decompose training images into objects and backgrounds. When rendering, we deploy a reverse bottom-up manner by composing the generated objects and background into the holistic scene. Extensive experiments on both single-object and multi-object datasets show that the proposed method achieves competitive performance for 3D-aware image synthesis.

引用

页码：2219 / 2242

页数：24

共 50 条

[1] Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis
Xuanmeng Zhang
Zhedong Zheng
Daiheng Gao
Bang Zhang
Yi Yang
Tat-Seng Chua
International Journal of Computer Vision, 2023, 131 : 2219 - 2242
[2] Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis
Zhang, Xuanmeng
Zheng, Zhedong
Gao, Daiheng
Zhang, Bang
Pan, Pan
Yang, Yi
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18429 - 18438
[3] pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
Chan, Eric R.
Monteiro, Marco
Kellnhofer, Petr
Wu, Jiajun
Wetzstein, Gordon
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5795 - 5805
[4] Multi-view Generative Adversarial Networks
Chen, Mickael
Denoyer, Ludovic
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 : 175 - 188
[5] 3D-Aware Generative Model for Improved Side-View Image Synthesis
Jo, Kyungmin
Jin, Wonjoon
Choo, Jaegul
Lee, Hyunjoon
Cho, Sunghyun
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22805 - 22815
[6] A Survey on Deep Generative 3D-aware Image Synthesis
Xia, Weihao
Xue, Jing-Hao
ACM COMPUTING SURVEYS, 2024, 56 (04)
[7] 3D-aware Facial Landmark Detection via Multi-view Consistent Training on Synthetic Data
Zeng, Libing
Chen, Lele
Bao, Wentao
Li, Zhong
Xu, Yi
Yuan, Junsong
Kalantari, Nima K.
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12747 - 12758
[8] Generative Novel View Synthesis with 3D-Aware Diffusion Models
Chan, Eric R.
Nagano, Koki
Chan, Matthew A.
Bergman, Alexander W.
Park, Jeong Joon
Levy, Axel
Aittala, Miika
De Mello, Shalini
Karras, Tero
Wetzstein, Gordon
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4194 - 4206
[9] Lifespan Face Age Progression using 3D-Aware Generative Adversarial Networks
Jensen, Eric Kastl
Bjerre, Morten
Grimmer, Marcel
Busch, Christoph
2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
[10] GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis
Schwarz, Katja
Liao, Yiyi
Niemeyer, Michael
Geiger, Andreas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33

← 1 2 3 4 5 →