Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis

被引：5

作者：

Zhang, Xuanmeng ^{[1
,2
]}

Zheng, Zhedong ^{[3
]}

Gao, Daiheng ^{[2
]}

Zhang, Bang ^{[2
]}

Yang, Yi ^{[4
]}

Chua, Tat-Seng ^{[3
]}

机构：

[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia

[2] DAMO Acad, Alibaba Grp, Hangzhou, Peoples R China

[3] Natl Univ Singapore, Sea NExT Joint Lab, Singapore, Singapore

[4] Zhejiang Univ, Hangzhou, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2023年 / 131卷 / 08期

关键词：

Generative adversarial networks; Neural radiance fields; 3D-aware image synthesis;

D O I：

10.1007/s11263-023-01805-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper studies compositional 3D-aware image synthesis for both single-object and multi-object scenes. We observe that two challenges remain in this field: existing approaches (1) lack geometry constraints and thus compromise the multi-view consistency of the single object, and (2) can not scale to multi-object scenes with complex backgrounds. To address these challenges coherently, we propose multi-view consistent generative adversarial networks (MVCGAN) for compositional 3D-aware image synthesis. First, we build the geometry constraints on the single object by leveraging the underlying 3D information. Specifically, we enforce the photometric consistency between pairs of views, encouraging the model to learn the inherent 3D shape. Second, we adapt MVCGAN to multi-object scenarios. In particular, we formulate the multi-object scene generation as a "decompose and compose" process. During training, we adopt the top-down strategy to decompose training images into objects and backgrounds. When rendering, we deploy a reverse bottom-up manner by composing the generated objects and background into the holistic scene. Extensive experiments on both single-object and multi-object datasets show that the proposed method achieves competitive performance for 3D-aware image synthesis.

引用

页码：2219 / 2242

页数：24

共 50 条

[31] Semi- and Self-supervised Multi-view Fusion of 3D Microscopy Images Using Generative Adversarial Networks
Yang, Canyu
Eschweiler, Dennis
Stegmaier, Johannes
MACHINE LEARNING FOR MEDICAL IMAGE RECONSTRUCTION (MLMIR 2021), 2021, 12964 : 130 - 139
[32] MEGAN: A Generative Adversarial Network for Multi-View Network Embedding
Sun, Yiwei
Wang, Suhang
Hsieh, Tsung-Yu
Tang, Xianfeng
Honavar, Vasant
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3527 - 3533
[33] Multi-View Gait Recognition Based on Generative Adversarial Network
Wen, Jiamin
Shen, Yongliang
Yang, Jun
NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1855 - 1877
[34] Textured Mesh Generation Using Multi-View and Multi-Source Supervision and Generative Adversarial Networks
Wen, Mingyun
Park, Jisun
Cho, Kyungeun
REMOTE SENSING, 2021, 13 (21)
[35] Multi-View Gait Recognition Based on Generative Adversarial Network
Jiamin Wen
Yongliang Shen
Jun Yang
Neural Processing Letters, 2022, 54 : 1855 - 1877
[36] Multi-Contrast MRI Image Synthesis Using Switchable Cycle-Consistent Generative Adversarial Networks
Zhang, Huixian
Li, Hailong
Dillman, Jonathan R.
Parikh, Nehal A.
He, Lili
DIAGNOSTICS, 2022, 12 (04)
[37] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
Gurbuz, Sabri
Yano, Sumio
Iwasawa, Shoichiro
Ando, Hiroshi
IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
[38] Photoacoustic image synthesis with generative adversarial networks
Schellenberg, Melanie
Groehl, Janek
Dreher, Kris K.
Noelke, Jan-Hinrich
Holzwarth, Niklas
Tizabi, Minu D.
Seitel, Alexander
Maier-Hein, Lena
PHOTOACOUSTICS, 2022, 28
[39] Enhanced Magnetic Resonance Image Synthesis with Contrast-Aware Generative Adversarial Networks
Denck, Jonas
Guehring, Jens
Maier, Andreas
Rothgang, Eva
JOURNAL OF IMAGING, 2021, 7 (08)
[40] VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Sargent, Kyle
Koh, Jing Yu
Zhang, Han
Chang, Huiwen
Herrmann, Charles
Srinivasan, Pratul
Wu, Jiajun
Sun, Deqing
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4217 - 4227

← 1 2 3 4 5 →