Realistic image generation using adversarial generative networks combined with depth information

Cited by: 1

Authors:

Yu, Qi [1]

Yu, Lan [1]

Li, Guangju [1]

Jin, Dehu [1]

Qi, Meng [1]

Affiliations:

[1] Shandong Normal University, Jinan 250358, Shandong, People's Republic of China

Source:

DIGITAL SIGNAL PROCESSING | 2023 / Vol. 143

Funding:

National Natural Science Foundation of China;

Keywords:

CGAN; Image generation; Depth information; Attention mechanisms;

DOI:

10.1016/j.dsp.2023.104263

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Existing image generation methods often produce blurry, unrealistic results that lack layering and structure. Depth information can be used to accurately control the relative positions and hierarchy of the objects in an image. Our goal is to improve the realism, hierarchy, and quality of generated images by using depth information in image-to-image tasks. To this end, we propose a multi-conditional semantic image generation method that fuses depth information. The method builds on the Generative Adversarial Network architecture: it takes pairs of semantic labels and depth maps as inputs and fuses the depth information of these multi-conditional inputs through our proposed Multi-scale Feature Extraction and Information Fusion Module. Furthermore, we add a channel-attention mechanism to the generator to strengthen the interconnectivity between channels and suppress confusion between different semantic features. With only a small increase in training cost, the proposed module can generate realistic images that match the input semantic layout. In extensive tests on three challenging datasets, the images generated by this model show superior visual quality and quantitative metrics, demonstrating the effectiveness of our proposed method.
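As a rough illustration of two ideas named in the abstract (pairing a semantic label map with a depth map as a multi-conditional input, and reweighting channels with attention), the sketch below may help. It is not the authors' implementation: the abstract does not specify the form of the attention or the fusion module, so a squeeze-and-excitation style gate and plain channel stacking are assumed here. The function names `fuse_conditions` and `channel_attention`, the weight matrices `w1` and `w2`, and the toy 2x2 inputs are all hypothetical.

```python
import math
import random


def fuse_conditions(label_map, depth_map, num_classes):
    """Stack a one-hot label map and a depth map into K+1 channels (assumed fusion)."""
    channels = [
        [[1.0 if label == k else 0.0 for label in row] for row in label_map]
        for k in range(num_classes)
    ]
    # The depth map becomes one extra conditioning channel.
    channels.append([list(row) for row in depth_map])
    return channels


def channel_attention(features, w1, w2):
    """Squeeze-and-excitation style gate (an assumption, not the paper's exact module)."""
    # Squeeze: global average of each channel -> one scalar per channel.
    squeezed = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in features]
    # Excite: bottleneck layer with ReLU, then a sigmoid gate per channel.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    weights = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Rescale every value in a channel by that channel's gate.
    scaled = [[[v * g for v in row] for row in ch] for ch, g in zip(features, weights)]
    return scaled, weights


# Toy 2x2 inputs: three semantic classes plus one depth channel.
label_map = [[0, 1], [2, 1]]
depth_map = [[0.2, 0.8], [0.5, 0.1]]
fused = fuse_conditions(label_map, depth_map, num_classes=3)

# Random, untrained weights, used only to exercise the shapes.
rng = random.Random(0)
w1 = [[rng.uniform(-1, 1) for _ in range(len(fused))] for _ in range(2)]
w2 = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(len(fused))]
scaled, weights = channel_attention(fused, w1, w2)
```

In a trained generator the gate weights would be learned, so channels carrying useful semantic or depth cues would be amplified and the others suppressed; here they are random and only show the data flow.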


Pages: 9
