Realistic image generation using adversarial generative networks combined with depth information

被引:1
|
作者
Yu, Qi [1 ]
Yu, Lan [1 ]
Li, Guangju [1 ]
Jin, Dehu [1 ]
Qi, Meng [1 ]
机构
[1] Shandong Normal Univ, Jinan 250358, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
CGAN; Image generation; Depth information; Attention mechanisms;
D O I
10.1016/j.dsp.2023.104263
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing image generation tasks produce blurry, unrealistic results and images that lack layers and structure. Depth information can be used to accurately control the relative positions and hierarchies between different objects in an image. Our goal is to enhance the realism, hierarchy, and quality of generated images by using depth information in image-to-image tasks. To address these issues, we propose a multi-conditional semantic image generation method that fuses depth information. The method is based on the network structure of Generative Adversarial Networks and fuses the depth information of multi-conditional inputs by using pairs of semantic labels and depth maps as inputs through our proposed Multi-scale Feature Extraction and Information Fusion Module. Furthermore, we add a channel-attention mechanism to the generator to strengthen the interconnectivity between channels and suppress confusion between different semantic features. With less increase in training cost, the module proposed in this paper can generate real images that match the input semantic layout. Through extensive testing on three challenging datasets, the images generated by this model produce superior visuals and data metrics, demonstrating the effectiveness of our proposed method.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Image Inpainting Using Generative Adversarial Networks
    Luo H.-L.
    Ao Y.
    Yuan P.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (10): : 1891 - 1898
  • [32] Realistic Data Synthesis Using Enhanced Generative Adversarial Networks
    Baowaly, Mrinal Kanti
    Liu, Chao-Lin
    Chen, Kuan-Ta
    2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 289 - 292
  • [33] Monocular Depth Prediction using Generative Adversarial Networks
    Kumar, Arun C. S.
    Bhandarkar, Suchendra M.
    Prasad, Mukta
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 413 - 421
  • [34] Image Generation Using StyleVGG19-NST Generative Adversarial Networks
    Esan, Dorcas Oladayo
    Owolawi, Pius Adewale
    Tu, Chunling
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 70 - 80
  • [35] Augmented Reality Image Generation with Optical Consistency using Generative Adversarial Networks
    Iketani, Shunya
    Sato, Masaaki
    Imura, Masataka
    2020 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES WORKSHOPS (VRW 2020), 2020, : 615 - 616
  • [36] Successive Future Image Generation of a Walking Pedestrian Using Generative Adversarial Networks
    He, Bate
    Kita, Eisuke
    REVIEW OF SOCIONETWORK STRATEGIES, 2021, 15 (02): : 309 - 325
  • [37] Generating Realistic Aircraft Trajectories Using Generative Adversarial Networks
    Lukes, Petr
    Kulmon, Pavel
    2023 24TH INTERNATIONAL RADAR SYMPOSIUM, IRS, 2023,
  • [38] Successive Future Image Generation of a Walking Pedestrian Using Generative Adversarial Networks
    Bate He
    Eisuke Kita
    The Review of Socionetwork Strategies, 2021, 15 : 309 - 325
  • [39] Random generation of three-dimensional realistic ballast particles using generative adversarial networks
    Zhang, Jie
    Nie, Rusong
    Li, Yan
    Tan, Yongchang
    Computers and Geotechnics, 2025, 178
  • [40] SEMANTICGAN: GENERATIVE ADVERSARIAL NETWORKS FOR SEMANTIC IMAGE TO PHOTO-REALISTIC IMAGE TRANSLATION
    Liu, Junling
    Zou, Yuexian
    Yang, Dongming
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2528 - 2532