Exploration of Semantic Label Decomposition and Dataset Size in Semantic Indoor Scenes Synthesis via Optimized Residual Generative Adversarial Networks

被引:1
|
作者
Ibrahem, Hatem [1 ]
Salem, Ahmed [1 ,2 ]
Kang, Hyun-Soo [1 ]
机构
[1] Chungbuk Natl Univ, Sch Elect & Comp Engn, Dept Informat & Commun Engn, Cheongju 28644, South Korea
[2] Assiut Univ, Fac Engn, Elect Engn Dept, Assiut 71515, Egypt
关键词
generative adversarial networks; convolutional neural networks; image-to-image translation; semantic image synthesis;
D O I
10.3390/s22218306
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, we revisit the paired image-to-image translation using the conditional generative adversarial network, the so-called "Pix2Pix", and propose efficient optimization techniques for the architecture and the training method to maximize the architecture's performance to boost the realism of the generated images. We propose a generative adversarial network-based technique to create new artificial indoor scenes using a user-defined semantic segmentation map as an input to define the location, shape, and category of each object in the scene, exactly similar to Pix2Pix. We train different residual connections-based architectures of the generator and discriminator on the NYU depth-v2 dataset and a selected indoor subset from the ADE20K dataset, showing that the proposed models have fewer parameters, less computational complexity, and can generate better quality images than the state of the art methods following the same technique to generate realistic indoor images. We also prove that using extra specific labels and more training samples increases the quality of the generated images; however, the proposed residual connections-based models can learn better from small datasets (i.e., NYU depth-v2) and can improve the realism of the generated images in training on bigger datasets (i.e., ADE20K indoor subset) in comparison to Pix2Pix. The proposed method achieves an LPIPS value of 0.505 and an FID value of 81.067, generating better quality images than that produced by Pix2Pix and other recent paired Image-to-image translation methods and outperforming them in terms of LPIPS and FID.
引用
收藏
页数:15
相关论文
共 17 条
  • [1] Remote Sensing Image Synthesis via Semantic Embedding Generative Adversarial Networks
    Wang, Chendan
    Chen, Bowen
    Zou, Zhengxia
    Shi, Zhenwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] Semantic Image Synthesis via Conditional Cycle-Generative Adversarial Networks
    Liu, Xiyan
    Meng, Gaofeng
    Xiang, Shiming
    Pan, Chunhong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 988 - 993
  • [3] Secure Semantic Communication via Paired Adversarial Residual Networks
    He, Boxiang
    Wang, Fanggang
    Quek, Tony Q. S.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (10) : 2832 - 2836
  • [4] Traffic Flow Synthesis Using Generative Adversarial Networks via Semantic Latent Codes Manipulation
    Chen, Yuanyuan
    Lv, Yisheng
    Zhu, Fenghua
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1451 - 1456
  • [5] Semantic Image Synthesis via Location Aware Generative Adversarial Network
    Xu, Jiawei
    Liu, Rui
    Dong, Jing
    Yi, Pengfei
    Fan, Wanshu
    Zhou, Dongsheng
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 791 - 796
  • [6] Enhanced Brain Tumor Classification Through Optimized Semantic Preserved Generative Adversarial Networks
    Chaitanya, Durbhakula M. K.
    Aouthu, Srilakshmi
    Dhanalakshmi, Narra
    Srinivas, Yerram
    Dhanikonda, Srinivasa Rao
    Chinna Rao, B.
    MICROSCOPY RESEARCH AND TECHNIQUE, 2024,
  • [7] Generative Adversarial Networks with Adaptive Semantic Normalization for text-to-image synthesis
    Huang, Siyue
    Chen, Ying
    DIGITAL SIGNAL PROCESSING, 2022, 120
  • [8] Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis
    Long, Jia
    Lu, Hongtao
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 219 - 226
  • [9] FACE PHOTO SYNTHESIS VIA INTERMEDIATE SEMANTIC ENHANCEMENT GENERATIVE ADVERSARIAL NETWORK
    Li, Haoxian
    Zheng, Jieying
    Liu, Feng
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 96 - 100
  • [10] Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network
    Qi, Xingqun
    Sun, Muyi
    Wang, Weining
    Dong, Xiaoxiao
    Li, Qi
    Shan, Caifeng
    2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,