Text to Image Generation Using Gan

被引:0
|
作者
Jindal, Rajni [1 ]
Sriram, V. [1 ]
Aggarwal, Vishesh [1 ]
Jain, Vishesh [1 ]
机构
[1] Delhi Technol Univ, New Delhi, India
关键词
Generative adversarial networks; Text to image generation; Progressive GAN; stackGAN; Image generation; Nearest neighbour interpolation; Generator; Discriminator; Wasserstein loss; Equalised learning rate; Mini-batch standard deviation;
D O I
10.1007/978-981-19-2840-6_51
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text to image synthesis, one of the most fascinating applications of GANs, is one of the hottest topics in all of machine learning and artificial intelligence. This paper comprises techniques for training a GAN to synthesise human faces and images of flowers from text descriptions. In this paper, we are proposing to train the GAN progressively as proposed in the ProGAN architecture and along with that trying to improve its results by proposing a custom update rule for alpha which controls the fading rate during the progressive growth of the architecture. With experimental testing using the Oxford102 and LFW datasets, our proposed architecture and training process ensures fast learning and smooth transitions between each trained generation.
引用
收藏
页码:673 / 684
页数:12
相关论文
共 50 条
  • [1] A Survey on Text Description to Image Generation Using GAN
    Yeshasvi, Mogula
    Kayal, P.
    Subetha, T.
    SOFT COMPUTING FOR SECURITY APPLICATIONS, ICSCS 2022, 2023, 1428 : 665 - 675
  • [2] Text to Image Generation with Conformer-GAN
    Deng, Zhiyu
    Yu, Wenxin
    Che, Lu
    Chen, Shiyu
    Zhang, Zhiqiang
    Shang, Jun
    Chen, Peng
    Gong, Jun
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 3 - 14
  • [3] Text to Image Generation with Semantic-Spatial Aware GAN
    Liao, Wentong
    Hu, Kai
    Yang, Michael Ying
    Rosenhahn, Bodo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18166 - 18175
  • [4] DR-GAN: Distribution Regularization for Text-to-Image Generation
    Tan, Hongchen
    Liu, Xiuping
    Yin, Baocai
    Li, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10309 - 10323
  • [5] TEXT TO REALISTIC IMAGE GENERATION USING STACKGAN
    Dhivya, K.
    Navas, N. Sharfaras
    2020 7TH IEEE INTERNATIONAL CONFERENCE ON SMART STRUCTURES AND SYSTEMS (ICSSS 2020), 2020, : 508 - 514
  • [6] Stacking VAE and GAN for Context-aware Text-to-Image Generation
    Zhang, Chenrui
    Peng, Yuxin
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [7] AraBERT and DF-GAN fusion for Arabic text-to-image generation
    Bahani, Mourad
    El Ouaazizi, Aziza
    Maalmi, Khalil
    ARRAY, 2022, 16
  • [8] AraBERT and DF-GAN fusion for Arabic text-to-image generation
    Bahani, Mourad
    El Ouaazizi, Aziza
    Maalmi, Khalil
    Array, 2022, 16
  • [9] ReFIGG: retinal fundus image generation using GAN
    Nair, Sharika Sasidharan
    Meharban, M. S.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2023, 26 (03) : 316 - 323
  • [10] Image Generation Using GAN and Its Classification Using SVM and CNN
    Singh, Aadarsh
    Bansal, Aashutosh
    Chauhan, Nishant
    Sahu, Satya Prakash
    Dewangan, Deepak Kumar
    PROCEEDINGS OF EMERGING TRENDS AND TECHNOLOGIES ON INTELLIGENT SYSTEMS (ETTIS 2021), 2022, 1371 : 89 - 100