Text to Image Generation Using Gan

被引:0
|
作者
Jindal, Rajni [1 ]
Sriram, V. [1 ]
Aggarwal, Vishesh [1 ]
Jain, Vishesh [1 ]
机构
[1] Delhi Technol Univ, New Delhi, India
关键词
Generative adversarial networks; Text to image generation; Progressive GAN; stackGAN; Image generation; Nearest neighbour interpolation; Generator; Discriminator; Wasserstein loss; Equalised learning rate; Mini-batch standard deviation;
D O I
10.1007/978-981-19-2840-6_51
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text to image synthesis, one of the most fascinating applications of GANs, is one of the hottest topics in all of machine learning and artificial intelligence. This paper comprises techniques for training a GAN to synthesise human faces and images of flowers from text descriptions. In this paper, we are proposing to train the GAN progressively as proposed in the ProGAN architecture and along with that trying to improve its results by proposing a custom update rule for alpha which controls the fading rate during the progressive growth of the architecture. With experimental testing using the Oxford102 and LFW datasets, our proposed architecture and training process ensures fast learning and smooth transitions between each trained generation.
引用
收藏
页码:673 / 684
页数:12
相关论文
共 50 条
  • [21] Text and image generation from intracranial electroencephalography using an embedding space for text and images
    Ikegawa, Yuya
    Fukuma, Ryohei
    Sugano, Hidenori
    Oshino, Satoru
    Tani, Naoki
    Tamura, Kentaro
    Iimura, Yasushi
    Suzuki, Hiroharu
    Yamamoto, Shota
    Fujita, Yuya
    Nishimoto, Shinji
    Kishima, Haruhiko
    Yanagisawa, Takufumi
    JOURNAL OF NEURAL ENGINEERING, 2024, 21 (03)
  • [22] The impact of synthetic text generation for sentiment analysis using GAN based models
    Imran, Ali Shariq
    Yang, Ru
    Kastrati, Zenun
    Daudpota, Sher Muhammad
    Shaikh, Sarang
    EGYPTIAN INFORMATICS JOURNAL, 2022, 23 (03) : 547 - 557
  • [23] Evaluating Text-to-Visual Generation with Image-to-Text Generation
    Lin, Zhiqiu
    Athaki, Deepak
    Li, Baiqi
    Li, Jiayao
    Xia, Xide
    Neubig, Graham
    Zhang, Pengchuan
    Ramanan, Deva
    COMPUTER VISION - ECCV 2024, PT IX, 2025, 15067 : 366 - 384
  • [24] Expressive Text-to-Image Generation with Rich Text
    Ge, Songwei
    Park, Taesung
    Zhu, Jun-Yan
    Huang, Jia-Bin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7511 - 7522
  • [25] CONTEXT-GAN: CONTROLLABLE CONTEXT IMAGE GENERATION USING GANS
    Hostin, Marc-Adrien
    Sivtsov, Vladimir
    Attarian, Shahram
    Bendahan, David
    Bellemare, Marc-Emmanuel
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [26] DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
    Huang, Mengqi
    Mao, Zhendong
    Wang, Penghui
    Wang, Quan
    Zhang, Yongdong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4345 - 4354
  • [27] DAC-GAN: Dual Auxiliary Consistency Generative Adversarial Network for Text-to-Image Generation
    Wang, Zhiwei
    Yang, Jing
    Cui, Jiajun
    Liu, Jiawei
    Wang, Jiahao
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 3 - 19
  • [28] Text-guided floral image generation based on lightweight deep attention feature fusion GAN
    Yang, Wenji
    An, Hang
    Hu, Wenchao
    Ma, Xinxin
    Xie, Liping
    VISUAL COMPUTER, 2024, : 3519 - 3535
  • [29] Image Generation from Text and Segmentation
    Osugi, Masato
    Vargas, Danilo Vasconcellos
    2022 TENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS, CANDARW, 2022, : 206 - 211
  • [30] Controllable Text-to-Image Generation
    Li, Bowen
    Qi, Xiaojuan
    Lukasiewicz, Thomas
    Torr, Philip H. S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32