Text to Image Generation Using Gan

被引：0

作者：

Jindal, Rajni ^{[1
]}

Sriram, V. ^{[1
]}

Aggarwal, Vishesh ^{[1
]}

Jain, Vishesh ^{[1
]}

机构：

[1] Delhi Technol Univ, New Delhi, India

来源：

PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2022 | 2023年 / 475卷

关键词：

Generative adversarial networks; Text to image generation; Progressive GAN; stackGAN; Image generation; Nearest neighbour interpolation; Generator; Discriminator; Wasserstein loss; Equalised learning rate; Mini-batch standard deviation;

D O I：

10.1007/978-981-19-2840-6_51

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text to image synthesis, one of the most fascinating applications of GANs, is one of the hottest topics in all of machine learning and artificial intelligence. This paper comprises techniques for training a GAN to synthesise human faces and images of flowers from text descriptions. In this paper, we are proposing to train the GAN progressively as proposed in the ProGAN architecture and along with that trying to improve its results by proposing a custom update rule for alpha which controls the fading rate during the progressive growth of the architecture. With experimental testing using the Oxford102 and LFW datasets, our proposed architecture and training process ensures fast learning and smooth transitions between each trained generation.

引用

页码：673 / 684

页数：12

共 50 条

[21] Text and image generation from intracranial electroencephalography using an embedding space for text and images
Ikegawa, Yuya
Fukuma, Ryohei
Sugano, Hidenori
Oshino, Satoru
Tani, Naoki
Tamura, Kentaro
Iimura, Yasushi
Suzuki, Hiroharu
Yamamoto, Shota
Fujita, Yuya
Nishimoto, Shinji
Kishima, Haruhiko
Yanagisawa, Takufumi
JOURNAL OF NEURAL ENGINEERING, 2024, 21 (03)
[22] The impact of synthetic text generation for sentiment analysis using GAN based models
Imran, Ali Shariq
Yang, Ru
Kastrati, Zenun
Daudpota, Sher Muhammad
Shaikh, Sarang
EGYPTIAN INFORMATICS JOURNAL, 2022, 23 (03) : 547 - 557
[23] Evaluating Text-to-Visual Generation with Image-to-Text Generation
Lin, Zhiqiu
Athaki, Deepak
Li, Baiqi
Li, Jiayao
Xia, Xide
Neubig, Graham
Zhang, Pengchuan
Ramanan, Deva
COMPUTER VISION - ECCV 2024, PT IX, 2025, 15067 : 366 - 384
[24] Expressive Text-to-Image Generation with Rich Text
Ge, Songwei
Park, Taesung
Zhu, Jun-Yan
Huang, Jia-Bin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7511 - 7522
[25] CONTEXT-GAN: CONTROLLABLE CONTEXT IMAGE GENERATION USING GANS
Hostin, Marc-Adrien
Sivtsov, Vladimir
Attarian, Shahram
Bendahan, David
Bellemare, Marc-Emmanuel
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
[26] DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Huang, Mengqi
Mao, Zhendong
Wang, Penghui
Wang, Quan
Zhang, Yongdong
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4345 - 4354
[27] DAC-GAN: Dual Auxiliary Consistency Generative Adversarial Network for Text-to-Image Generation
Wang, Zhiwei
Yang, Jing
Cui, Jiajun
Liu, Jiawei
Wang, Jiahao
COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 3 - 19
[28] Text-guided floral image generation based on lightweight deep attention feature fusion GAN
Yang, Wenji
An, Hang
Hu, Wenchao
Ma, Xinxin
Xie, Liping
VISUAL COMPUTER, 2024, : 3519 - 3535
[29] Image Generation from Text and Segmentation
Osugi, Masato
Vargas, Danilo Vasconcellos
2022 TENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS, CANDARW, 2022, : 206 - 211
[30] Controllable Text-to-Image Generation
Li, Bowen
Qi, Xiaojuan
Lukasiewicz, Thomas
Torr, Philip H. S.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32

← 1 2 3 4 5 →