Text to Image Generation Using Gan

被引：0

作者：

Jindal, Rajni ^{[1
]}

Sriram, V. ^{[1
]}

Aggarwal, Vishesh ^{[1
]}

Jain, Vishesh ^{[1
]}

机构：

[1] Delhi Technol Univ, New Delhi, India

来源：

PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2022 | 2023年 / 475卷

关键词：

Generative adversarial networks; Text to image generation; Progressive GAN; stackGAN; Image generation; Nearest neighbour interpolation; Generator; Discriminator; Wasserstein loss; Equalised learning rate; Mini-batch standard deviation;

D O I：

10.1007/978-981-19-2840-6_51

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text to image synthesis, one of the most fascinating applications of GANs, is one of the hottest topics in all of machine learning and artificial intelligence. This paper comprises techniques for training a GAN to synthesise human faces and images of flowers from text descriptions. In this paper, we are proposing to train the GAN progressively as proposed in the ProGAN architecture and along with that trying to improve its results by proposing a custom update rule for alpha which controls the fading rate during the progressive growth of the architecture. With experimental testing using the Oxford102 and LFW datasets, our proposed architecture and training process ensures fast learning and smooth transitions between each trained generation.

引用

页码：673 / 684

页数：12

共 50 条

[1] A Survey on Text Description to Image Generation Using GAN
Yeshasvi, Mogula
Kayal, P.
Subetha, T.
SOFT COMPUTING FOR SECURITY APPLICATIONS, ICSCS 2022, 2023, 1428 : 665 - 675
[2] Text to Image Generation with Conformer-GAN
Deng, Zhiyu
Yu, Wenxin
Che, Lu
Chen, Shiyu
Zhang, Zhiqiang
Shang, Jun
Chen, Peng
Gong, Jun
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 3 - 14
[3] Text to Image Generation with Semantic-Spatial Aware GAN
Liao, Wentong
Hu, Kai
Yang, Michael Ying
Rosenhahn, Bodo
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18166 - 18175
[4] DR-GAN: Distribution Regularization for Text-to-Image Generation
Tan, Hongchen
Liu, Xiuping
Yin, Baocai
Li, Xin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10309 - 10323
[5] TEXT TO REALISTIC IMAGE GENERATION USING STACKGAN
Dhivya, K.
Navas, N. Sharfaras
2020 7TH IEEE INTERNATIONAL CONFERENCE ON SMART STRUCTURES AND SYSTEMS (ICSSS 2020), 2020, : 508 - 514
[6] Stacking VAE and GAN for Context-aware Text-to-Image Generation
Zhang, Chenrui
Peng, Yuxin
2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
[7] AraBERT and DF-GAN fusion for Arabic text-to-image generation
Bahani, Mourad
El Ouaazizi, Aziza
Maalmi, Khalil
ARRAY, 2022, 16
[8] AraBERT and DF-GAN fusion for Arabic text-to-image generation
Bahani, Mourad
El Ouaazizi, Aziza
Maalmi, Khalil
Array, 2022, 16
[9] ReFIGG: retinal fundus image generation using GAN
Nair, Sharika Sasidharan
Meharban, M. S.
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2023, 26 (03) : 316 - 323
[10] Image Generation Using GAN and Its Classification Using SVM and CNN
Singh, Aadarsh
Bansal, Aashutosh
Chauhan, Nishant
Sahu, Satya Prakash
Dewangan, Deepak Kumar
PROCEEDINGS OF EMERGING TRENDS AND TECHNOLOGIES ON INTELLIGENT SYSTEMS (ETTIS 2021), 2022, 1371 : 89 - 100

← 1 2 3 4 5 →