High-Resolution Realistic Image Synthesis from Text Using Iterative Generative Adversarial Network

被引:0
|
作者
Ullah, Anwar [1 ]
Yu, Xinguo [1 ]
Majid, Abdul [1 ]
Rahman, Hafiz Ur [2 ]
Mughal, M. Farhan [3 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Peoples R China
[2] Guangzhou Univ, Sch Comp Sci, Guangzhou 510006, Peoples R China
[3] Tianjin Univ Finance & Econ, Tianjin, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Generative Adversarial Network (GAN); Iterative GAN; Text-to-image synthesis; CUB dataset; Oxford-102; dataset; Inception score; Human rank;
D O I
10.1007/978-3-030-34879-3_17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Synthesizing high-resolution realistic images from text description using one iteration Generative Adversarial Network (GAN) is difficult without using any additional techniques because mostly the blurry artifacts and mode collapse problems are occurring. To reduce these problems, this paper proposes an Iterative Generative Adversarial Network (iGAN) which takes three iterations to synthesize highresolution realistic image from their text description. In the 1st iteration, GAN synthesizes a low-resolution 64 x 64 pixels basic shape and basic color image from the text description with less mode collapse and blurry artifacts problems. In the 2nd iteration, GAN takes the result of the 1st iteration and text description again and synthesizes a better resolution 128 x 128 pixels better shape and well color image with very less mode collapse and blurry artifacts problems. In the last iteration, GAN takes the result of the 2nd iteration and text description as well and synthesizes a high-resolution 256x256 well shape and clear image with almost no mode collapse and blurry artifacts problems. Our proposed iGAN shows a significant performance on CUB birds and Oxford-102 flowers datasets. Moreover, iGAN improves the inception score and human rank as compare to the other state-of-the-art methods.
引用
收藏
页码:211 / 224
页数:14
相关论文
共 50 条
  • [1] High-resolution concrete damage image synthesis using conditional generative adversarial network
    Li, Shengyuan
    Zhao, Xuefeng
    [J]. AUTOMATION IN CONSTRUCTION, 2023, 147
  • [2] Hierarchically-fused Generative Adversarial Network for text to realistic image synthesis
    Huang, Xin
    Wang, Mingjie
    Gong, Minglun
    [J]. 2019 16TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2019), 2019, : 73 - 80
  • [3] Realistic High-Resolution Body Computed Tomography Image Synthesis by Using Progressive Growing Generative Adversarial Network: Visual Turing Test
    Park, Ho Young
    Bae, Hyun-Jin
    Hong, Gil-Sun
    Kim, Minjee
    Yun, JiHye
    Park, Sungwon
    Chung, Won Jung
    Kim, NamKug
    [J]. JMIR MEDICAL INFORMATICS, 2021, 9 (03)
  • [4] Synthesizing realistic high-resolution retina image by style-based generative adversarial network and its utilization
    Kim, Mingyu
    Kim, You Na
    Jang, Miso
    Hwang, Jeongeun
    Kim, Hong-Kyu
    Yoon, Sang Chul
    Kim, Yoon Jeon
    Kim, Namkug
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Synthesizing realistic high-resolution retina image by style-based generative adversarial network and its utilization
    Mingyu Kim
    You Na Kim
    Miso Jang
    Jeongeun Hwang
    Hong-Kyu Kim
    Sang Chul Yoon
    Yoon Jeon Kim
    Namkug Kim
    [J]. Scientific Reports, 12
  • [6] TEXT TO IMAGE SYNTHESIS WITH BIDIRECTIONAL GENERATIVE ADVERSARIAL NETWORK
    Wang, Zixu
    Quan, Zhe
    Wang, Zhi-Jie
    Hu, Xinjian
    Chen, Yangyang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [7] High-resolution dermoscopy image synthesis with conditional generative adversarial networks
    Ding, Saisai
    Zheng, Jian
    Liu, Zhaobang
    Zheng, Yanyan
    Chen, Yanmei
    Xu, Xiaomin
    Lu, Jia
    Xie, Jing
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 64
  • [8] High-resolution knee plain radiography image synthesis using style generative adversarial network adaptive discriminator augmentation
    Gun Ahn
    Choi, Byung S.
    Ko, Sunho
    Jo, Changwung
    Han, Hyuk-Soo
    Lee, Myung Chul
    Du Hyun Ro
    [J]. JOURNAL OF ORTHOPAEDIC RESEARCH, 2023, 41 (01) : 84 - 93
  • [9] Realistic Sonar Image Simulation Using Generative Adversarial Network
    Sung, Minsung
    Kim, Jason
    Kim, Juhwan
    Yu, Son-Cheol
    [J]. IFAC PAPERSONLINE, 2019, 52 (21): : 291 - 296
  • [10] Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network
    Wang, Min
    Lang, Congyan
    Feng, Songhe
    Wang, Tao
    Jin, Yi
    Li, Yidong
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74