A survey on generative adversarial network-based text-to-image synthesis

被引:26
|
作者
Zhou, Rui [1 ]
Jiang, Cong [1 ]
Xu, Qingyang [1 ]
机构
[1] Shandong Univ, Sch Mech Elect & Informat Engn, Weihai 264209, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Generative adversarial network (GAN); Text-to-image synthesis; Scene layout;
D O I
10.1016/j.neucom.2021.04.069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of text-to-image synthesis is a new challenge in the field of image synthesis. In the earlier research, the task of text-to-image synthesis is mainly to achieve the alignment of words and images by the way of retrieval based on the sentences or keywords. With the development of deep learning, especially the application of deep generative models in image synthesis, image synthesis achieves promising progress. The Generative adversarial networks (GANs) are one of the most significant generative models, and GANs have been successfully applied in computer vision, natural language processing and so on. In this paper, we review and summarize the recent research in GANs-based text-to-image synthesis, and provide a summary of the development of classic and advanced models. The input of the GANs-based text-to image synthesis is not only the general text description as earlier studies, also includes scene layout and dialog text. The typical structure of each categories is elaborated. The general text-based image synthesis is the most commonly in the text-to-image synthesis, and it is subdivided into three groups based on the improvements of text information utilization, network structure and output control conditions. Through the survey, the detailed and logical overview of the evolution of GANs-based text-to-image synthesis is presented. Finally, the challenged problems and the future development of text-to-image synthesis are discussed. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:316 / 336
页数:21
相关论文
共 50 条
  • [1] Survey About Generative Adversarial Network and Text-to-Image Synthesis
    Lai, Lina
    Mi, Yu
    Zhou, Longlong
    Rao, Jiyong
    Xu, Tianyang
    Song, Xiaoning
    [J]. Computer Engineering and Applications, 2023, 59 (19): : 21 - 39
  • [2] A survey of generative adversarial networks and their application in text-to-image synthesis
    Zeng, Wu
    Zhu, Heng-liang
    Lin, Chuan
    Xiao, Zheng-ying
    [J]. ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (12): : 7142 - 7181
  • [3] Text-to-image synthesis based on modified deep convolutional generative adversarial network
    Li, Yunhong
    Zhu, Mianyun
    Ren, Jie
    Su, Xueping
    Zhou, Xiaoji
    Yu, Huikang
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (08): : 1875 - 1883
  • [4] Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis
    Peng, Jun
    Zhou, Yiyi
    Sun, Xiaoshuai
    Cao, Liujuan
    Wu, Yongjian
    Huang, Feiyue
    Ji, Rongrong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 (4356-4366) : 4356 - 4366
  • [5] Generative adversarial network based on semantic consistency for text-to-image generation
    Yue Ma
    Li Liu
    Huaxiang Zhang
    Chunjing Wang
    Zekang Wang
    [J]. Applied Intelligence, 2023, 53 : 4703 - 4716
  • [6] Generative adversarial network based on semantic consistency for text-to-image generation
    Ma, Yue
    Liu, Li
    Zhang, Huaxiang
    Wang, Chunjing
    Wang, Zekang
    [J]. APPLIED INTELLIGENCE, 2023, 53 (04) : 4703 - 4716
  • [7] GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
    Tao, Ming
    Bao, Bing-Kun
    Tang, Hao
    Xu, Changsheng
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14214 - 14223
  • [8] Text-to-image generation method based on single stage generative adversarial network
    Yang, Bing
    Na, Wei
    Xiang, Xue-Qin
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (12): : 2412 - 2420
  • [9] TextControlGAN: Text-to-Image Synthesis with Controllable Generative Adversarial Networks
    Ku, Hyeeun
    Lee, Minhyeok
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [10] Enhanced Text-to-Image Synthesis Conditional Generative Adversarial Networks
    Tan, Yong Xuan
    Lee, Chin Poo
    Neo, Mai
    Lim, Kian Ming
    Lim, Jit Yan
    [J]. IAENG International Journal of Computer Science, 2022, 49 (01) : 1 - 7