Generative adversarial network based on semantic consistency for text-to-image generation

被引:0
|
作者
Yue Ma
Li Liu
Huaxiang Zhang
Chunjing Wang
Zekang Wang
机构
[1] Shandong Normal University,School of Information Science and Engineering
来源
Applied Intelligence | 2023年 / 53卷
关键词
Text-to-image generation; Generative adversarial network; Hybrid attention mechanism; Semantic consistency;
D O I
暂无
中图分类号
学科分类号
摘要
Although text-to-image generation technology has made significant progress in visually realistic images, the generated images cannot be completely consistent with the texts. In this paper, a novel generative adversarial network based on semantic consistency is proposed to generate semantically consistent and realistic images according to text descriptions. The proposed method explores the semantic consistency between text and image for an efficient cross-modal generation that combines image generation and semantic correlation. A generation network with a hybrid attention is utilized to generate different resolution images, which improves the authenticity of the generated images. In addition, a semantic comparison module is presented to map the texts and the generated images to the same semantic space for comparison through consistency refinement and information classification. Extensive experiments on public benchmark datasets demonstrate that the proposed method outperforms the comparative methods.
引用
收藏
页码:4703 / 4716
页数:13
相关论文
共 50 条
  • [1] Generative adversarial network based on semantic consistency for text-to-image generation
    Ma, Yue
    Liu, Li
    Zhang, Huaxiang
    Wang, Chunjing
    Wang, Zekang
    [J]. APPLIED INTELLIGENCE, 2023, 53 (04) : 4703 - 4716
  • [2] Multi-Semantic Fusion Generative Adversarial Network for Text-to-Image Generation
    Huang, Pingda
    Liu, Yedan
    Fu, Chunjiang
    Zhao, Liang
    [J]. 2023 IEEE 8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, ICBDA, 2023, : 159 - 164
  • [3] DAC-GAN: Dual Auxiliary Consistency Generative Adversarial Network for Text-to-Image Generation
    Wang, Zhiwei
    Yang, Jing
    Cui, Jiajun
    Liu, Jiawei
    Wang, Jiahao
    [J]. COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 3 - 19
  • [4] Text-to-image generation method based on single stage generative adversarial network
    Yang, Bing
    Na, Wei
    Xiang, Xue-Qin
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (12): : 2412 - 2420
  • [5] DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
    Huang, Mengqi
    Mao, Zhendong
    Wang, Penghui
    Wang, Quan
    Zhang, Yongdong
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4345 - 4354
  • [6] STRUCTURE-AWARE GENERATIVE ADVERSARIAL NETWORK FOR TEXT-TO-IMAGE GENERATION
    Chen, Wenjie
    Ni, Zhangkai
    Wang, Hanli
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2075 - 2079
  • [7] Text-to-Image Generation Method Based on Image-Text Semantic Consistency
    Xue, Zhihang
    Xu, Zheming
    Lang, Congyan
    Feng, Songhe
    Wang, Tao
    Li, Yidong
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (09): : 2180 - 2190
  • [8] PMGAN: pretrained model-based generative adversarial network for text-to-image generation
    Yu, Yue
    Yang, Yue
    Xing, Jingshuo
    [J]. VISUAL COMPUTER, 2024,
  • [9] Generative adversarial text-to-image generation with style image constraint
    Zekang Wang
    Li Liu
    Huaxiang Zhang
    Dongmei Liu
    Yu Song
    [J]. Multimedia Systems, 2023, 29 : 3291 - 3303
  • [10] Generative adversarial text-to-image generation with style image constraint
    Wang, Zekang
    Liu, Li
    Zhang, Huaxiang
    Liu, Dongmei
    Song, Yu
    [J]. MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3291 - 3303