Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Cited by: 0
Authors
Nam, Seonghyeon [1]
Kim, Yunji [1]
Kim, Seon Joo [1]
Affiliations
[1] Yonsei Univ, Seoul, South Korea
Funding
National Research Foundation, Singapore
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of manipulating images using natural language descriptions. Our task aims to semantically modify the visual attributes of an object in an image according to text describing the new visual appearance. Although existing methods synthesize images with new attributes, they do not fully preserve the text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator, which creates word-level local discriminators according to the input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images in which only the regions that correspond to the given text are modified. Experimental results show that our method outperforms existing methods on the CUB and Oxford-102 datasets, and our results were mostly preferred in a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.
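The idea of word-level local discriminators can be illustrated with a small sketch. This is not the authors' implementation: it assumes that each word supplies its own weight vector and bias for a linear classifier over a single pooled image feature, and that the per-word scores are combined by an attention-weighted geometric mean. The names `word_level_scores` and `combine_scores`, the toy vectors, and the use of plain Python lists instead of a deep learning framework are all hypothetical choices made for illustration.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def word_level_scores(image_feature, word_weights, word_biases):
    """One local score per word: sigmoid(w_i . v + b_i).

    Assumption: in a real model, w_i and b_i would be produced from the
    word embedding by a learned layer; here they are passed in directly.
    """
    return [
        sigmoid(dot(w, image_feature) + b)
        for w, b in zip(word_weights, word_biases)
    ]


def combine_scores(scores, attention_logits):
    """Attention-weighted geometric mean of the per-word scores.

    Computed in log space; scores are clamped away from zero so that
    math.log never receives 0.
    """
    alphas = softmax(attention_logits)
    log_score = sum(
        a * math.log(max(s, 1e-12)) for a, s in zip(alphas, scores)
    )
    return math.exp(log_score)


# Toy example with two words and a 2-dimensional image feature.
image_feature = [1.0, 0.0]
word_weights = [[2.0, 0.0], [0.0, 2.0]]  # hypothetical per-word classifiers
word_biases = [0.0, 0.0]
attention_logits = [0.0, 0.0]  # equal attention on both words

scores = word_level_scores(image_feature, word_weights, word_biases)
combined = combine_scores(scores, attention_logits)
print(scores, combined)
```

Because each word gets its own score, a low score on one word (here the second) lowers the combined result even when another word matches well, which is one way to read the abstract's claim that fine-grained attributes are classified independently.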
Pages: 10
Related papers
50 in total
  • [1] Text-to-Text Generative Adversarial Networks
    Li, Changliang
    Su, Yixin
    Liu, Wenju
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018
  • [2] Learning to Draw Text in Natural Images with Conditional Adversarial Networks
    Fang, Shancheng
    Xie, Hongtao
    Chen, Jianjun
    Tan, Jianlong
    Zhang, Yongdong
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 715 - 722
  • [3] Text Conditioned Generative Adversarial Networks Generating Images and Videos: A Critical Review
    Rayeesa Mehmood
    Rumaan Bashir
    Kaiser J. Giri
    SN Computer Science, 5 (7)
  • [4] Generative adversarial networks for reconstructing natural images from brain activity
    Seeliger, K.
    Guclu, U.
    Ambrogioni, L.
    Gucluturk, Y.
    van Gerven, M. A. J.
    NEUROIMAGE, 2018, 181 : 775 - 785
  • [5] Sign Language Video Generation from Text Using Generative Adversarial Networks
    Sreemathy, R.
    Chordiya, Param
    Khurana, Soumya
    Turuk, Mousami
    OPTICAL MEMORY AND NEURAL NETWORKS, 2024, 33 (04) : 466 - 476
  • [6] Generative Adversarial Networks with Adaptive Semantic Normalization for text-to-image synthesis
    Huang, Siyue
    Chen, Ying
    DIGITAL SIGNAL PROCESSING, 2022, 120
  • [7] Beautification of images by generative adversarial networks
    Music, Amar
    Maerten, Anne-Sofie
    Wagemans, Johan
    JOURNAL OF VISION, 2023, 23 (10): 14
  • [8] Survey on Latest Advances in Natural Language Processing Applications of Generative Adversarial Networks
    Koc, Canan
    Ozyurt, Fatih
    Iantovics, Lazsla Barna
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2025, 15 (01)
  • [9] Generative Adversarial Networks Using Adaptive Convolution
    Nguyen, Nhat M.
    Ray, Nilanjan
    2019 16TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2019), 2019, : 129 - 134
  • [10] A Research on Generative Adversarial Networks Applied to Text Generation
    Zhang, Chao
    Xiong, Caiquan
    Wang, Lingyun
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 913 - 917