Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Cited by: 0
Authors
Nam, Seonghyeon [1 ]
Kim, Yunji [1 ]
Kim, Seon Joo [1 ]
Affiliations
[1] Yonsei Univ, Seoul, South Korea
Funding
National Research Foundation, Singapore;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper addresses the problem of manipulating images using natural language descriptions. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although existing methods synthesize images having new attributes, they do not fully preserve text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator that creates word-level local discriminators according to input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images where only regions that correspond to the given text are modified. Experimental results show that our method outperforms existing methods on the CUB and Oxford-102 datasets, and our results were mostly preferred in a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.
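The abstract does not give the discriminator's equations, so the following is only a toy sketch of the general idea of word-level local discriminators, written under stated assumptions. Each word vector is used directly as the weight of a linear classifier on a pooled image feature, and the per-word scores are combined as an attention-weighted geometric mean. The function names (`local_score`, `text_adaptive_score`), the weighting scheme, and the toy vectors are all hypothetical and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def local_score(word_vec, image_feat):
    """Hypothetical word-level local discriminator.

    Assumption: the word vector itself is the weight of a linear
    classifier that asks "does the image show this attribute?".
    """
    return sigmoid(dot(word_vec, image_feat))


def text_adaptive_score(word_vecs, image_feat, attn_logits):
    """Combine per-word scores into one sentence-level score.

    Assumption: scores are merged as a weighted geometric mean, with
    weights given by a softmax over per-word attention logits, so each
    word is judged independently and any mismatched word lowers the total.
    """
    weights = softmax(attn_logits)
    log_score = sum(
        w * math.log(local_score(v, image_feat))
        for w, v in zip(weights, word_vecs)
    )
    return math.exp(log_score)


# Toy usage: two "words", one matching the image feature and one not.
words = [[1.0, 0.0], [0.0, 1.0]]
feature = [2.0, -2.0]
print(text_adaptive_score(words, feature, [0.0, 0.0]))
```

In this sketch a sentence scores highly only if every attended word matches the image, which is one simple way to read "classify fine-grained attributes independently".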
Pages: 10
Related papers
50 in total
  • [41] A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning
    Xu, Huali
    Su, Xiangdong
    Liu, Tongyang
    Guo, Pengcheng
    Gao, Guanglai
    Bao, Feilong
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 65 - 73
  • [42] Survey: application and analysis of generative adversarial networks in medical images
    Heng, Yang
    Ma, Yinghua
    Khan, Fiaz Gul
    Khan, Ahmad
    Ali, Farman
    Alzubi, Ahmad Ali
    Hui, Zeng
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (02)
  • [43] Fusion of Images of Different Spectra Based on Generative Adversarial Networks
    Yu. V. Vizil’ter
    O. V. Vygolov
    D. V. Komarov
    M. A. Lebedev
    Journal of Computer and Systems Sciences International, 2019, 58 : 441 - 453
  • [44] Fusion of Images of Different Spectra Based on Generative Adversarial Networks
    Vizil'ter, Yu. V.
    Vygolov, O. V.
    Komarov, D. V.
    Lebedev, M. A.
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 2019, 58 (03) : 441 - 453
  • [45] Segmentation of Cervical Cell Images Based on Generative Adversarial Networks
    Huang, Jinjie
    Yang, Guihua
    Li, Biao
    He, Yongjun
    Liang, Yani
    IEEE ACCESS, 2021, 9 : 115415 - 115428
  • [46] Standardised images of novel objects created with generative adversarial networks
    Cooper, Patrick S.
    Colton, Emily
    Bode, Stefan
    Chong, Trevor T. -J.
    SCIENTIFIC DATA, 2023, 10 (01)
  • [47] Road Extraction with UAV Images Based on Generative Adversarial Networks
    He L.
    Li Y.-X.
    Peng B.
    Wu H.-P.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2019, 48 (04) : 580 - 585
  • [48] SYNTHESIS OF IMAGES BY TWO-STAGE GENERATIVE ADVERSARIAL NETWORKS
    Huang, Qiang
    Jackson, Philip J. B.
    Plumbley, Mark D.
    Wang, Wenwu
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1593 - 1597
  • [49] Improved generative adversarial networks for denoising fabric defect images
    Hu, Xudong
    Wang, Tao
    Yu, Bo
    Dai, Ning
    Shen, Chunya
    Yuan, Yanhong
    TEXTILE RESEARCH JOURNAL, 2025,
  • [50] Visualizing Near Infrared Hyperspectral Images with Generative Adversarial Networks
    Tang, Rongxin
    Liu, Hualin
    Wei, Jingbo
    REMOTE SENSING, 2020, 12 (23) : 1 - 19