Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Citations: 0
Authors
Nam, Seonghyeon [1]
Kim, Yunji [1]
Kim, Seon Joo [1]
Affiliations
[1] Yonsei Univ, Seoul, South Korea
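Funding
National Research Foundation, Singapore
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract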
This paper addresses the problem of manipulating images using natural language descriptions. Our task aims to semantically modify the visual attributes of an object in an image according to text describing the new visual appearance. Although existing methods synthesize images with new attributes, they do not fully preserve the text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator, which creates word-level local discriminators according to the input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images in which only the regions that correspond to the given text are modified. Experimental results show that our method outperforms existing methods on the CUB and Oxford-102 datasets, and that our results were mostly preferred in a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.
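As a rough illustration of the word-level local discriminator idea described in the abstract, the following Python sketch scores an image feature vector once per word and then combines the per-word scores. It is not the authors' implementation: the names `text_adaptive_score`, `W`, `b_proj`, and `u`, the linear weight generator, the softmax attention over words, and the weighted geometric mean used to combine scores are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(z):
    """Logistic function, applied elementwise."""
    return 1.0 / (1.0 + np.exp(-z))


def softmax(z):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(z - np.max(z))
    return e / e.sum()


def text_adaptive_score(image_feat, word_vecs, W, b_proj, u):
    """Hypothetical text-adaptive discriminator score.

    image_feat: (D,) image feature vector (assumed already extracted).
    word_vecs:  (T, E) one embedding per word of the description.
    W:          (E, D) assumed projection that turns a word vector into
                the weights of that word's local discriminator.
    b_proj:     (E,) assumed projection that yields each word's bias.
    u:          (E,) assumed attention vector scoring word importance.
    """
    # Each word gets its own linear classifier over the image feature.
    weights = word_vecs @ W            # (T, D)
    biases = word_vecs @ b_proj        # (T,)
    local = sigmoid(weights @ image_feat + biases)  # (T,) per-word scores

    # Attention decides how much each word contributes to the final score.
    attn = softmax(word_vecs @ u)      # (T,), sums to 1

    # Weighted geometric mean of the per-word scores (an assumed choice).
    score = float(np.exp(np.sum(attn * np.log(local + 1e-12))))
    return local, attn, score


# Toy usage with random values standing in for learned parameters.
rng = np.random.default_rng(0)
T, E, D = 4, 8, 16
local, attn, score = text_adaptive_score(
    rng.normal(size=D),
    rng.normal(size=(T, E)),
    rng.normal(size=(E, D)) * 0.1,
    rng.normal(size=E) * 0.1,
    rng.normal(size=E),
)
print(local.shape, attn.shape, score)
```

Because each word is scored separately, a low score for one word (for example a colour term) lowers the combined score without the other words having to change, which is one way to read the abstract's claim that attributes are classified independently.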
Pages: 10