Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

被引:0
|
作者
Nam, Seonghyeon [1 ]
Kim, Yunji [1 ]
Kim, Seon Joo [1 ]
机构
[1] Yonsei Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of manipulating images using natural language description. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although existing methods synthesize images having new attributes, they do not fully preserve text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator that creates word-level local discriminators according to input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images where only regions that correspond to the given text are modified. Experimental results show that our method outperforms existing methods on CUB and Oxford-102 datasets, and our results were mostly preferred on a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Robust Semantic Transmission of Images with Generative Adversarial Networks
    He, Qi
    Yuan, Haohan
    Feng, Daquan
    Che, Bo
    Chen, Zhi
    Xia, Xiang-Gen
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3953 - 3958
  • [22] Attribute Manipulation Generative Adversarial Networks for Fashion Images
    Ak, Kenan E.
    Lim, Joo Hwee
    Tham, Jo Yew
    Kassim, Ashraf A.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10540 - 10549
  • [23] Generative adversarial networks for extrapolation of corrosion in automobile images
    Von Zuben, Andre
    Viana, Felipe A. C.
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [24] Anonymizing Personal Images Using Generative Adversarial Networks
    Piacentino, Esteban
    Angulo, Cecilio
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2020), 2020, 12108 : 395 - 405
  • [25] Improving Generative Adversarial Networks with Adaptive Control Learning
    Ma, Xiaohan
    Jin, Rize
    Sohn, Kyung-Ah
    Paik, JoonYoung
    Sun, Jing
    Chung, Tae-Sun
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [26] Modified-generative adversarial networks for imbalance text classification
    Poonam Rani
    Om Prakash Verma
    Multimedia Tools and Applications, 2025, 84 (14) : 13865 - 13884
  • [27] Adaptive Weighted Discriminator for Training Generative Adversarial Networks
    Zadorozhnyy, Vasily
    Cheng, Qiang
    Ye, Qiang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4779 - 4788
  • [28] Using Generative Adversarial Networks to Break and Protect Text Captchas
    Ye, Guixin
    Tang, Zhanyong
    Fang, Dingyi
    Zhu, Zhanxing
    Feng, Yansong
    Xu, Pengfei
    Chen, Xiaojiang
    Han, Jungong
    Wang, Zheng
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2020, 23 (02)
  • [29] Synthetic Dataset Generation for Text Recognition with Generative Adversarial Networks
    Efimova, Valeria
    Shalamov, Viacheslav
    Filchenkov, Andrey
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [30] Training Generative Adversarial Networks with Adaptive Composite Gradient
    Qi, Huiqing
    Li, Fang
    Tan, Shengli
    Zhang, Xiangyun
    DATA INTELLIGENCE, 2024, 6 (01) : 120 - 157