Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

被引：0

作者：

Nam, Seonghyeon ^{[1
]}

Kim, Yunji ^{[1
]}

Kim, Seon Joo ^{[1
]}

机构：

[1] Yonsei Univ, Seoul, South Korea

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the problem of manipulating images using natural language description. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although existing methods synthesize images having new attributes, they do not fully preserve text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator that creates word-level local discriminators according to input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images where only regions that correspond to the given text are modified. Experimental results show that our method outperforms existing methods on CUB and Oxford-102 datasets, and our results were mostly preferred on a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.

引用

页数：10

共 50 条

[31] DRAWGAN: TEXT TO IMAGE SYNTHESIS WITH DRAWING GENERATIVE ADVERSARIAL NETWORKS
Zhang, Zhiqiang
Zhou, Jinjia
Yu, Wenxin
Jiang, Ning
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4195 - 4199
[32] Training Generative Adversarial Networks with Adaptive Composite Gradient
Huiqing Qi
Fang Li
Shengli Tan
Xiangyun Zhang
Data Intelligence, 2024, 6 (01) : 120 - 157
[33] Can Generative Adversarial Networks Teach Themselves Text Segmentation?
Al-Rawi, Mohammed
Bazazian, Dena
Valveny, Ernest
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3342 - 3350
[34] Learning Representations of Natural Language Texts with Generative Adversarial Networks at Document, Sentence, and Aspect Level
Vlachostergiou, Aggeliki
Caridakis, George
Mylonas, Phivos
Stafylopatis, Andreas
ALGORITHMS, 2018, 11 (10)
[35] Generate Desired Images from Trained Generative Adversarial Networks
Li, Ming
Xi, Rui
Chen, Beier
Hou, Mengshu
Liu, Daibo
Guo, Lei
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[36] Resizing and cleaning of histopathological images using generative adversarial networks
Celik, Gaffari
Talu, Muhammed Fatih
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 554
[37] Classification of Hyperspectral Images via Multitask Generative Adversarial Networks
Hang, Renlong
Zhou, Feng
Liu, Qingshan
Ghamisi, Pedram
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (02): : 1424 - 1436
[38] On Generating Synthetic Histopathology Images Using Generative Adversarial Networks
Carmody, Sean
John, Deepu
2023 34TH IRISH SIGNALS AND SYSTEMS CONFERENCE, ISSC, 2023,
[39] Standardised images of novel objects created with generative adversarial networks
Patrick S. Cooper
Emily Colton
Stefan Bode
Trevor T.-J. Chong
Scientific Data, 10 (1)
[40] Survey of Quantum Generative Adversarial Networks (QGAN) to Generate Images
Pajuhanfard, Mohammadsaleh
Kiani, Rasoul
Sheng, Victor S.
MATHEMATICS, 2024, 12 (23)

← 1 2 3 4 5 →