Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance

Cited by: 1

Authors:

Watanabe, Yuto [1]

Togo, Ren [2]

Maeda, Keisuke [2]

Ogawa, Takahiro [2]

Haseyama, Miki [2]

Affiliations:

[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo 0600814, Japan

[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo 0600814, Japan

Source:

IEEE ACCESS | 2023 / Vol. 11

Funding:

Japan Society for the Promotion of Science;

Keywords:

Image segmentation; Text recognition; Generative adversarial networks; Image color analysis; Visualization; Image reconstruction; Text processing; Text-guided image manipulation; text-to-image synthesis; generative adversarial network; referring image segmentation;

DOI:

10.1109/ACCESS.2023.3269847

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This study proposes a novel text-guided image manipulation method that introduces referring image segmentation into a generative adversarial network. The method aims to manipulate images containing multiple objects while preserving text-unrelated regions. It assigns the task of distinguishing between text-related and text-unrelated regions in an image to segmentation guidance based on referring image segmentation. With this architecture, the generative adversarial network can focus on generating new attributes according to the text description and on reconstructing text-unrelated regions. On challenging input images containing multiple objects, the experimental results demonstrate that the proposed method outperforms conventional methods in terms of image manipulation precision.
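
The abstract gives no formulas, so the following is only a minimal illustrative sketch of the general idea of mask-based guidance, not the authors' architecture. It assumes a binary mask (1 for text-related pixels, 0 for text-unrelated pixels) is already available from some referring image segmentation model. The function names `composite` and `unrelated_l1`, the grayscale nested-list image format, and the compositing step itself are all hypothetical choices made for illustration.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel intensities (assumed format)
Mask = List[List[int]]     # 1 = text-related pixel, 0 = text-unrelated pixel (assumed convention)


def composite(original: Image, generated: Image, mask: Mask) -> Image:
    """Take generated pixels where the mask is 1 and original pixels elsewhere."""
    return [
        [g if m == 1 else o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]


def unrelated_l1(original: Image, output: Image, mask: Mask) -> float:
    """Mean absolute difference over text-unrelated pixels (mask == 0)."""
    diffs = [
        abs(o - p)
        for o_row, p_row, m_row in zip(original, output, mask)
        for o, p, m in zip(o_row, p_row, m_row)
        if m == 0
    ]
    # An image with no text-unrelated pixels has nothing to preserve.
    return sum(diffs) / len(diffs) if diffs else 0.0
```

A lower `unrelated_l1` value indicates that text-unrelated regions are better preserved. In this sketch, an output produced by `composite` scores exactly zero because those pixels are copied unchanged from the input.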

Citation

Pages: 42534-42545

Page count: 12
