Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance

被引:1
|
作者
Watanabe, Yuto [1 ]
Togo, Ren [2 ]
Maeda, Keisuke [2 ]
Ogawa, Takahiro [2 ]
Haseyama, Miki [2 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo 0600814, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo 0600814, Japan
基金
日本学术振兴会;
关键词
Image segmentation; Text recognition; Generative adversarial networks; Image color analysis; Visualization; Image reconstruction; Text processing; Text-guided image manipulation; text-to-image synthesis; generative adversarial network; referring image segmentation;
D O I
10.1109/ACCESS.2023.3269847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a novel text-guided image manipulation method that introduces referring image segmentation into a generative adversarial network. The proposed text-guided image manipulation method aims to manipulate images containing multiple objects while preserving text-unrelated regions. The proposed method assigns the task of distinguishing between text-related and unrelated regions in an image to segmentation guidance based on referring image segmentation. With this architecture, the adversarial generative network can focus on generating new attributes according to the text description and reconstructing text-unrelated regions. For the challenging input images with multiple objects, the experimental results demonstrate that the proposed method outperforms conventional methods in terms of image manipulation precision.
引用
收藏
页码:42534 / 42545
页数:12
相关论文
共 50 条
  • [1] GENERATIVE ADVERSARIAL NETWORK INCLUDING REFERRING IMAGE SEGMENTATION FOR TEXT-GUIDED IMAGE MANIPULATION
    Watanabe, Yuto
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4818 - 4822
  • [2] Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation
    Li, Bowen
    Qi, Xiaojuan
    Torr, Philip H. S.
    Lukasiewicz, Thomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] SEGMENTATION-AWARE TEXT-GUIDED IMAGE MANIPULATION
    Haruyama, Tomoki
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2433 - 2437
  • [4] Text-Guided Human Image Manipulation via Image-Text Shared Space
    Xu, Xiaogang
    Chen, Ying-Cong
    Tao, Xin
    Jia, Jiaya
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6486 - 6500
  • [5] Referring Image Segmentation by Generative Adversarial Learning
    Qiu, Shuang
    Zhao, Yao
    Jiao, Jianbo
    Wei, Yunchao
    Wei, Shikui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1333 - 1344
  • [6] Dilated Residual Aggregation Network for Text-Guided Image Manipulation
    Lu, Siwei
    Luo, Di
    Yang, Zhenguo
    Hao, Tianyong
    Li, Qing
    Liu, Wenyin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 28 - 40
  • [7] Text-Guided Customizable Image Synthesis and Manipulation
    Zhang, Zhiqiang
    Fu, Chen
    Weng, Wei
    Zhou, Jinjia
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [8] Text-Guided Style Transfer-Based Image Manipulation Using Multimodal Generative Models
    Togo, Ren
    Kotera, Megumi
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE ACCESS, 2021, 9 : 64860 - 64870
  • [9] Manipulation Direction: Evaluating Text-Guided Image Manipulation Based on Similarity between Changes in Image and Text Modalities
    Watanabe, Yuto
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    SENSORS, 2023, 23 (22)
  • [10] Diffusion model-based text-guided enhancement network for medical image segmentation
    Dong, Zhiwei
    Yuan, Genji
    Hua, Zhen
    Li, Jinjiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249