Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance

被引:1
|
作者
Watanabe, Yuto [1 ]
Togo, Ren [2 ]
Maeda, Keisuke [2 ]
Ogawa, Takahiro [2 ]
Haseyama, Miki [2 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo 0600814, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo 0600814, Japan
基金
日本学术振兴会;
关键词
Image segmentation; Text recognition; Generative adversarial networks; Image color analysis; Visualization; Image reconstruction; Text processing; Text-guided image manipulation; text-to-image synthesis; generative adversarial network; referring image segmentation;
D O I
10.1109/ACCESS.2023.3269847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a novel text-guided image manipulation method that introduces referring image segmentation into a generative adversarial network. The proposed text-guided image manipulation method aims to manipulate images containing multiple objects while preserving text-unrelated regions. The proposed method assigns the task of distinguishing between text-related and unrelated regions in an image to segmentation guidance based on referring image segmentation. With this architecture, the adversarial generative network can focus on generating new attributes according to the text description and reconstructing text-unrelated regions. For the challenging input images with multiple objects, the experimental results demonstrate that the proposed method outperforms conventional methods in terms of image manipulation precision.
引用
收藏
页码:42534 / 42545
页数:12
相关论文
共 50 条
  • [41] Edge-Guided Generative Adversarial Network for Image Inpainting
    Xu, Shunxin
    Liu, Dong
    Xiong, Zhiwei
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [42] Interactions Guided Generative Adversarial Network for unsupervised image captioning
    Cao, Shan
    An, Gaoyun
    Zheng, Zhenxing
    Ruan, Qiuqi
    NEUROCOMPUTING, 2020, 417 : 419 - 431
  • [43] A CONTEXT-BASED NETWORK FOR REFERRING IMAGE SEGMENTATION
    Li, Xinyu
    Liu, Yu
    Xu, Kaiping
    Zhao, Zhehuan
    Liu, Sipei
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1436 - 1440
  • [44] Text-to-image synthesis based on modified deep convolutional generative adversarial network
    Li Y.
    Zhu M.
    Ren J.
    Su X.
    Zhou X.
    Yu H.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (08): : 1875 - 1883
  • [45] Text-to-image generation method based on single stage generative adversarial network
    Yang B.
    Na W.
    Xiang X.-Q.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (12): : 2412 - 2420
  • [46] Compound Text-Guided Prompt Tuning via Image-Adaptive Cues
    Tan, Hao
    Li, Jun
    Zhou, Yizhuang
    Wan, Jun
    Lei, Zhen
    Zhang, Xiangyu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 5061 - 5069
  • [47] Fundus Image Segmentation Based on Improved Generative Adversarial Network for Retinal Vessel Analysis
    He, Jin
    Jiang, Dan
    2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 231 - 236
  • [48] Survey About Generative Adversarial Network and Text-to-Image Synthesis
    Lai, Lina
    Mi, Yu
    Zhou, Longlong
    Rao, Jiyong
    Xu, Tianyang
    Song, Xiaoning
    Computer Engineering and Applications, 2023, 59 (19): : 21 - 39
  • [49] Semantic Map Based Image Compression via Conditional Generative Adversarial Network
    Wei, Zhensong
    Liao, Zeyi
    Bai, Huihui
    Zhao, Yao
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 13 - 22
  • [50] Infrared and visible image fusion based on guided hybrid model and generative adversarial network
    Tang, LiLi
    Liu, Gang
    Xiao, Gang
    Bavirisetti, Durga Prasad
    Zhang, XiangBo
    INFRARED PHYSICS & TECHNOLOGY, 2022, 120