Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance

被引:1
|
作者
Watanabe, Yuto [1 ]
Togo, Ren [2 ]
Maeda, Keisuke [2 ]
Ogawa, Takahiro [2 ]
Haseyama, Miki [2 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo 0600814, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo 0600814, Japan
基金
日本学术振兴会;
关键词
Image segmentation; Text recognition; Generative adversarial networks; Image color analysis; Visualization; Image reconstruction; Text processing; Text-guided image manipulation; text-to-image synthesis; generative adversarial network; referring image segmentation;
D O I
10.1109/ACCESS.2023.3269847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a novel text-guided image manipulation method that introduces referring image segmentation into a generative adversarial network. The proposed text-guided image manipulation method aims to manipulate images containing multiple objects while preserving text-unrelated regions. The proposed method assigns the task of distinguishing between text-related and unrelated regions in an image to segmentation guidance based on referring image segmentation. With this architecture, the adversarial generative network can focus on generating new attributes according to the text description and reconstructing text-unrelated regions. For the challenging input images with multiple objects, the experimental results demonstrate that the proposed method outperforms conventional methods in terms of image manipulation precision.
引用
收藏
页码:42534 / 42545
页数:12
相关论文
共 50 条
  • [21] StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation
    Kocasari, Umut
    Dirik, Alara
    Tiftikci, Mert
    Yanardag, Pinar
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3441 - 3450
  • [22] CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation
    Xu, Sihan
    Ma, Ziqiao
    Huang, Yidong
    Lee, Honglak
    Chai, Joyce
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] CLIPVG: Text-Guided Image Manipulation Using Differentiable Vector Graphics
    Song, Yiren
    Shao, Xuning
    Chen, Kang
    Zhang, Weidong
    Jing, Zhongliang
    Li, Minzhe
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2312 - 2320
  • [24] TEXT TO IMAGE SYNTHESIS WITH BIDIRECTIONAL GENERATIVE ADVERSARIAL NETWORK
    Wang, Zixu
    Quan, Zhe
    Wang, Zhi-Jie
    Hu, Xinjian
    Chen, Yangyang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [25] Lung image segmentation via generative adversarial networks
    Cai, Jiaxin
    Zhu, Hongfeng
    Liu, Siyu
    Qi, Yang
    Chen, Rongshang
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [26] Text-guided Unsupervised Latent Transformation for Multi-attribute Image Manipulation
    Wei, Xiwen
    Xu, Zhen
    Liu, Cheng
    Wu, Si
    Yu, Zhiwen
    Wong, Hau San
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19285 - 19294
  • [27] DE-net: Dynamic Text-Guided Image Editing Adversarial Networks
    Tao, Ming
    Bao, Bing-Kun
    Tang, Hao
    Wu, Fei
    Wei, Longhui
    Tian, Qi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9971 - 9979
  • [28] Generative adversarial network based on semantic consistency for text-to-image generation
    Yue Ma
    Li Liu
    Huaxiang Zhang
    Chunjing Wang
    Zekang Wang
    Applied Intelligence, 2023, 53 : 4703 - 4716
  • [29] A survey on generative adversarial network-based text-to-image synthesis
    Zhou, Rui
    Jiang, Cong
    Xu, Qingyang
    NEUROCOMPUTING, 2021, 451 : 316 - 336
  • [30] Generative adversarial network based on semantic consistency for text-to-image generation
    Ma, Yue
    Liu, Li
    Zhang, Huaxiang
    Wang, Chunjing
    Wang, Zekang
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4703 - 4716