Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder

被引:0
|
作者
Cho, Jaehyeong [1 ]
Shimoda, Wataru [1 ]
Yanai, Keiji [1 ]
机构
[1] Univ Electrocommun, Dept Informat, Tokyo, Japan
关键词
D O I
10.1109/ICPR48806.2021.9412647
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the advances in Generative Adversarial Networks (GANs) have shown impressive results for image generation and translation tasks. In particular, the image-toimage translation is a method of learning mapping from a source domain to a target domain and synthesizing an image. Image-toimage translation can be applied to a variety of tasks, making it possible to quickly and easily synthesize realistic images from semantic segmentation masks. However, in the existing image-toimage translation method, there is a limitation on controlling the style of the translated image, and it is not easy to synthesize an image by controlling the style of each mask element in detail. Therefore, we propose an image synthesis method that controls the style of each element by improving the existing image-to-image translation method. In the proposed method, we implement a mask style encoder that extracts style features for each mask element. The extracted style features are concatenated to the semantic mask in the normalization layer, and used the style-controlled image synthesis of each mask element. In the experiments, we performed style-controlled images synthesis using the datasets consisting of semantic segmentation masks and real images. The results show that the proposed method has excellent performance for style-controlled images synthesis for each element.
引用
收藏
页码:5176 / 5183
页数:8
相关论文
共 50 条
  • [1] Style-Controlled Synthesis of Clothing Segments for Fashion Image Manipulation
    Kim, Bo-Kyeong
    Kim, Geonmin
    Lee, Soo-Young
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (02) : 298 - 310
  • [2] STYLE-CONTROLLED WILTING OF FLOWER
    GILISSEN, LJW
    PLANTA, 1977, 133 (03) : 275 - 280
  • [3] Style-Controlled Image Synthesis of Concrete Damages Based on Fusion of Convolutional Encoder and Attention-Enhanced Conditional Generative Adversarial Network
    Li, Shengyuan
    Le, Yushan
    Zhao, Xuefeng
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2024, 38 (06)
  • [4] Using Mask-Based Enhancement and Feature Aggregation for Single Image Deraining
    Qin, Shengdi
    Zhang, Shunli
    Zhang, Yu
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 828 - 832
  • [5] On mask-based image set desensitization with recognition support
    Li, Qilong
    Liu, Ji
    Sun, Yifan
    Zhang, Chongsheng
    Dou, Dejing
    APPLIED INTELLIGENCE, 2024, 54 (01) : 886 - 898
  • [6] Image reconstruction with transformer for mask-based lensless imaging
    Pan, Xiuxi
    Chen, Xiao
    Takeyama, Saori
    Yamaguchi, Masahiro
    OPTICS LETTERS, 2022, 47 (07) : 1843 - 1846
  • [7] IMAGE AND DEPTH ESTIMATION WITH MASK-BASED LENSLESS CAMERAS
    Zheng, Yucheng
    Asif, M. Salman
    2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 91 - 95
  • [8] On mask-based image set desensitization with recognition support
    Qilong Li
    Ji Liu
    Yifan Sun
    Chongsheng Zhang
    Dejing Dou
    Applied Intelligence, 2024, 54 : 886 - 898
  • [9] MASK-BASED MICROSPHERE PHOTOLITHOGRAPHY
    Qu, Chuang
    Kinzel, Edward C.
    PROCEEDINGS OF THE ASME 13TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE, 2018, VOL 4, 2018,
  • [10] New style for nasopharyngeal swab with a mask: image-evaluation
    Takahashi, Kazuomi
    Okachi, Shotaro
    Yasui, Hirotoshi
    Taki, Shunichi
    Ito, Takayasu
    Fukatsu, Noriaki
    Sato, Kazuhide
    INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2021, 109 : 112 - 113