Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder

被引:0
|
作者
Cho, Jaehyeong [1 ]
Shimoda, Wataru [1 ]
Yanai, Keiji [1 ]
机构
[1] Univ Electrocommun, Dept Informat, Tokyo, Japan
关键词
D O I
10.1109/ICPR48806.2021.9412647
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the advances in Generative Adversarial Networks (GANs) have shown impressive results for image generation and translation tasks. In particular, the image-toimage translation is a method of learning mapping from a source domain to a target domain and synthesizing an image. Image-toimage translation can be applied to a variety of tasks, making it possible to quickly and easily synthesize realistic images from semantic segmentation masks. However, in the existing image-toimage translation method, there is a limitation on controlling the style of the translated image, and it is not easy to synthesize an image by controlling the style of each mask element in detail. Therefore, we propose an image synthesis method that controls the style of each element by improving the existing image-to-image translation method. In the proposed method, we implement a mask style encoder that extracts style features for each mask element. The extracted style features are concatenated to the semantic mask in the normalization layer, and used the style-controlled image synthesis of each mask element. In the experiments, we performed style-controlled images synthesis using the datasets consisting of semantic segmentation masks and real images. The results show that the proposed method has excellent performance for style-controlled images synthesis for each element.
引用
收藏
页码:5176 / 5183
页数:8
相关论文
共 50 条
  • [21] DeepLIR: Attention-based approach for Mask-Based Lensless Image Reconstruction
    Poudel, Arpan
    Nakarmi, Ukash
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 431 - 439
  • [22] Image Style Transfering Based on StarGAN and Class Encoder
    Xu X.-Z.
    Chang J.-Y.
    Ding S.-F.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1516 - 1526
  • [23] Moving Object Tracking Using Symmetric Mask-Based Scheme
    Hsia, Chih-Hsien
    Huang, Ding-Wei
    Chiang, Jen-Shiun
    Wu, Zong-Jheng
    FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 173 - 176
  • [24] Improvement of Mask-Based Speech Source Separation Using DNN
    Zhan, Ge
    Huang, Zhaoqiong
    Ying, Dongwen
    Pan, Jielin
    Yan, Yonghong
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [25] Toward Depth hstimation Using Mask-Based Lensless Cameras
    Asif, M. Salman
    2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 1467 - 1470
  • [26] Mask-based denoising scheme for ghost imaging
    周阳
    郭树旭
    钟菲
    张天
    Chinese Physics B, 2019, 28 (08) : 152 - 159
  • [27] A Mask-based enhancement method for historical documents
    Smith, Elisa H. Barney
    Darbon, Jerome
    Likforman-Sulem, Laurence
    DOCUMENT RECOGNITION AND RETRIEVAL XVIII, 2011, 7874
  • [28] Mask-based Latent Reconstruction for Reinforcement Learning
    Yu, Tao
    Zhang, Zhizheng
    Lan, Cuiling
    Lu, Yan
    Chen, Zhibo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] Mask-based denoising scheme for ghost imaging
    Zhou, Yang
    Guo, Shu-Xu
    Zhong, Fei
    Zhang, Tian
    CHINESE PHYSICS B, 2019, 28 (08)
  • [30] Beethoven's Mask and the Physiognomy of Late Style
    Fine, Abigail
    NINETEENTH CENTURY MUSIC, 2020, 43 (03): : 143 - 169