Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder

被引:0
|
作者
Cho, Jaehyeong [1 ]
Shimoda, Wataru [1 ]
Yanai, Keiji [1 ]
机构
[1] Univ Electrocommun, Dept Informat, Tokyo, Japan
关键词
D O I
10.1109/ICPR48806.2021.9412647
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the advances in Generative Adversarial Networks (GANs) have shown impressive results for image generation and translation tasks. In particular, the image-toimage translation is a method of learning mapping from a source domain to a target domain and synthesizing an image. Image-toimage translation can be applied to a variety of tasks, making it possible to quickly and easily synthesize realistic images from semantic segmentation masks. However, in the existing image-toimage translation method, there is a limitation on controlling the style of the translated image, and it is not easy to synthesize an image by controlling the style of each mask element in detail. Therefore, we propose an image synthesis method that controls the style of each element by improving the existing image-to-image translation method. In the proposed method, we implement a mask style encoder that extracts style features for each mask element. The extracted style features are concatenated to the semantic mask in the normalization layer, and used the style-controlled image synthesis of each mask element. In the experiments, we performed style-controlled images synthesis using the datasets consisting of semantic segmentation masks and real images. The results show that the proposed method has excellent performance for style-controlled images synthesis for each element.
引用
收藏
页码:5176 / 5183
页数:8
相关论文
共 50 条
  • [31] Unsupervised training of neural mask-based beamforming
    Drude, Lukas
    Heymann, Jahn
    Haeb-Umbach, Reinhold
    INTERSPEECH 2019, 2019, : 1253 - 1257
  • [32] Phase Mask-Based Multimodal Superresolution Microscopy
    Beams, Ryan
    Woodcock, Jeremiah W.
    Gilman, Jeffrey W.
    Stranick, Stephan J.
    PHOTONICS, 2017, 4 (03)
  • [33] Phase Mask-Based Multimodal Superresolution Microscopy
    Beams, Ryan
    Stranick, Stephan J.
    2016 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2016,
  • [34] ENHANCEMENT OF CODED SPEECH USING A MASK-BASED POST-FILTER
    Korse, Srikanth
    Gupta, Kishan
    Fuchs, Guillaume
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6764 - 6768
  • [35] LENSLESS 3D IMAGING USING MASK-BASED CAMERAS
    Asif, M. Salman
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6498 - 6502
  • [36] Is the Ideal Ratio Mask Really the Best? - Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers
    Hiroe, Atsuo
    Itoyama, Katsutoshi
    Nakadai, Kazuhiro
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1843 - 1850
  • [37] SHARED TRANSFORMER ENCODER WITH MASK-BASED 3D MODEL ESTIMATION FOR CONTAINER MASS ESTIMATION
    Matsubara, Tomoya
    Otsuki, Seitaro
    Wada, Yuiga
    Matsuo, Haruka
    Komatsu, Takumi
    Iioka, Yui
    Sugiura, Komei
    Saito, Hideo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9142 - 9146
  • [38] Mask Transformer: Unpaired Text Style Transfer Based on Masked Language
    Wu, Chunhua
    Chen, Xiaolong
    Li, Xingbiao
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [39] USING SEPARATE LOSSES FOR SPEECH AND NOISE IN MASK-BASED SPEECH ENHANCEMENT
    Xu, Ziyi
    Elshamy, Samy
    Fingscheidt, Tim
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7519 - 7523
  • [40] Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss
    Ohaga, Shunya
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,