Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder

Cited by: 0

Authors:

Cho, Jaehyeong [1]

Shimoda, Wataru [1]

Yanai, Keiji [1]

Affiliations:

[1] Univ Electrocommun, Dept Informat, Tokyo, Japan

Source:

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021

Keywords:

DOI:

10.1109/ICPR48806.2021.9412647

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, advances in Generative Adversarial Networks (GANs) have shown impressive results for image generation and translation tasks. In particular, image-to-image translation is a method of learning a mapping from a source domain to a target domain and synthesizing an image. Image-to-image translation can be applied to a variety of tasks, making it possible to quickly and easily synthesize realistic images from semantic segmentation masks. However, existing image-to-image translation methods offer limited control over the style of the translated image, and it is not easy to synthesize an image while controlling the style of each mask element in detail. Therefore, we propose an image synthesis method that controls the style of each element by improving the existing image-to-image translation method. In the proposed method, we implement a mask style encoder that extracts style features for each mask element. The extracted style features are concatenated to the semantic mask in the normalization layer and used for style-controlled image synthesis of each mask element. In the experiments, we performed style-controlled image synthesis using datasets consisting of semantic segmentation masks and real images. The results show that the proposed method has excellent performance for style-controlled image synthesis for each element.
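The abstract describes extracting one style feature per mask element and concatenating it with the semantic mask, but gives no implementation details. The following is only a minimal NumPy sketch of that general idea under stated assumptions: region-wise average pooling as the way a style code is obtained for each element, a one-hot mask, and the hypothetical function names `region_style_codes` and `broadcast_and_concat`. None of these choices are confirmed by the abstract, and the sketch omits the encoder network and the normalization layer entirely.

```python
import numpy as np


def region_style_codes(features, mask):
    """Average-pool a feature map inside each mask region.

    features: (C, H, W) float array, standing in for an encoder output
              (hypothetical; the abstract does not specify the encoder).
    mask:     (K, H, W) one-hot semantic mask with K classes.
    Returns a (K, C) array; rows for classes absent from the mask are zero.
    """
    area = mask.sum(axis=(1, 2))                      # pixels per class, (K,)
    sums = np.einsum("khw,chw->kc", mask, features)   # per-class feature sums
    return sums / np.maximum(area, 1.0)[:, None]      # avoid division by zero


def broadcast_and_concat(codes, mask):
    """Paint each region with its style code and concatenate with the mask.

    codes: (K, C) per-class style codes.
    mask:  (K, H, W) one-hot semantic mask.
    Returns a (K + C, H, W) array: mask channels first, then style channels.
    """
    style_map = np.einsum("kc,khw->chw", codes, mask)  # (C, H, W)
    return np.concatenate([mask, style_map], axis=0)


# Toy example: two classes (top row, bottom row) and two feature channels.
features = np.array([[[1.0, 3.0], [5.0, 7.0]],
                     [[2.0, 2.0], [4.0, 4.0]]])
mask = np.array([[[1.0, 1.0], [0.0, 0.0]],
                 [[0.0, 0.0], [1.0, 1.0]]])

codes = region_style_codes(features, mask)
conditioning = broadcast_and_concat(codes, mask)
```

In this sketch, swapping a row of `codes` for a code pooled from a different reference image would change the style signal for that one mask element only, which is the kind of per-element control the abstract refers to.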


Pages: 5176-5183

Page count: 8
