Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation

被引:5
|
作者
Kwon, Gihyun [1 ]
Ye, Jong Chul [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Grad Sch AI, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/ICCV48922.2021.01372
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the important research topics in image generative models is to disentangle the spatial contents and styles for their separate control. Although StyleGAN can generate content feature vectors from random noises, the resulting spatial content control is primarily intended for minor spatial variations, and the disentanglement of global content and styles is by no means complete. Inspired by a mathematical understanding of normalization and attention, here we present a novel hierarchical adaptive Diagonal spatial ATtention (DAT) layers to separately manipulate the spatial contents from styles in a hierarchical manner. Using DAT and AdaIN, our method enables coarse-to-fine level disentanglement of spatial contents and styles. In addition, our generator can be easily integrated into the GAN inversion framework so that the content and style of translated images from multi-domain image translation tasks can be flexibly controlled. By using various datasets, we confirm that the proposed method not only outperforms the existing models in disentanglement scores, but also provides more flexible control over spatial features in the generated images.
引用
收藏
页码:13960 / 13969
页数:10
相关论文
共 50 条
  • [41] Brain tumor image generation using an aggregation of GAN models with style transfer
    Mukherkjee, Debadyuti
    Saha, Pritam
    Kaplun, Dmitry
    Sinitca, Aleksandr
    Sarkar, Ram
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [42] ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation
    Liu, Yahui
    Chen, Yajing
    Bao, Linchao
    Sebe, Nicu
    Lepri, Bruno
    De Nadai, Marco
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3343 - 3353
  • [43] Font Style that Fits an Image - Font Generation Based on Image Context
    Miyazono, Taiga
    Iwana, Brian Kenji
    Haraguchi, Daichi
    Uchida, Seiichi
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 569 - 584
  • [44] Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion
    Lan, Yushi
    Meng, Xuyi
    Yang, Shuai
    Loy, Chen Change
    Dai, Bo
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 20940 - 20949
  • [45] Reconstruction of three-dimension digital rock guided by prior information with a combination of InfoGAN and style-based GAN
    Cao, Danping
    Hou, Zhiyu
    Liu, Qiang
    Fu, Feiqi
    [J]. JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 208
  • [46] The Application of Image Style Transformation Based on GAN in the Intelligent Mobile Terminal
    Dong, Huazhi
    Qian, Yushan
    Wang, Linlin
    Lu, Pei
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 314 - 318
  • [47] CBA-GAN: Cartoonization style transformation based on the convolutional attention module
    Zhang, Feng
    Zhao, Huihuang
    Li, Yuhua
    Wu, Yichun
    Sun, Xianfang
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
  • [48] MISS GAN: A Multi-IlluStrator style generative adversarial network for image to illustration translation
    Barzilay, Noa
    Shalev, Tal Berkovitz
    Giryes, Raja
    [J]. PATTERN RECOGNITION LETTERS, 2021, 151 : 140 - 147
  • [49] Cross Attention Based Style Distribution for Controllable Person Image Synthesis
    Zhou, Xinyue
    Yin, Mingyu
    Chen, Xinyuan
    Sun, Li
    Gao, Changxin
    Li, Qingli
    [J]. COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 161 - 178
  • [50] Digital printing image generation method based on style transfer
    Su, Zebin
    Zhao, Siyuan
    Zhang, Huanhuan
    Li, Pengfei
    Lu, Yanjun
    [J]. TEXTILE RESEARCH JOURNAL, 2023, 93 (23-24) : 5211 - 5223