Self-Attention Generative Adversarial Networks

被引:0
|
作者
Zhang, Han [1 ,2 ]
Goodfellow, Ian [2 ]
Metaxas, Dimitris [1 ]
Odena, Augustus [2 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08854 USA
[2] Google Res, Brain Team, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose the Self-Attention Generative Adversarial Network (SAGAN) which allows attention-driven, long-range dependency modeling for image generation tasks. Traditional convolutional GANs generate high-resolution details as a function of only spatially local points in lower-resolution feature maps. In SAGAN, details can be generated using cues from all feature locations. Moreover, the discriminator can check that highly detailed features in distant portions of the image are consistent with each other. Furthermore, recent work has shown that generator conditioning affects GAN performance. Leveraging this insight, we apply spectral normalization to the GAN generator and find that this improves training dynamics. The proposed SAGAN performs better than prior work(1), boosting the best published Inception score from 36.8 to 52.52 and reducing Frechet Inception distance from 27.62 to 18.65 on the challenging ImageNet dataset. Visualization of the attention layers shows that the generator leverages neighborhoods that correspond to object shapes rather than local regions of fixed shape.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Self-attention and generative adversarial networks for algae monitoring
    Nhut Hai Huynh
    Boer, Gordon
    Schramm, Hauke
    [J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2022, 55 (01) : 10 - 22
  • [2] PEGANs: Phased Evolutionary Generative Adversarial Networks with Self-Attention Module
    Xue, Yu
    Tong, Weinan
    Neri, Ferrante
    Zhang, Yixia
    [J]. MATHEMATICS, 2022, 10 (15)
  • [3] Self-attention generative adversarial networks applied to conditional music generation
    Pedro Lucas Tomaz Neves
    José Fornari
    João Batista Florindo
    [J]. Multimedia Tools and Applications, 2022, 81 : 24419 - 24430
  • [4] Self-attention generative adversarial networks applied to conditional music generation
    Tomaz Neves, Pedro Lucas
    Fornari, Jose
    Florindo, Joao Batista
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24419 - 24430
  • [5] Improving Human Pose Estimation With Self-Attention Generative Adversarial Networks
    Wang, Xiangyang
    Cao, Zhongzheng
    Wang, Rui
    Liu, Zhi
    Zhu, Xiaoqiang
    [J]. IEEE ACCESS, 2019, 7 : 119668 - 119680
  • [6] IMPROVING HUMAN POSE ESTIMATION WITH SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORKS
    Cao, Zhongzheng
    Wang, Rui
    Wang, Xiangyang
    Liu, Zhi
    Zhu, Xiaoqiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 567 - 572
  • [7] A Self-Attention Based Wasserstein Generative Adversarial Networks for Single Image Inpainting
    Mao, Yuanxin
    Zhang, Tianzhuang
    Fu, Bo
    Thanh, Dang N. H.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (03) : 591 - 599
  • [8] A Self-Attention Based Wasserstein Generative Adversarial Networks for Single Image Inpainting
    Yuanxin Mao
    Tianzhuang Zhang
    Bo Fu
    Dang N. H. Thanh
    [J]. Pattern Recognition and Image Analysis, 2022, 32 : 591 - 599
  • [9] SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR SPEECH ENHANCEMENT
    Huy Phan
    Nguyen, Huy Le
    Chen, Oliver Y.
    Koch, Philipp
    Duong, Ngoc Q. K.
    McLoughlin, Ian
    Mertins, Alfred
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7103 - 7107
  • [10] Attentive Semantic and Perceptual Faces Completion Using Self-attention Generative Adversarial Networks
    Xiaowei Liu
    Kenli Li
    Keqin Li
    [J]. Neural Processing Letters, 2020, 51 : 211 - 229