Swin-GAN: generative adversarial network based on shifted windows transformer architecture for image generation

被引:5
|
作者
Wang, Shibin [1 ]
Gao, Zidiao [1 ]
Liu, Dong [1 ]
机构
[1] Henan Normal Univ, Sch Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
来源
VISUAL COMPUTER | 2023年 / 39卷 / 12期
基金
中国国家自然科学基金;
关键词
GAN; Transformer; Self-attention; Image generation;
D O I
10.1007/s00371-022-02714-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
It is well known that every successful generative adversarial network (GAN) relies on the convolutional neural networks (CNN)-based generators and discriminators. However, CNN cannot process the long-range dependencies because its convolution operator has a local receptive field, which can bring some issues to GAN, such as the optimization, the loss of feature resolution and the fine details. To meet the problem of long-term dependence, we propose a GAN model based on shifted windows Transformer architecture, called Swin-GAN, in which the CNN architecture is replaced by Transformer. In our model, we build a memory-friendly generator based on the shifted window attention mechanism to gradually increase the resolution of feature maps at each stage. Another, we build a multi-scale discriminator to split the image into patches of different sizes as the input at different stages, which can achieve the balance between capturing global contextual semantic information and local detailed features. To further improve the fidelity and stability, we use the techniques such as data enhancement, layer normalization and relative position coding in our model. Compared with the current schemes, the experimental results show that our scheme has better performance, fewer parameters and lower computational cost. Specifically, Params value of Swin-GAN model is 30.254M, and Floating-Point Operations Per Second (FLOPs) value is 4.086G. Inception Score (IS) is 9.04 and Frechet Inception Distance (FID) is 9.23 in CIFAR-10.
引用
收藏
页码:6085 / 6095
页数:11
相关论文
共 50 条
  • [31] Diversifying Tire-Defect Image Generation Based on Generative Adversarial Network
    Zhang, Yulong
    Wang, Yilin
    Jiang, Zhiqiang
    Liao, Fagen
    Zheng, Li
    Tan, Dongzeng
    Chen, Jinshui
    Lu, Jiangang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [32] Generative adversarial network based on semantic consistency for text-to-image generation
    Yue Ma
    Li Liu
    Huaxiang Zhang
    Chunjing Wang
    Zekang Wang
    Applied Intelligence, 2023, 53 : 4703 - 4716
  • [33] Generative adversarial network based on semantic consistency for text-to-image generation
    Ma, Yue
    Liu, Li
    Zhang, Huaxiang
    Wang, Chunjing
    Wang, Zekang
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4703 - 4716
  • [34] Spatial Transformer Generative Adversarial Network for Image Super-Resolution
    Rempakos, Pantelis
    Vrigkas, Michalis
    Plissiti, Marina E.
    Nikou, Christophoros
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 399 - 411
  • [35] TEGAN: Transformer Embedded Generative Adversarial Network for Underwater Image Enhancement
    Gao, Zhi
    Yang, Jing
    Zhang, Lu
    Jiang, Fengling
    Jiao, Xixiang
    COGNITIVE COMPUTATION, 2024, 16 (01) : 191 - 214
  • [36] Cycle Generative Adversarial Network Based on Gradient Normalization for Infrared Image Generation
    Yi, Xing
    Pan, Hao
    Zhao, Huaici
    Liu, Pengfei
    Zhang, Canyu
    Wang, Junpeng
    Wang, Hao
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [37] Thermal image generation for blast furnace chute based on generative adversarial network
    Xiaoman Cheng
    Shusen Cheng
    Signal, Image and Video Processing, 2023, 17 : 2595 - 2606
  • [38] Thermal image generation for blast furnace chute based on generative adversarial network
    Cheng, Xiaoman
    Cheng, Shusen
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) : 2595 - 2606
  • [39] TEGAN: Transformer Embedded Generative Adversarial Network for Underwater Image Enhancement
    Zhi Gao
    Jing Yang
    Lu Zhang
    Fengling Jiang
    Xixiang Jiao
    Cognitive Computation, 2024, 16 : 191 - 214
  • [40] HyperViTGAN: Semisupervised Generative Adversarial Network With Transformer for Hyperspectral Image Classification
    He, Ziping
    Xia, Kewen
    Ghamisi, Pedram
    Hu, Yuhen
    Fan, Shurui
    Zu, Baokai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 6053 - 6068