Swin-GAN: generative adversarial network based on shifted windows transformer architecture for image generation

被引:5
|
作者
Wang, Shibin [1 ]
Gao, Zidiao [1 ]
Liu, Dong [1 ]
机构
[1] Henan Normal Univ, Sch Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
来源
VISUAL COMPUTER | 2023年 / 39卷 / 12期
基金
中国国家自然科学基金;
关键词
GAN; Transformer; Self-attention; Image generation;
D O I
10.1007/s00371-022-02714-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
It is well known that every successful generative adversarial network (GAN) relies on the convolutional neural networks (CNN)-based generators and discriminators. However, CNN cannot process the long-range dependencies because its convolution operator has a local receptive field, which can bring some issues to GAN, such as the optimization, the loss of feature resolution and the fine details. To meet the problem of long-term dependence, we propose a GAN model based on shifted windows Transformer architecture, called Swin-GAN, in which the CNN architecture is replaced by Transformer. In our model, we build a memory-friendly generator based on the shifted window attention mechanism to gradually increase the resolution of feature maps at each stage. Another, we build a multi-scale discriminator to split the image into patches of different sizes as the input at different stages, which can achieve the balance between capturing global contextual semantic information and local detailed features. To further improve the fidelity and stability, we use the techniques such as data enhancement, layer normalization and relative position coding in our model. Compared with the current schemes, the experimental results show that our scheme has better performance, fewer parameters and lower computational cost. Specifically, Params value of Swin-GAN model is 30.254M, and Floating-Point Operations Per Second (FLOPs) value is 4.086G. Inception Score (IS) is 9.04 and Frechet Inception Distance (FID) is 9.23 in CIFAR-10.
引用
收藏
页码:6085 / 6095
页数:11
相关论文
共 50 条
  • [21] Multistage Evolutionary Generative Adversarial Network for Image Generation
    Zhang, Xiu
    Sun, Baiwei
    Zhang, Xin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 5483 - 5492
  • [22] TTS-GAN: A Transformer-Based Time-Series Generative Adversarial Network
    Li, Xiaomin
    Metsis, Vangelis
    Wang, Huangyingrui
    Ngu, Anne Hee Hiong
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2022, 2022, 13263 : 133 - 143
  • [23] Transformer Based Generative Adversarial Network for Liver Segmentation
    Demir, Ugur
    Zhang, Zheyuan
    Wang, Bin
    Antalek, Matthew
    Keles, Elif
    Jha, Debesh
    Borhani, Amir
    Ladner, Daniela
    Bagci, Ulas
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022 WORKSHOPS, PT II, 2022, 13374 : 340 - 347
  • [24] ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing
    Lin, Chen-Hsuan
    Yumer, Ersin
    Wang, Oliver
    Shechtman, Eli
    Lucey, Simon
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9455 - 9464
  • [25] Window-based transformer generative adversarial network for autonomous underwater image enhancement
    Ummar, Mehnaz
    Dharejo, Fayaz Ali
    Alawode, Basit
    Mahbub, Taslim
    Piran, Md. Jalil
    Javed, Sajid
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [26] MBD-GAN: Model-based image deblurring with a generative adversarial network
    Song, Li
    Lam, Edmund Y.
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7306 - 7313
  • [27] Image Dehazing Based on Generative Adversarial Network
    Huang S.
    Wang B.
    Li H.
    Yang Y.
    Hu W.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (11): : 990 - 1003
  • [28] Image Demosaicing Based on Generative Adversarial Network
    Luo, Jingrui
    Wang, Jie
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [29] Image Generation with Global Photographic Aesthetic Based on Disentangled Generative Adversarial Network
    Zhang, Hua
    Wang, Muwei
    Zhang, Lingjun
    Wu, Yifan
    Luo, Yizhang
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [30] Presentation Attack Face Image Generation Based on a Deep Generative Adversarial Network
    Dat Tien Nguyen
    Tuyen Danh Pham
    Batchuluun, Ganbayar
    Noh, Kyoung Jun
    Park, Kang Ryoung
    SENSORS, 2020, 20 (07) : 1 - 25