UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation

被引:76
|
作者
Torbunov, Dmitrii [1 ]
Huang, Yi [1 ]
Yu, Haiwang [1 ]
Huang, Jin [1 ]
Yoo, Shinjae [1 ]
Lin, Meifeng [1 ]
Viren, Brett [1 ]
Ren, Yihui [1 ]
机构
[1] Brookhaven Natl Lab, Upton, NY 11973 USA
关键词
D O I
10.1109/WACV56688.2023.00077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unpaired image-to-image translation has broad applications in art, design, and scientific simulations. One early breakthrough was CycleGAN that emphasizes one-to-one mappings between two unpaired image domains via generative-adversarial networks (GAN) coupled with the cycle-consistency constraint, while more recent works promote one-to-many mapping to boost diversity of the translated images. Motivated by scientific simulation and one-to-one needs, this work revisits the classic CycleGAN framework and boosts its performance to outperform more contemporary models without relaxing the cycle-consistency constraint. To achieve this, we equip the generator with a Vision Transformer (ViT) and employ necessary training and regularization techniques. Compared to previous best-performing models, our model performs better and retains a strong correlation between the original and translated image. An accompanying ablation study shows that both the gradient penalty and self-supervised pre-training are crucial to the improvement. To promote reproducibility and open science, the source code, hyperparameter configurations, and pre-trained model are available at https: //github.com/LS4GAN/uvcgan.
引用
收藏
页码:702 / 712
页数:11
相关论文
共 50 条
  • [1] Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
    Zhu, Jun-Yan
    Park, Taesung
    Isola, Phillip
    Efros, Alexei A.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2242 - 2251
  • [2] Dynamic residual distillation Cycle-Consistent GAN for unpaired ultrasound image translation
    Jiang, Cunang
    Wang, Yaqi
    Dai, Yu
    Zhang, Jianxun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
  • [3] Trans-Cycle: Unpaired Image-to-Image Translation Network by Transformer
    Tian, Kai
    Pan, Mengze
    Lu, Zongqing
    Liao, Qingmin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 576 - 587
  • [4] Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks
    Li, Minjun
    Huang, Haozhi
    Ma, Lin
    Liu, Wei
    Zhang, Tong
    Jiang, Yugang
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 186 - 201
  • [5] An Extensive Study of Cycle-Consistent Generative Networks for Image-to-Image Translation
    Liu, Yu
    Guo, Yanming
    Chen, Wei
    Lew, Michael S.
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 219 - 224
  • [6] Asymmetric GAN for Unpaired Image-to-Image Translation
    Li, Yu
    Tang, Sheng
    Zhang, Rui
    Zhang, Yongdong
    Li, Jintao
    Yan, Shuicheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (12) : 5881 - 5896
  • [7] Image-to-image Translation Based on Improved Cycle-consistent Generative Adversarial Network
    Zhang Jinglei
    Hou Yawei
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (05) : 1216 - 1222
  • [8] Consistent Embedded GAN for Image-to-Image Translation
    Xiong, Feng
    Wang, Qianqian
    Gao, Quanxue
    IEEE ACCESS, 2019, 7 : 126651 - 126661
  • [9] Unsupervised image-to-image translation via long-short cycle-consistent adversarial networks
    Gang Wang
    Haibo Shi
    Yufei Chen
    Bin Wu
    Applied Intelligence, 2023, 53 : 17243 - 17259
  • [10] Unsupervised image-to-image translation via long-short cycle-consistent adversarial networks
    Wang, Gang
    Shi, Haibo
    Chen, Yufei
    Wu, Bin
    APPLIED INTELLIGENCE, 2023, 53 (14) : 17243 - 17259