Unsupervised Image Translation Using Multi-Scale Residual GAN

被引:1
|
作者
Zhang, Yifei [1 ]
Li, Weipeng [1 ]
Wang, Daling [1 ]
Feng, Shi [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China
基金
中国国家自然科学基金;
关键词
image translation; generative adversarial network; unsupervised learning; object migration; multi-scale residual network;
D O I
10.3390/math10224347
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Image translation is a classic problem of image processing and computer vision for transforming an image from one domain to another by learning the mapping between an input image and an output image. A novel Multi-scale Residual Generative Adversarial Network (MRGAN) based on unsupervised learning is proposed in this paper for transforming images between different domains using unpaired data. In the model, a dual generater architecture is used to eliminate the dependence on paired training samples and introduce a multi-scale layered residual network in generators for reducing semantic loss of images in the process of encoding. The Wasserstein GAN architecture with gradient penalty (WGAN-GP) is employed in the discriminator to optimize the training process and speed up the network convergence. Comparative experiments on several image translation tasks over style transfers and object migrations show that the proposed MRGAN outperforms strong baseline models by large margins.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Multi-scale semantic image inpainting with residual learning and GAN
    Jiao, Libin
    Wu, Hao
    Wang, Haodi
    Bie, Rongfang
    NEUROCOMPUTING, 2019, 331 : 199 - 212
  • [2] Image-to-image translation using an offset-based multi-scale codes GAN encoder
    Guo, Zihao
    Shao, Mingwen
    Li, Shunhang
    VISUAL COMPUTER, 2023,
  • [3] Image-to-image translation using an offset-based multi-scale codes GAN encoder
    Zihao Guo
    Mingwen Shao
    Shunhang Li
    The Visual Computer, 2024, 40 (2) : 699 - 715
  • [4] Multi-scale GAN with residual image learning for removing heterogeneous blur
    Khan, Rayyan Azam
    Luo, Yigang
    Wu, Fang-Xiang
    IET IMAGE PROCESSING, 2022, 16 (09) : 2412 - 2431
  • [5] Self-supervised multi-scale semantic consistency regularization for unsupervised image-to-image translation
    Zhang, Heng
    Yang, Yi-Jun
    Zeng, Wei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [6] Unsupervised image segmentation evaluation and refinement using a multi-scale approach
    Johnson, Brian
    Xie, Zhixiao
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2011, 66 (04) : 473 - 483
  • [7] Image steganalysis with multi-scale residual network
    Hao Chen
    Qi Han
    Qiong Li
    Xiaojun Tong
    Multimedia Tools and Applications, 2023, 82 : 22009 - 22031
  • [8] Image steganalysis with multi-scale residual network
    Chen, Hao
    Han, Qi
    Li, Qiong
    Tong, Xiaojun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (14) : 22009 - 22031
  • [9] MULTI-SCALE RESIDUAL NETWORK FOR IMAGE CLASSIFICATION
    Zhong, Xian
    Gong, Oubo
    Huang, Wenxin
    Yuan, Jingling
    Ma, Bo
    Li, Ryan Wen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2023 - 2027
  • [10] Unsupervised segmentation of noisy image in a multi-scale framework
    Zhang, YB
    Ma, S
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 905 - 909