Contrastive learning for unsupervised image-to-image translation

被引:2
|
作者
Lee, Hanbit [1 ]
Seol, Jinseok [2 ]
Lee, Sang-goo [2 ]
Park, Jaehui [3 ]
Shim, Junho [4 ]
机构
[1] SK Telecom, AIX Ctr, Seongnam, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[3] Univ Seoul, Dept Stat, Seoul, South Korea
[4] Sookmyung Womens Univ, Dept Comp Sci, Seoul, South Korea
关键词
Image-to-image translation; Generative adversarial networks; Contrastive learning; Self-supervised learning; Style transfer;
D O I
10.1016/j.asoc.2023.111170
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image -to-image translation (I2I) aims to learn a mapping function to transform images into different styles or domains while preserving their key structures. Typically, I2I models require manually defined image domains as a training set to learn the visual differences among the image domains and achieve the ability to translate images across them. However, constructing such multi-domain datasets on a large scale requires expensive data collection and annotation processes. Moreover, if the target domain changes or is expanded, a new dataset should be collected, and the model should be retrained. To address these challenges, this article presents a novel unsupervised I2I method that does not require manually defined image domains. The proposed method automatically learns the visual similarity between individual samples and leverages the learned similarity function to transfer a specific style or appearance across images. Therefore, the developed method does not rely on cost-intensive manual domains or unstable clustering results, leading to improved translation accuracy at minimal cost. For quantitative evaluation, we implemented a state -of -the -art I2I models and performed image transformation on the same input image using the baselines and our method. The image quality was then assessed using two quantitative metrics: Frechet inception distance (FID) and translation accuracy. The proposed method exhibited significant improvements in image quality and translation accuracy compared with the latest unsupervised I2I methods. Specifically, the developed technique achieved a 25% and 19% improvement over the best-performing unsupervised baseline in terms of FID and translation accuracy, respectively. Furthermore, this approach demonstrated performance nearly comparable to those of supervised learning-based methods trained using manually collected and constructed domains.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Unaligned Image-to-Image Translation by Learning to Reweight
    Xie, Shaoan
    Gong, Mingming
    Xu, Yanwu
    Zhang, Kun
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14154 - 14164
  • [42] Underwater dam crack image generation based on unsupervised image-to-image translation
    Huang, Ben
    Kang, Fei
    Li, Xinyu
    Zhu, Sisi
    [J]. AUTOMATION IN CONSTRUCTION, 2024, 163
  • [43] Object-stable unsupervised dual contrastive learning image-to-image translation with query-selected attention and convolutional block attention module
    Oh, Yunseok
    Oh, Seonhye
    Noh, Sangwoo
    Kim, Hangyu
    Seo, Hyeon
    [J]. PLOS ONE, 2023, 18 (11):
  • [44] Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation
    Liu, Yahui
    Sangineto, Enver
    Chen, Yajing
    Bao, Linchao
    Zhang, Haoxian
    Sebe, Nicu
    Lepri, Bruno
    Wang, Wei
    De Nadai, Marco
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10780 - 10789
  • [45] UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION VIA FAIR REPRESENTATI ON OF GENDER BIAS
    Hwang, Sunhee
    Byun, Hyeran
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1953 - 1957
  • [46] Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation
    Saxena, Divya
    Kulshrestha, Tarun
    Cao, Jiannong
    Cheung, Shing-Chi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1601 - 1612
  • [47] Allowing Supervision in Unsupervised Deformable- Instances Image-to-Image Translation
    Liu, Yu
    Su, Sitong
    Zhu, Junchen
    Zheng, Feng
    Gao, Lianli
    Song, Jingkuan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5335 - 5349
  • [48] Unsupervised image-to-image translation by semantics consistency and self-attention
    Zhibin Zhang
    Wanli Xue
    Guokai Fu
    [J]. Optoelectronics Letters, 2022, 18 : 175 - 180
  • [49] SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation
    Shao, Xuning
    Zhang, Weidong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6526 - 6535
  • [50] Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
    Gomez, Raul
    Liu, Yahui
    De Nadai, Marco
    Karatzas, Dimosthenis
    Lepri, Bruno
    Sebe, Nicu
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3164 - 3172