Contrastive learning for unsupervised image-to-image translation

被引:2
|
作者
Lee, Hanbit [1 ]
Seol, Jinseok [2 ]
Lee, Sang-goo [2 ]
Park, Jaehui [3 ]
Shim, Junho [4 ]
机构
[1] SK Telecom, AIX Ctr, Seongnam, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[3] Univ Seoul, Dept Stat, Seoul, South Korea
[4] Sookmyung Womens Univ, Dept Comp Sci, Seoul, South Korea
关键词
Image-to-image translation; Generative adversarial networks; Contrastive learning; Self-supervised learning; Style transfer;
D O I
10.1016/j.asoc.2023.111170
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image -to-image translation (I2I) aims to learn a mapping function to transform images into different styles or domains while preserving their key structures. Typically, I2I models require manually defined image domains as a training set to learn the visual differences among the image domains and achieve the ability to translate images across them. However, constructing such multi-domain datasets on a large scale requires expensive data collection and annotation processes. Moreover, if the target domain changes or is expanded, a new dataset should be collected, and the model should be retrained. To address these challenges, this article presents a novel unsupervised I2I method that does not require manually defined image domains. The proposed method automatically learns the visual similarity between individual samples and leverages the learned similarity function to transfer a specific style or appearance across images. Therefore, the developed method does not rely on cost-intensive manual domains or unstable clustering results, leading to improved translation accuracy at minimal cost. For quantitative evaluation, we implemented a state -of -the -art I2I models and performed image transformation on the same input image using the baselines and our method. The image quality was then assessed using two quantitative metrics: Frechet inception distance (FID) and translation accuracy. The proposed method exhibited significant improvements in image quality and translation accuracy compared with the latest unsupervised I2I methods. Specifically, the developed technique achieved a 25% and 19% improvement over the best-performing unsupervised baseline in terms of FID and translation accuracy, respectively. Furthermore, this approach demonstrated performance nearly comparable to those of supervised learning-based methods trained using manually collected and constructed domains.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Memory-guided Unsupervised Image-to-image Translation
    Jeong, Somi
    Kim, Youngjung
    Lee, Eungbean
    Sohn, Kwanghoon
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6554 - 6563
  • [22] Unsupervised Attention-guided Image-to-Image Translation
    Mejjati, Youssef A.
    Richardt, Christian
    Tompkin, James
    Cosker, Darren
    Kim, Kwang In
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [23] Deliberation Learning for Image-to-Image Translation
    He, Tianyu
    Xia, Yingce
    Lin, Jianxin
    Tan, Xu
    He, Di
    Qin, Tao
    Chen, Zhibo
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2484 - 2490
  • [24] Edge-guided Adversarial Network Based on Contrastive Learning for Image-to-Image Translation
    Zhu, Chen
    Lai, Ru
    Bi, Luzheng
    Wang, Xuyang
    Du, Jiarong
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7949 - 7954
  • [25] Multi-Domain Image-to-Image Translation with Cross-Granularity Contrastive Learning
    Fu, Huiyuan
    Liu, Jin
    Yu, Ting
    Wang, Xin
    Ma, Huadong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [26] Multi-attention bidirectional contrastive learning method for unpaired image-to-image translation
    Yang, Benchen
    Liu, Xuzhao
    Li, Yize
    Jin, Haibo
    Qu, Yetian
    [J]. PLOS ONE, 2024, 19 (04):
  • [27] Spectral normalization and dual contrastive regularization for image-to-image translation
    Zhao, Chen
    Cai, Wei-Ling
    Yuan, Zheng
    [J]. VISUAL COMPUTER, 2024,
  • [28] Unsupervised Image-to-Image Translation with Self-Attention Networks
    Kang, Taewon
    Lee, Kwang Hee
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 102 - 108
  • [29] Learning Unsupervised Cross-domain Image-to-Image Translation using a Shared Discriminator
    Kumar, Rajiv
    Dabral, Rishabh
    Sivakumar, G.
    [J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 256 - 264
  • [30] SUNIT: multimodal unsupervised image-to-image translation with shared encoder
    Lin, Liyuan
    Ji, Shulin
    Zhou, Yuan
    Zhang, Shun
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)