Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation

被引:7
|
作者
Gomez, Raul [1 ]
Liu, Yahui [2 ,3 ]
De Nadai, Marco [3 ]
Karatzas, Dimosthenis [4 ]
Lepri, Bruno [3 ]
Sebe, Nicu [2 ,5 ]
机构
[1] Comp Vis Ctr, Ctr Tecnol Catalunya, Barcelona, Spain
[2] Univ Trento, Trento, Italy
[3] Fdn Bruno Kessler, Povo, Italy
[4] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona, Spain
[5] Huawei Res, Dublin, Ireland
关键词
GANs; image-to-image translation; retrieval system; unsupervised learning; SIMILARITY;
D O I
10.1145/3394171.3413785
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image to image translation aims to learn a mapping that transforms an image from one visual domain to another. Recent works assume that images descriptors can be disentangled into a domain-invariant content representation and a domain-specific style representation. Thus, translation models seek to preserve the content of source images while changing the style to a target visual domain. However, synthesizing new images is extremely challenging especially in multi-domain translations, as the network has to compose content and style to generate reliable and diverse images in multiple domains. In this paper we propose the use of an image retrieval system to assist the image-to-image translation task. First, we train an image-to-image translation model to map images to multiple domains. Then, we train an image retrieval model using real and generated images to find images similar to a query one in content but in a different domain. Finally, we exploit the image retrieval system to fine-tune the image-to-image translation model and generate higher quality images. Our experiments show the effectiveness of the proposed solution and highlight the contribution of the retrieval network, which can benefit from additional unlabeled data and help image-to-image translation models in the presence of scarce data.
引用
收藏
页码:3164 / 3172
页数:9
相关论文
共 50 条
  • [1] MULTI-DOMAIN UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION WITH APPEARANCE ADAPTIVE CONVOLUTION
    Jeong, Somi
    Lee, Jiyoung
    Sohn, Kwanghoon
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1750 - 1754
  • [2] Unsupervised multi-domain multimodal image-to-image translation with explicit domain-constrained disentanglement
    Xia, Weihao
    Yang, Yujiu
    Xue, Jing-Hao
    [J]. NEURAL NETWORKS, 2020, 131 : 50 - 63
  • [3] Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation
    Yang, Xuewen
    Xie, Dongliang
    Wang, Xin
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 374 - 382
  • [4] DMDIT: Diverse multi-domain image-to-image translation
    Shao, Mingwen
    Zhang, Youcai
    Liu, Huan
    Wang, Chao
    Li, Le
    Shao, Xun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [5] AMMUNIT: An Attention-Based Multimodal Multi-domain UNsupervised Image-to-Image Translation Framework
    Luo, Lei
    Hsu, William H.
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 358 - 370
  • [6] Multi-Domain Image-to-Image Translation with Adaptive Inference Graph
    The-Phuc Nguyen
    Lathuiliere, Stephane
    Ricci, Elisa
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5368 - 5375
  • [7] Multi-Domain Image-to-Image Translation via a Unified Circular Framework
    Wang, Yuxi
    Zhang, Zhaoxiang
    Hao, Wangli
    Song, Chunfeng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 670 - 684
  • [8] Cross-Granularity Learning for Multi-Domain Image-to-Image Translation
    Fu, Huiyuan
    Yu, Ting
    Wang, Xin
    Ma, Huadong
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3099 - 3107
  • [9] Self-attention StarGAN for Multi-domain Image-to-Image Translation
    He, Ziliang
    Yang, Zhenguo
    Mao, Xudong
    Lv, Jianming
    Li, Qing
    Liu, Wenyin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 537 - 549
  • [10] RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes
    Wu, Po-Wei
    Lin, Yu-Jing
    Chang, Che-Han
    Chang, Edward Y.
    Liao, Shih-Wei
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5913 - 5921