Disentangling latent space better for few-shot image-to-image translation

被引:2
|
作者
Liu, Peng [1 ]
Wang, Yueyue [1 ]
Du, Angang [2 ]
Zhang, Liqiang [2 ]
Wei, Bin [4 ,5 ]
Gu, Zhaorui [2 ]
Wang, Xiaodong [3 ]
Zheng, Haiyong [2 ]
Li, Juan [6 ]
机构
[1] Ocean Univ China, Comp Ctr, Qingdao 266100, Peoples R China
[2] Ocean Univ China, Dept Elect Engn, Qingdao 266100, Peoples R China
[3] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Peoples R China
[4] Qingdao Univ, Affiliated Hosp, Qingdao 266000, Peoples R China
[5] Shandong Key Lab Digital Med & Comp Assisted Surg, Qingdao 266000, Peoples R China
[6] Qingdao Agr Univ, Coll Mech & Elect Engn, Qingdao 266000, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-to-image translation; Generative adversarial network; Latent space; Few-shot learning; Disentanglement;
D O I
10.1007/s13042-022-01552-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an unpaired image-to-image translation, the main concept is to learn an underlying mapping between the source and target domains. Previous approaches required large numbers of data from both domains to learn this mapping. However, under a few-shot condition, that is, few-shot image-to-image translation, only one domain can meet the required number of data , and thus, the underlying mapping becomes ill-conditioned owing to the limited data as well as the imbalanced distribution of the two domains. We argue that a powerful model with a better disentangled representation of the latent space can better tackle the more challenging few-shot image-to-image translation . Motivated by this, under a partially-shared assumption, we propose a better disentanglement of the content and style latent space using a domain-specific style latent classifier and a domain-shared cross-content latent discriminator. Moreover, we design asymmetric weak/strong domain discriminators to achieve a better translation performance with limited data within the few-shot domain. Furthermore, our method can be easily embedded into any latent space disentangled model of an image-to-image translation for a few-shot setting. Subjective evaluation and objective evaluation results both show that compared with other state-of-the-art methods, the images synthesized by our method have higher fidelity while maintaining certain diversity.
引用
收藏
页码:419 / 427
页数:9
相关论文
共 50 条
  • [41] Few-Shot Image Recognition with Knowledge Transfer
    Peng, Zhimao
    Li, Zechao
    Zhang, Junge
    Li, Yan
    Qi, Guo-Jun
    Tang, Jinhui
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 441 - 449
  • [42] Few-Shot Learning for Medical Image Classification
    Cai, Aihua
    Hu, Wenxin
    Zheng, Jun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 441 - 452
  • [43] Few-shot Fish Image Generation and Classification
    Guo, Zonghui
    Zhang, Liqiang
    Jiang, Yufeng
    Niu, Wenjie
    Gu, Zhaorui
    Zheng, Haiyong
    Wang, Guoyu
    Zheng, Bing
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [44] Dataset Bias in Few-Shot Image Recognition
    Jiang, Shuqiang
    Zhu, Yaohui
    Liu, Chenlong
    Song, Xinhang
    Li, Xiangyang
    Min, Weiqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 229 - 246
  • [45] Few-Shot Hash Learning for Image Retrieval
    Wang, Yu-Xiong
    Gui, Liangke
    Hebert, Martial
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1228 - 1237
  • [46] FACE AGING AS IMAGE-TO-IMAGE TRANSLATION USING SHARED-LATENT SPACE GENERATIVE ADVERSARIAL NETWORKS
    Pantraki, Evangelia
    Kotropoulos, Constantine
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 306 - 310
  • [47] Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization
    Zheng, Chenxi
    Liu, Bangzhen
    Zhang, Huaidong
    Xu, Xuemiao
    He, Shengfeng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3272 - 3281
  • [48] UNPAIRED IMAGE-TO-IMAGE TRANSLATION FROM SHARED DEEP SPACE
    Wu, Xuehui
    Shao, Jie
    Gao, Lianli
    Shen, Heng Tao
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2127 - 2131
  • [49] Mixture-based Feature Space Learning for Few-shot Image Classification
    Afrasiyabi, Arman
    Lalonde, Jean-Francois
    Gagne, Christian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9021 - 9031
  • [50] The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation
    Li, Lingxiao
    Zhang, Yi
    Wang, Shuhui
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22657 - 22667