Multimodal and Multiclass Semi-supervised Image-to-Image Translation

被引:0
|
作者
Bai, Jing [1 ,2 ]
Chen, Ran [1 ,2 ]
Ji, Hui [1 ,2 ]
Li, Saisai [1 ,2 ]
机构
[1] North Minzu Univ, Yinchuan 750021, Ningxia, Peoples R China
[2] Ningxia Provice Key Lab Intelligent Informat & Da, Yinchuan 750021, Ningxia, Peoples R China
来源
IMAGE AND GRAPHICS, ICIG 2019, PT III | 2019年 / 11903卷
基金
中国国家自然科学基金;
关键词
Image-to-image translation; Semi-supervised; Adversarial auto encoder; Adversarial learning;
D O I
10.1007/978-3-030-34113-8_42
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a multimodal and multiclass semi-supervised image-to-image translation (MM-SSIT) framework to address the dilemma between expensive labeled work and diversity requirement of image translation. A cross-domain adversarial autoencoder is proposed to learn disentangled latent domain-invariant content codes and domain-specific style codes. The style codes are matched with a prior distribution so that we can generate a series of meaningful samples from the prior space. The content codes are embedded into a multiclass joint data distribution by an adversarial learning between a domain classifier and a category classifier so that we can generate multiclass images at one time. Consequently, multimodal and multiclass cross-domain images are generated by joint decoding the latent content codes and sampled style codes. Finally, the networks for MM-SSIT framework are designed and tested. Semi-supervised experiments with comparisons to state-of-art approach show that the proposed framework has the ability to generate high-quality and diversiform images in case of fewer labeled samples. Further experiments in the unsupervised setting demonstrate that MM-SSIT is superior in learning disentangled representation and domain adaption.
引用
收藏
页码:503 / 514
页数:12
相关论文
共 50 条
  • [1] Semi-supervised Task Aware Image-to-Image Translation
    Muetze, Annika
    Rottmann, Matthias
    Gottschalk, Hanno
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2023, 2024, 2103 : 98 - 122
  • [2] Semi-Supervised Image-to-Image Translation for Lane Detection in Rain
    Wang, Jian-Gang
    Wan, Kong-Wah
    Pang, Chun-Ho
    Yau, Wei-Yun
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 118 - 123
  • [3] Semi-supervised Learning for Few-shot Image-to-Image Translation
    Wang, Yaxing
    Khan, Salman
    Gonzalez-Garcia, Abel
    van de Weijer, Joost
    Khan, Fahad Shahbaz
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4452 - 4461
  • [4] A Semi-Supervised Image-to-Image Translation Framework for SAR-Optical Image Matching
    Du, Wen-Liang
    Zhou, Yong
    Zhu, Hancheng
    Zhao, Jiaqi
    Shao, Zhiwen
    Tian, Xiaolin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [5] Image-to-Image Translation on Defined Highlighting Regions by Semi-Supervised Semantic Segmentation
    Chang, Ching-Yu
    Ye, Chun-Ting
    Wei, Tzer-Jen
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
    Jiang, Yuxin
    Jiang, Liming
    Yang, Shuai
    Loy, Chen Change
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7323 - 7333
  • [7] SEMI-SUPERVISED MULTIMODAL IMAGE TRANSLATION FOR MISSING MODALITY IMPUTATION
    Sun, Wangbin
    Ma, Fei
    Li, Yang
    Huang, Shao-Lun
    Ni, Shiguang
    Zhang, Lin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4320 - 4324
  • [8] SemiStarGAN: Semi-supervised Generative Adversarial Networks for Multi-domain Image-to-Image Translation
    Hsu, Shu-Yu
    Yang, Chih-Yuan
    Huang, Chi-Chia
    Hsu, Jane Yung-jen
    COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 338 - 353
  • [9] Toward Multimodal Image-to-Image Translation
    Zhu, Jun-Yan
    Zhang, Richard
    Pathak, Deepak
    Darrell, Trevor
    Efros, Alexei A.
    Wang, Oliver
    Shechtman, Eli
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [10] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196