Multimodal and Multiclass Semi-supervised Image-to-Image Translation

被引：0

作者：

Bai, Jing ^{[1
,2
]}

Chen, Ran ^{[1
,2
]}

Ji, Hui ^{[1
,2
]}

Li, Saisai ^{[1
,2
]}

机构：

[1] North Minzu Univ, Yinchuan 750021, Ningxia, Peoples R China

[2] Ningxia Provice Key Lab Intelligent Informat & Da, Yinchuan 750021, Ningxia, Peoples R China

来源：

IMAGE AND GRAPHICS, ICIG 2019, PT III | 2019年 / 11903卷

基金：

中国国家自然科学基金;

关键词：

Image-to-image translation; Semi-supervised; Adversarial auto encoder; Adversarial learning;

D O I：

10.1007/978-3-030-34113-8_42

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we propose a multimodal and multiclass semi-supervised image-to-image translation (MM-SSIT) framework to address the dilemma between expensive labeled work and diversity requirement of image translation. A cross-domain adversarial autoencoder is proposed to learn disentangled latent domain-invariant content codes and domain-specific style codes. The style codes are matched with a prior distribution so that we can generate a series of meaningful samples from the prior space. The content codes are embedded into a multiclass joint data distribution by an adversarial learning between a domain classifier and a category classifier so that we can generate multiclass images at one time. Consequently, multimodal and multiclass cross-domain images are generated by joint decoding the latent content codes and sampled style codes. Finally, the networks for MM-SSIT framework are designed and tested. Semi-supervised experiments with comparisons to state-of-art approach show that the proposed framework has the ability to generate high-quality and diversiform images in case of fewer labeled samples. Further experiments in the unsupervised setting demonstrate that MM-SSIT is superior in learning disentangled representation and domain adaption.

引用

页码：503 / 514

页数：12

共 50 条

[1] Semi-supervised Task Aware Image-to-Image Translation
Muetze, Annika
Rottmann, Matthias
Gottschalk, Hanno
COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2023, 2024, 2103 : 98 - 122
[2] Semi-Supervised Image-to-Image Translation for Lane Detection in Rain
Wang, Jian-Gang
Wan, Kong-Wah
Pang, Chun-Ho
Yau, Wei-Yun
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 118 - 123
[3] Semi-supervised Learning for Few-shot Image-to-Image Translation
Wang, Yaxing
Khan, Salman
Gonzalez-Garcia, Abel
van de Weijer, Joost
Khan, Fahad Shahbaz
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4452 - 4461
[4] A Semi-Supervised Image-to-Image Translation Framework for SAR-Optical Image Matching
Du, Wen-Liang
Zhou, Yong
Zhu, Hancheng
Zhao, Jiaqi
Shao, Zhiwen
Tian, Xiaolin
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[5] Image-to-Image Translation on Defined Highlighting Regions by Semi-Supervised Semantic Segmentation
Chang, Ching-Yu
Ye, Chun-Ting
Wei, Tzer-Jen
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[6] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
Jiang, Yuxin
Jiang, Liming
Yang, Shuai
Loy, Chen Change
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7323 - 7333
[7] SEMI-SUPERVISED MULTIMODAL IMAGE TRANSLATION FOR MISSING MODALITY IMPUTATION
Sun, Wangbin
Ma, Fei
Li, Yang
Huang, Shao-Lun
Ni, Shiguang
Zhang, Lin
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4320 - 4324
[8] SemiStarGAN: Semi-supervised Generative Adversarial Networks for Multi-domain Image-to-Image Translation
Hsu, Shu-Yu
Yang, Chih-Yuan
Huang, Chi-Chia
Hsu, Jane Yung-jen
COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 338 - 353
[9] Toward Multimodal Image-to-Image Translation
Zhu, Jun-Yan
Zhang, Richard
Pathak, Deepak
Darrell, Trevor
Efros, Alexei A.
Wang, Oliver
Shechtman, Eli
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[10] Multimodal Unsupervised Image-to-Image Translation
Huang, Xun
Liu, Ming-Yu
Belongie, Serge
Kautz, Jan
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196

← 1 2 3 4 5 →