Conditional Introspective Variational Autoencoder for Image Synthesis

Cited by: 9
Authors
Zheng, Kun [1]
Cheng, Yafan [2]
Kang, Xiaojun [2]
Yao, Hong [2]
Tian, Tian [2]
Affiliations
[1] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
Source
IEEE ACCESS | 2020 / Volume 8 / Issue 08
Funding
National Natural Science Foundation of China;
Keywords
Image generation; artificial neural networks; image processing;
DOI
10.1109/ACCESS.2020.3018228
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We present a variational autoencoder (VAE) learning framework with introspective training for conditional image synthesis, and explore a conditional capsule encoder that uses class-wise mask label insertion within this framework. Our model consists only of an encoder (E), a generator (G) and a classifier (C), where E and G can be adversarially optimized, and C helps to boost conditional generation, improve authenticity and provide generation measures for E and G. A discriminator is not necessary in our framework, and its absence makes our model more concise, with fewer artifacts and pattern collapse problems. To compensate for the blurriness typical of VAE-like models, feature matching is introduced into the loss functions by means of C to offer more reasonable measures between real and synthesized images. Moreover, in consideration of the key role of E in autoencoders as well as the interesting characteristics of the capsule structure, a conditional capsule encoder is preliminarily explored in the image synthesis model. Class labels participate in conditional encoding by masking the high-level capsules of other categories, and a capsule loss for the encoder is added to facilitate conditional synthesis. Experiments on the MNIST and Fashion-MNIST data sets show that our model achieves genuine conditional synthesis with better diversity and fewer artifacts. The conditional capsule encoder also reveals interesting synthesis effects.
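The two mechanisms the abstract names, masking the high-level capsules of other categories and feature matching through C, might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' implementation: the function names `mask_capsules` and `feature_matching_loss` are hypothetical, the capsule shapes are toy values, and the mean squared distance is an assumed choice of feature-matching measure, since the abstract does not specify one.

```python
from typing import List

Capsules = List[List[float]]  # one pose vector per class capsule (toy shapes)


def mask_capsules(capsules: Capsules, label: int) -> List[float]:
    """Zero every class capsule except `label`, then flatten into a latent code.

    Hypothetical helper: the abstract only says labels mask the capsules
    of other categories; the flattening step is an assumption.
    """
    if not 0 <= label < len(capsules):
        raise ValueError("label out of range")
    masked = [
        vec if k == label else [0.0] * len(vec)
        for k, vec in enumerate(capsules)
    ]
    return [v for vec in masked for v in vec]


def feature_matching_loss(real_feats: List[float], fake_feats: List[float]) -> float:
    """Mean squared distance between classifier features of a real and a
    synthesized image (the distance measure is an assumption)."""
    if len(real_feats) != len(fake_feats):
        raise ValueError("feature vectors must have the same length")
    return sum((r - f) ** 2 for r, f in zip(real_feats, fake_feats)) / len(real_feats)


# Toy usage: three class capsules of dimension 2, conditioning on class 1.
caps = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
latent = mask_capsules(caps, label=1)
loss = feature_matching_loss([1.0, 2.0], [1.0, 4.0])
```

In a real model the capsule vectors would come from E and the feature vectors from an intermediate layer of C; plain lists are used here only to keep the sketch self-contained.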
Pages: 153905-153913
Number of pages: 9