Combining deep generative and discriminative models for Bayesian semi-supervised learning

Cited by: 34
Authors
Gordon, Jonathan [1]
Hernandez-Lobato, Jose Miguel [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge, England
Keywords
Probabilistic models; Semi-supervised learning; Variational autoencoders; Predictive uncertainty
DOI
10.1016/j.patcog.2019.107156
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Generative models can be used for a wide range of tasks, and have the appealing ability to learn from both labelled and unlabelled data. In contrast, discriminative models cannot learn from unlabelled data, but tend to outperform their generative counterparts in supervised tasks. We develop a framework to jointly train deep generative and discriminative models, enjoying the benefits of both. The framework allows models to learn from labelled and unlabelled data, as well as naturally account for uncertainty in predictive distributions, providing the first Bayesian approach to semi-supervised learning with deep generative models. We demonstrate that our blended discriminative and generative models outperform purely generative models in both predictive performance and uncertainty calibration in a number of semi-supervised learning tasks. © 2019 The Authors. Published by Elsevier Ltd.
Pages: 10
Related papers
50 in total
  • [1] Semi-supervised Learning with Deep Generative Models
    Kingma, Diederik P.
    Rezende, Danilo J.
    Mohamed, Shakir
    Welling, Max
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [2] Learning Disentangled Representations with Semi-Supervised Deep Generative Models
    Siddharth, N.
    Paige, Brooks
    van de Meent, Jan-Willem
    Desmaison, Alban
    Goodman, Noah D.
    Kohli, Pushmeet
    Wood, Frank
    Torr, Philip H. S.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [3] Regularizing Discriminative Capability of CGANs for Semi-Supervised Generative Learning
    Liu, Yi
    Deng, Guangchang
    Zeng, Xiangping
    Wu, Si
    Yu, Zhiwen
    Wong, Hau-San
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5719 - 5728
  • [4] Semi-Supervised Learning from Crowds Using Deep Generative Models
    Atarashi, Kyohei
    Oyama, Satoshi
    Kurihara, Masahito
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1555 - 1562
  • [5] Deep Bayesian Active Semi-Supervised Learning
    Rottmann, Matthias
    Kahl, Karsten
    Gottschalk, Hanno
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 158 - 164
  • [6] Multimodal deep generative adversarial models for scalable doubly semi-supervised learning
    Du, Changde
    Du, Changying
    He, Huiguang
    [J]. INFORMATION FUSION, 2021, 68 : 118 - 130
  • [7] Discriminative Regularization with Conditional Generative Adversarial Nets for Semi-Supervised Learning
    Xie, Qianqian
    Peng, Min
    Huang, Jimin
    Wang, Bin
    Wang, Hua
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [8] Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification
    Zhu, Yi
    Shareghi, Ehsan
    Li, Yingzhen
    Reichart, Roi
    Korhonen, Anna
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 894 - 908
  • [9] Semi-Supervised Analysis of the Electrocardiogram Using Deep Generative Models
    Rasmussen, Soren M.
    Jensen, Malte E. K.
    Meyhoff, Christian S.
    Aasvang, Eske K.
    Sorensen, Helge B. D.
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 1124 - 1127
  • [10] Combining multi-distributed mixture models and Bayesian networks for semi-supervised learning
    Stritt, Manuel
    Schmidt-Thieme, Lars
    Poeppel, Gerhard
    [J]. ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 354 - +