How to Pretrain Deep Boltzmann Machines in Two Stages

被引:0
|
作者
Cho, Kyunghyun [1 ]
Raiko, Tapani [1 ]
Ilin, Alexander [1 ]
Karhunen, Juha [1 ]
机构
[1] Aalto Univ, Sch Sci, Dept Informat & Comp Sci, Espoo, Finland
来源
关键词
ALGORITHM; GRADIENT;
D O I
10.1007/978-3-319-09903-3_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A deep Boltzmann machine (DBM) is a recently introduced Markov random field model that has multiple layers of hidden units. It has been shown empirically that it is difficult to train a DBM with approximate maximum-likelihood learning using the stochastic gradient unlike its simpler special case, restricted Boltzmann machine (RBM). In this paper, we propose a novel pretraining algorithm that consists of two stages; obtaining approximate posterior distributions over hidden units from a simpler model and maximizing the variational lower-bound given the fixed hidden posterior distributions. We show empirically that the proposed method overcomes the difficulty in training DBMs from randomly initialized parameters and results in a better, or comparable, generative model when compared to the conventional pretraining algorithm.
引用
收藏
页码:201 / 219
页数:19
相关论文
共 50 条
  • [41] Analysis on Noisy Boltzmann Machines and Noisy Restricted Boltzmann Machines
    Lu, Wenhao
    Leung, Chi-Sing
    Sum, John
    IEEE ACCESS, 2021, 9 : 112955 - 112965
  • [42] Text-mining the NeuroSynth corpus using Deep Boltzmann Machines
    Monti, Ricardo
    Lorenz, Romy
    Leech, Robert
    Anagnostopoulos, Christoforos
    Montana, Giovanni
    2016 6TH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION IN NEUROIMAGING (PRNI), 2016, : 13 - 16
  • [43] Application of Exchanging Monte Carlo Method to Sample Deep Boltzmann Machines
    Shibata, Hiroki
    Chen, Lieu-Hen
    Takama, Yasufumi
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 62 - 67
  • [44] Exploiting Restricted Boltzmann Machines and Deep Belief Networks in Compressed Sensing
    Polania, Luisa F.
    Barner, Kenneth E.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (17) : 4538 - 4550
  • [45] PV output forecasting by deep Boltzmann machines with SS-PPBSO
    Ogawa, Shota
    Mori, Hiroyuki
    ELECTRICAL ENGINEERING IN JAPAN, 2020, 213 (1-4) : 3 - 12
  • [46] Relationship between Pretraining and Maximum Likelihood Estimation in Deep Boltzmann Machines
    Yasuda, Muneki
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 582 - 590
  • [47] Robust spike-and-slab deep Boltzmann machines for face denoising
    Zhang, Nan
    Ding, Shifei
    Zhang, Jian
    Zhao, Xingyu
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 2815 - 2827
  • [48] Robust spike-and-slab deep Boltzmann machines for face denoising
    Nan Zhang
    Shifei Ding
    Jian Zhang
    Xingyu Zhao
    Neural Computing and Applications, 2020, 32 : 2815 - 2827
  • [49] Restricted Boltzmann Machines for Pre-training Deep Gaussian Networks
    Eastwood, Mark
    Jayne, Chrisina
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [50] Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines
    Duong, Chi Nhan
    Luu, Khoa
    Quach, Kha Gia
    Bui, Tien D.
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5772 - 5780