How to Pretrain Deep Boltzmann Machines in Two Stages

被引:0
|
作者
Cho, Kyunghyun [1 ]
Raiko, Tapani [1 ]
Ilin, Alexander [1 ]
Karhunen, Juha [1 ]
机构
[1] Aalto Univ, Sch Sci, Dept Informat & Comp Sci, Espoo, Finland
来源
关键词
ALGORITHM; GRADIENT;
D O I
10.1007/978-3-319-09903-3_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A deep Boltzmann machine (DBM) is a recently introduced Markov random field model that has multiple layers of hidden units. It has been shown empirically that it is difficult to train a DBM with approximate maximum-likelihood learning using the stochastic gradient unlike its simpler special case, restricted Boltzmann machine (RBM). In this paper, we propose a novel pretraining algorithm that consists of two stages; obtaining approximate posterior distributions over hidden units from a simpler model and maximizing the variational lower-bound given the fixed hidden posterior distributions. We show empirically that the proposed method overcomes the difficulty in training DBMs from randomly initialized parameters and results in a better, or comparable, generative model when compared to the conventional pretraining algorithm.
引用
收藏
页码:201 / 219
页数:19
相关论文
共 50 条
  • [21] Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond
    Guo, Heng
    Kara, Kaan
    Zhang, Ce
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [22] Annealing and Replica-Symmetry in Deep Boltzmann Machines
    Diego Alberici
    Adriano Barra
    Pierluigi Contucci
    Emanuele Mingione
    Journal of Statistical Physics, 2020, 180 : 665 - 677
  • [23] Training deep Boltzmann networks with sparse Ising machines
    Niazi, Shaila
    Chowdhury, Shuvro
    Aadit, Navid Anjum
    Mohseni, Masoud
    Qin, Yao
    Camsari, Kerem Y.
    NATURE ELECTRONICS, 2024, 7 (07): : 610 - 619
  • [24] Partitioned learning of deep Boltzmann machines for SNP data
    Hess, Moritz
    Lenz, Stefan
    Blaette, Tamara J.
    Bullinger, Lars
    Binder, Harald
    BIOINFORMATICS, 2017, 33 (20) : 3173 - 3180
  • [25] Deep Boltzmann Machines: Rigorous Results at Arbitrary Depth
    Alberici, Diego
    Contucci, Pierluigi
    Mingione, Emanuele
    ANNALES HENRI POINCARE, 2021, 22 (08): : 2619 - 2642
  • [26] Three learning stages and accuracy–efficiency tradeoff of restricted Boltzmann machines
    Lennart Dabelow
    Masahito Ueda
    Nature Communications, 13
  • [27] Fault Diagnosis Method Based on Improved Deep Boltzmann Machines
    Liu, Dan
    Wang, Qin
    Tao, Jiaojiao
    Li, Guang
    Wu, Jie
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 458 - 462
  • [28] Boltzmann machines as two-dimensional tensor networks
    Li, Sujie
    Pan, Feng
    Zhou, Pengfei
    Zhang, Pan
    PHYSICAL REVIEW B, 2021, 104 (07)
  • [29] An Adaptive Deep Belief Network With Sparse Restricted Boltzmann Machines
    Wang, Gongming
    Qiao, Junfei
    Bi, Jing
    Jia, Qing-Shan
    Zhou, MengChu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 4217 - 4228
  • [30] Protein Function Prediction Using Deep Restricted Boltzmann Machines
    Zou, Xianchun
    Wang, Guijun
    Yu, Guoxian
    BIOMED RESEARCH INTERNATIONAL, 2017, 2017