How to Pretrain Deep Boltzmann Machines in Two Stages

被引：0

作者：

Cho, Kyunghyun ^{[1
]}

Raiko, Tapani ^{[1
]}

Ilin, Alexander ^{[1
]}

Karhunen, Juha ^{[1
]}

机构：

[1] Aalto Univ, Sch Sci, Dept Informat & Comp Sci, Espoo, Finland

来源：

ARTIFICIAL NEURAL NETWORKS | 2015年

关键词：

ALGORITHM; GRADIENT;

D O I：

10.1007/978-3-319-09903-3_10

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A deep Boltzmann machine (DBM) is a recently introduced Markov random field model that has multiple layers of hidden units. It has been shown empirically that it is difficult to train a DBM with approximate maximum-likelihood learning using the stochastic gradient unlike its simpler special case, restricted Boltzmann machine (RBM). In this paper, we propose a novel pretraining algorithm that consists of two stages; obtaining approximate posterior distributions over hidden units from a simpler model and maximizing the variational lower-bound given the fixed hidden posterior distributions. We show empirically that the proposed method overcomes the difficulty in training DBMs from randomly initialized parameters and results in a better, or comparable, generative model when compared to the conventional pretraining algorithm.

引用

页码：201 / 219

页数：19

共 50 条

[21] Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond
Guo, Heng
Kara, Kaan
Zhang, Ce
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[22] Annealing and Replica-Symmetry in Deep Boltzmann Machines
Diego Alberici
Adriano Barra
Pierluigi Contucci
Emanuele Mingione
Journal of Statistical Physics, 2020, 180 : 665 - 677
[23] Training deep Boltzmann networks with sparse Ising machines
Niazi, Shaila
Chowdhury, Shuvro
Aadit, Navid Anjum
Mohseni, Masoud
Qin, Yao
Camsari, Kerem Y.
NATURE ELECTRONICS, 2024, 7 (07): : 610 - 619
[24] Partitioned learning of deep Boltzmann machines for SNP data
Hess, Moritz
Lenz, Stefan
Blaette, Tamara J.
Bullinger, Lars
Binder, Harald
BIOINFORMATICS, 2017, 33 (20) : 3173 - 3180
[25] Deep Boltzmann Machines: Rigorous Results at Arbitrary Depth
Alberici, Diego
Contucci, Pierluigi
Mingione, Emanuele
ANNALES HENRI POINCARE, 2021, 22 (08): : 2619 - 2642
[26] Three learning stages and accuracy–efficiency tradeoff of restricted Boltzmann machines
Lennart Dabelow
Masahito Ueda
Nature Communications, 13
[27] Fault Diagnosis Method Based on Improved Deep Boltzmann Machines
Liu, Dan
Wang, Qin
Tao, Jiaojiao
Li, Guang
Wu, Jie
PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 458 - 462
[28] Boltzmann machines as two-dimensional tensor networks
Li, Sujie
Pan, Feng
Zhou, Pengfei
Zhang, Pan
PHYSICAL REVIEW B, 2021, 104 (07)
[29] An Adaptive Deep Belief Network With Sparse Restricted Boltzmann Machines
Wang, Gongming
Qiao, Junfei
Bi, Jing
Jia, Qing-Shan
Zhou, MengChu
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 4217 - 4228
[30] Protein Function Prediction Using Deep Restricted Boltzmann Machines
Zou, Xianchun
Wang, Guijun
Yu, Guoxian
BIOMED RESEARCH INTERNATIONAL, 2017, 2017

← 1 2 3 4 5 →