How to Pretrain Deep Boltzmann Machines in Two Stages

被引:0
|
作者
Cho, Kyunghyun [1 ]
Raiko, Tapani [1 ]
Ilin, Alexander [1 ]
Karhunen, Juha [1 ]
机构
[1] Aalto Univ, Sch Sci, Dept Informat & Comp Sci, Espoo, Finland
来源
关键词
ALGORITHM; GRADIENT;
D O I
10.1007/978-3-319-09903-3_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A deep Boltzmann machine (DBM) is a recently introduced Markov random field model that has multiple layers of hidden units. It has been shown empirically that it is difficult to train a DBM with approximate maximum-likelihood learning using the stochastic gradient unlike its simpler special case, restricted Boltzmann machine (RBM). In this paper, we propose a novel pretraining algorithm that consists of two stages; obtaining approximate posterior distributions over hidden units from a simpler model and maximizing the variational lower-bound given the fixed hidden posterior distributions. We show empirically that the proposed method overcomes the difficulty in training DBMs from randomly initialized parameters and results in a better, or comparable, generative model when compared to the conventional pretraining algorithm.
引用
收藏
页码:201 / 219
页数:19
相关论文
共 50 条
  • [31] Representational power of restricted Boltzmann machines and deep belief networks
    Le Roux, Nicolas
    Bengio, Yoshua
    NEURAL COMPUTATION, 2008, 20 (06) : 1631 - 1649
  • [32] Rolling Bearing Fault Diagnosis based on Deep Boltzmann Machines
    Deng, Shengcai
    Cheng, Zhiwei
    Li, Chuan
    Yao, Xingyan
    Chen, Zhiqiang
    Sanchez, Rene-Vinicio
    2016 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-CHENGDU), 2016,
  • [33] Deep Boltzmann Machines for Robust Fingerprint Spoofing Attack Detection
    Souza, Gustavo B.
    Santos, Daniel F. S.
    Pires, Rafael G.
    Marana, Aparecido N.
    Papa, Joao P.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1863 - 1870
  • [34] Detection of Hypertension Retinopathy Using Deep Learning and Boltzmann Machines
    Triwijoyo, B. K.
    Pradipto, Y. D.
    1ST INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2016 : APPLIED INFORMATICS TOWARD SMART ENVIRONMENT, PEOPLE, AND SOCIETY, 2017, 801
  • [35] Restricted Boltzmann Machines and Deep Belief Networks on Sunway Cluster
    Song, Kaida
    Liu, Yi
    Wang, Rui
    Zhao, Meiting
    Hao, Ziyu
    Qian, Depei
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 245 - 252
  • [36] Mode-assisted joint training of deep Boltzmann machines
    Manukian, Haik
    Di Ventra, Massimiliano
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [37] Mode-assisted joint training of deep Boltzmann machines
    Haik Manukian
    Massimiliano Di Ventra
    Scientific Reports, 11
  • [38] Beyond Principal Components: Deep Boltzmann Machines for Face Modeling
    Chi Nhan Duong
    Luu, Khoa
    Quach, Kha Gia
    Bui, Tien D.
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4786 - 4794
  • [39] Neuromorphic Adaptations of Restricted Boltzmann Machines and Deep Belief Networks
    Pedroni, Bruno U.
    Das, Srinjoy
    Neftci, Emre
    Kreutz-Delgado, Kenneth
    Cauwenberghs, Gert
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [40] Three learning stages and accuracy-efficiency tradeoff of restricted Boltzmann machines
    Dabelow, Lennart
    Ueda, Masahito
    NATURE COMMUNICATIONS, 2022, 13 (01)