Infinite Swapping Algorithm for Training Restricted Boltzmann Machines

被引:2
|
作者
Hult, Henrik [1 ]
Nyquist, Pierre [1 ]
Ringqvist, Carl [1 ]
机构
[1] KTH Royal Inst Technol, Lindstedtsvagen 25, S-10044 Stockholm, Sweden
关键词
Infinite swapping; Restricted Boltzmann machines; Statistical learning; Latent variable models; Gibbs sampling; MONTE-CARLO; PRODUCTS;
D O I
10.1007/978-3-030-43465-6_14
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Given the important role latent variable models play, for example in statistical learning, there is currently a growing need for efficient Monte Carlo methods for conducting inference on the latent variables given data. Recently, Desjardins et al. (JMLR Workshop and Conference Proceedings: AISTATS 2010, pp. 145-152, 2010 [3]) explored the use of the parallel tempering algorithm for training restricted Boltzmann machines, showing considerable improvement over the previous state-of-the-art. In this paper we continue their efforts by comparing previous methods, including parallel tempering, with the infinite swapping algorithm, an MCMC method first conceived when attempting to optimise performance of parallel tempering (Dupuis et al. in J. Chem. Phys. 137, 2012 [7]), for the training task. We implement a Gibbs-sampling version of infinite swapping and evaluate its performance on a number of test cases, concluding that the algorithm enjoys better mixing properties than both persistent contrastive divergence and parallel tempering for complex energy landscapes associated with restricted Boltzmann machines.
引用
收藏
页码:285 / 307
页数:23
相关论文
共 50 条
  • [1] On better training the infinite restricted Boltzmann machines
    Xuan Peng
    Xunzhang Gao
    Xiang Li
    [J]. Machine Learning, 2018, 107 : 943 - 968
  • [2] On better training the infinite restricted Boltzmann machines
    Peng, Xuan
    Gao, Xunzhang
    Li, Xiang
    [J]. MACHINE LEARNING, 2018, 107 (06) : 943 - 968
  • [3] Training Restricted Boltzmann Machines
    Fischer, Asja
    [J]. KUNSTLICHE INTELLIGENZ, 2015, 29 (04): : 441 - 444
  • [4] Training restricted Boltzmann machines: An introduction
    Fischer, Asja
    Igel, Christian
    [J]. PATTERN RECOGNITION, 2014, 47 (01) : 25 - 39
  • [5] Wasserstein Training of Restricted Boltzmann Machines
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Cuturi, Marco
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [6] The Streaming Approach to Training Restricted Boltzmann Machines
    Duda, Piotr
    Rutkowski, Leszek
    Woldan, Piotr
    Najgebauer, Patryk
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT I, 2021, 12854 : 308 - 317
  • [7] Enhanced Gradient for Training Restricted Boltzmann Machines
    Cho, KyungHyun
    Raiko, Tapani
    Ilin, Alexander
    [J]. NEURAL COMPUTATION, 2013, 25 (03) : 805 - 831
  • [8] Approximate Learning Algorithm for Restricted Boltzmann Machines
    Yasuda, Muneki
    Tanaka, Kazuyuki
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 692 - 697
  • [9] Generative and discriminative infinite restricted Boltzmann machine training
    Wang, Qianglong
    Gao, Xiaoguang
    Wan, Kaifang
    Hu, Zijian
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (10) : 7857 - 7887
  • [10] TRAINING RESTRICTED BOLTZMANN MACHINES WITH AUXILIARY FUNCTION APPROACH
    Kameoka, Hirokazu
    Takamune, Norihiro
    [J]. 2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,