Infinite Swapping Algorithm for Training Restricted Boltzmann Machines

Cited by: 2

Authors:

Hult, Henrik [1]

Nyquist, Pierre [1]

Ringqvist, Carl [1]

Affiliations:

[1] KTH Royal Inst Technol, Lindstedtsvagen 25, S-10044 Stockholm, Sweden

Source:

MONTE CARLO AND QUASI-MONTE CARLO METHODS, MCQMC 2018 | 2020 / Vol. 324

Keywords:

Infinite swapping; Restricted Boltzmann machines; Statistical learning; Latent variable models; Gibbs sampling; MONTE-CARLO; PRODUCTS;

DOI:

10.1007/978-3-030-43465-6_14

Chinese Library Classification number:

O29 [Applied Mathematics];

Discipline classification code:

070104;

Abstract:

Given the important role latent variable models play, for example in statistical learning, there is currently a growing need for efficient Monte Carlo methods for conducting inference on the latent variables given data. Recently, Desjardins et al. (JMLR Workshop and Conference Proceedings: AISTATS 2010, pp. 145-152, 2010 [3]) explored the use of the parallel tempering algorithm for training restricted Boltzmann machines, showing considerable improvement over the previous state-of-the-art. In this paper we continue their efforts by comparing previous methods, including parallel tempering, with the infinite swapping algorithm, an MCMC method first conceived when attempting to optimise performance of parallel tempering (Dupuis et al. in J. Chem. Phys. 137, 2012 [7]), for the training task. We implement a Gibbs-sampling version of infinite swapping and evaluate its performance on a number of test cases, concluding that the algorithm enjoys better mixing properties than both persistent contrastive divergence and parallel tempering for complex energy landscapes associated with restricted Boltzmann machines.


Pages: 285-307

Page count: 23

50 items in total

[1] On better training the infinite restricted Boltzmann machines
Xuan Peng
Xunzhang Gao
Xiang Li
[J]. Machine Learning, 2018, 107 : 943 - 968
[2] On better training the infinite restricted Boltzmann machines
Peng, Xuan
Gao, Xunzhang
Li, Xiang
[J]. MACHINE LEARNING, 2018, 107 (06) : 943 - 968
[3] Training Restricted Boltzmann Machines
Fischer, Asja
[J]. KUNSTLICHE INTELLIGENZ, 2015, 29 (04) : 441 - 444
[4] Training restricted Boltzmann machines: An introduction
Fischer, Asja
Igel, Christian
[J]. PATTERN RECOGNITION, 2014, 47 (01) : 25 - 39
[5] Wasserstein Training of Restricted Boltzmann Machines
Montavon, Gregoire
Mueller, Klaus-Robert
Cuturi, Marco
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[6] The Streaming Approach to Training Restricted Boltzmann Machines
Duda, Piotr
Rutkowski, Leszek
Woldan, Piotr
Najgebauer, Patryk
[J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT I, 2021, 12854 : 308 - 317
[7] Enhanced Gradient for Training Restricted Boltzmann Machines
Cho, KyungHyun
Raiko, Tapani
Ilin, Alexander
[J]. NEURAL COMPUTATION, 2013, 25 (03) : 805 - 831
[8] Approximate Learning Algorithm for Restricted Boltzmann Machines
Yasuda, Muneki
Tanaka, Kazuyuki
[J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008 : 692 - 697
[9] Generative and discriminative infinite restricted Boltzmann machine training
Wang, Qianglong
Gao, Xiaoguang
Wan, Kaifang
Hu, Zijian
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (10) : 7857 - 7887
[10] TRAINING RESTRICTED BOLTZMANN MACHINES WITH AUXILIARY FUNCTION APPROACH
Kameoka, Hirokazu
Takamune, Norihiro
[J]. 2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014
