Infinite Swapping Algorithm for Training Restricted Boltzmann Machines

被引：2

作者：

Hult, Henrik ^{[1
]}

Nyquist, Pierre ^{[1
]}

Ringqvist, Carl ^{[1
]}

机构：

[1] KTH Royal Inst Technol, Lindstedtsvagen 25, S-10044 Stockholm, Sweden

来源：

MONTE CARLO AND QUASI-MONTE CARLO METHODS, MCQMC 2018 | 2020年 / 324卷

关键词：

Infinite swapping; Restricted Boltzmann machines; Statistical learning; Latent variable models; Gibbs sampling; MONTE-CARLO; PRODUCTS;

D O I：

10.1007/978-3-030-43465-6_14

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

Given the important role latent variable models play, for example in statistical learning, there is currently a growing need for efficient Monte Carlo methods for conducting inference on the latent variables given data. Recently, Desjardins et al. (JMLR Workshop and Conference Proceedings: AISTATS 2010, pp. 145-152, 2010 [3]) explored the use of the parallel tempering algorithm for training restricted Boltzmann machines, showing considerable improvement over the previous state-of-the-art. In this paper we continue their efforts by comparing previous methods, including parallel tempering, with the infinite swapping algorithm, an MCMC method first conceived when attempting to optimise performance of parallel tempering (Dupuis et al. in J. Chem. Phys. 137, 2012 [7]), for the training task. We implement a Gibbs-sampling version of infinite swapping and evaluate its performance on a number of test cases, concluding that the algorithm enjoys better mixing properties than both persistent contrastive divergence and parallel tempering for complex energy landscapes associated with restricted Boltzmann machines.

引用

页码：285 / 307

页数：23

共 50 条

[21] Discrete restricted Boltzmann machines
Montúfar, Guido
Morton, Jason
[J]. Journal of Machine Learning Research, 2015, 16 : 653 - 672
[22] An Overview of Restricted Boltzmann Machines
Vidyadhar Upadhya
P. S. Sastry
[J]. Journal of the Indian Institute of Science, 2019, 99 : 225 - 236
[23] Continuous restricted Boltzmann machines
Robert W. Harrison
[J]. Wireless Networks, 2022, 28 : 1263 - 1267
[24] Fuzzy Restricted Boltzmann Machines
Harrison, Robert W.
Freas, Christopher
[J]. FUZZY LOGIC IN INTELLIGENT SYSTEM DESIGN: THEORY AND APPLICATIONS, 2018, 648 : 392 - 398
[25] Adaptive hyperparameter updating for training restricted Boltzmann machines on quantum annealers
Xu, Guanglei
Oates, William S.
[J]. SCIENTIFIC REPORTS, 2021, 11 (01)
[26] Adaptive hyperparameter updating for training restricted Boltzmann machines on quantum annealers
Guanglei Xu
William S. Oates
[J]. Scientific Reports, 11
[27] Exact Training of Restricted Boltzmann Machines on Intrinsically Low Dimensional Data
Decelle, A.
Furtlehner, C.
[J]. PHYSICAL REVIEW LETTERS, 2021, 127 (15)
[28] Incremental training of Restricted Boltzmann Machines using information driven saccades
Ortiz, Michael Garcia
Baillie, Jean-Christophe
[J]. FOUTH JOINT IEEE INTERNATIONAL CONFERENCES ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (IEEE ICDL-EPIROB 2014), 2014, : 325 - 330
[29] Restricted Boltzmann Machines for Pre-training Deep Gaussian Networks
Eastwood, Mark
Jayne, Chrisina
[J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[30] Training Restricted Boltzmann Machines With a D-Wave Quantum Annealer
Dixit, Vivek
Selvarajan, Raja
Alam, Muhammad A.
Humble, Travis S.
Kais, Sabre
[J]. FRONTIERS IN PHYSICS, 2021, 9

← 1 2 3 4 5 →