Infinite Swapping Algorithm for Training Restricted Boltzmann Machines

Authors
Hult, Henrik [1]
Nyquist, Pierre [1]
Ringqvist, Carl [1]
Affiliations
[1] KTH Royal Inst Technol, Lindstedtsvagen 25, S-10044 Stockholm, Sweden
Keywords
Infinite swapping; Restricted Boltzmann machines; Statistical learning; Latent variable models; Gibbs sampling; Monte Carlo; Products
DOI
10.1007/978-3-030-43465-6_14
Chinese Library Classification
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Given the important role latent variable models play, for example in statistical learning, there is currently a growing need for efficient Monte Carlo methods for conducting inference on the latent variables given data. Recently, Desjardins et al. (JMLR Workshop and Conference Proceedings: AISTATS 2010, pp. 145-152, 2010 [3]) explored the use of the parallel tempering algorithm for training restricted Boltzmann machines, showing considerable improvement over the previous state-of-the-art. In this paper we continue their efforts by comparing previous methods, including parallel tempering, with the infinite swapping algorithm, an MCMC method first conceived when attempting to optimise performance of parallel tempering (Dupuis et al. in J. Chem. Phys. 137, 2012 [7]), for the training task. We implement a Gibbs-sampling version of infinite swapping and evaluate its performance on a number of test cases, concluding that the algorithm enjoys better mixing properties than both persistent contrastive divergence and parallel tempering for complex energy landscapes associated with restricted Boltzmann machines.
Pages: 285-307
Page count: 23