Thermodynamics of Restricted Boltzmann Machines and Related Learning Dynamics

被引:31
|
作者
Decelle, A. [1 ]
Fissore, G. [1 ]
Furtlehner, C. [2 ]
机构
[1] Univ Paris Saclay, Lab Rech Informat, TAU Inria Saclay, Bat 660, F-91190 Gif Sur Yvette, France
[2] INRIA Saclay, Orsay, France
关键词
Disorder systems; Neural networks; Unsupervised learning; NEURAL-NETWORKS; MODELS;
D O I
10.1007/s10955-018-2105-y
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We investigate the thermodynamic properties of a restricted Boltzmann machine (RBM), a simple energy-based generative model used in the context of unsupervised learning. Assuming the information content of this model to be mainly reflected by the spectral properties of its weight matrix W, we try to make a realistic analysis by averaging over an appropriate statistical ensemble of RBMs. First, a phase diagram is derived. Otherwise similar to that of the Sherrington-Kirkpatrick (SK) model with ferromagnetic couplings, the RBM's phase diagram presents a ferromagnetic phase which may or may not be of compositional type depending on the kurtosis of the distribution of the components of the singular vectors of W. Subsequently, the learning dynamics of the RBM is studied in the thermodynamic limit. A "typical" learning trajectory is shown to solve an effective dynamical equation, based on the aforementioned ensemble average and explicitly involving order parameters obtained from the thermodynamic analysis. In particular, this let us show how the evolution of the dominant singular values of W, and thus of the unstable modes, is driven by the input data. At the beginning of the training, in which the RBM is found to operate in the linear regime, the unstable modes reflect the dominant covariance modes of the data. In the non-linear regime, instead, the selected modes interact and eventually impose a matching of the order parameters to their empirical counterparts estimated from the data. Finally, we illustrate our considerations by performing experiments on both artificial and real data, showing in particular how the RBM operates in the ferromagnetic compositional phase.
引用
收藏
页码:1576 / 1608
页数:33
相关论文
共 50 条
  • [21] Analysis on Noisy Boltzmann Machines and Noisy Restricted Boltzmann Machines
    Lu, Wenhao
    Leung, Chi-Sing
    Sum, John
    [J]. IEEE ACCESS, 2021, 9 : 112955 - 112965
  • [22] Relational Restricted Boltzmann Machines: A Probabilistic Logic Learning Approach
    Kaur, Navdeep
    Kunapuli, Gautam
    Khot, Tushar
    Kersting, Kristian
    Cohen, William
    Natarajan, Sriraam
    [J]. INDUCTIVE LOGIC PROGRAMMING (ILP 2017), 2018, 10759 : 94 - 111
  • [23] Mode-assisted unsupervised learning of restricted Boltzmann machines
    Manukian, Haik
    Pei, Yan Ru
    Bearden, Sean R. B.
    Di Ventra, Massimiliano
    [J]. COMMUNICATIONS PHYSICS, 2020, 3 (01)
  • [24] Non-parametric learning of lifted Restricted Boltzmann Machines
    Kaur, Navdeep
    Kunapuli, Gautam
    Natarajan, Sriraam
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 120 : 33 - 47
  • [25] Mode-assisted unsupervised learning of restricted Boltzmann machines
    Haik Manukian
    Yan Ru Pei
    Sean R. B. Bearden
    Massimiliano Di Ventra
    [J]. Communications Physics, 3
  • [26] Improved Learning of Gaussian-Bernoulli Restricted Boltzmann Machines
    Cho, KyungHyun
    Ilin, Alexander
    Raiko, Tapani
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT I, 2011, 6791 : 10 - 17
  • [27] Convolutional restricted Boltzmann machines learning for robust visual tracking
    Lei, Jun
    Li, GuoHui
    Tu, Dan
    Guo, Qiang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2014, 25 (06): : 1383 - 1391
  • [28] Learning Large Q-Matrix by Restricted Boltzmann Machines
    Chengcheng Li
    Chenchen Ma
    Gongjun Xu
    [J]. Psychometrika, 2022, 87 : 1010 - 1041
  • [29] Efficient Learning of Restricted Boltzmann Machines Using Covariance Estimates
    Upadhya, Vidyadhar
    Sastry, P. S.
    [J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 851 - 866
  • [30] Deterministic and Generalized Framework for Unsupervised Learning with Restricted Boltzmann Machines
    Tramel, Eric W.
    Gabrie, Marylou
    Manoel, Andre
    Caltagirone, Francesco
    Krzakala, Florent
    [J]. PHYSICAL REVIEW X, 2018, 8 (04):