On the effective initialisation for restricted Boltzmann machines via duality with Hopfield model

被引:18
|
作者
Leonelli, Francesca Elisa [1 ,2 ]
Agliari, Elena [1 ]
Albanese, Linda [3 ,4 ]
Barra, Adriano [3 ,5 ]
机构
[1] Sapienza Univ Roma, Dipartimento Matemat Guido Castelnuovo, Rome, Italy
[2] CNR, Ist Sci Marine, ISMAR, Rome, Italy
[3] Univ Salento, Dipartimento Matemat & Fis Ennio De Giorgi, Lecce, Italy
[4] Univ Salento, Scuola Super ISUFI, Lecce, Italy
[5] Ist Nazl Fis Nucl, Sez Lecce, Lecce, Italy
关键词
Hopfield model; Restricted Boltzmann machine; Statistical mechanics; INFORMATION-STORAGE; LEARNING ALGORITHM; NEURAL-NETWORKS; RETRIEVAL;
D O I
10.1016/j.neunet.2021.06.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Restricted Boltzmann machines (RBMs) with a binary visible layer of size N and a Gaussian hidden layer of size P have been proved to be equivalent to a Hopfield neural network (HNN) made of N binary neurons and storing P patterns xi, as long as the weights w in the former are identified with the patterns. Here we aim to leverage this equivalence to find effective initialisations for weights in the RBM when what is available is a set of noisy examples of each pattern, aiming to translate statistical mechanics background available for HNN to the study of RBM's learning and retrieval abilities. In particular, given a set of definite, structureless patterns we build a sample of blurred examples and prove that the initialisation where w corresponds to the empirical average xi over the sample is a fixed point under stochastic gradient descent. Further, as a toy application of the duality between HNN and RBM, we consider the simplest random auto-encoder (a three layer network made of two RBMs coupled by their hidden layer) and evidence that, as long as the parameter setting corresponds to the retrieval region of the dual HNN, reconstruction and denoising can be accomplished trivially, while when the system is in the spin-glass phase inference algorithms are necessary. This questions the need for larger retrieval regions which we obtain by applying a Gram-Schmidt orthogonalisation to the patterns: in fact, this procedure yields to a set of patterns devoid of correlations and for which the largest retrieval region can be accomplished. Finally we consider an application of duality also in a structured case: we test this approach on the MNIST dataset, and obtain that the network performs already similar to 67% of successful classifications, suggesting it can be exploited as a computationally-cheap pre-training. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:314 / 326
页数:13
相关论文
共 50 条
  • [31] Restricted Boltzmann machines in quantum physics
    Melko, Roger G.
    Carleo, Giuseppe
    Carrasquilla, Juan
    Cirac, J. Ignacio
    NATURE PHYSICS, 2019, 15 (09) : 887 - 892
  • [32] Temperature based Restricted Boltzmann Machines
    Li, Guoqi
    Deng, Lei
    Xu, Yi
    Wen, Changyun
    Wang, Wei
    Pei, Jing
    Shi, Luping
    SCIENTIFIC REPORTS, 2016, 6
  • [33] SCALABLE LEARNING FOR RESTRICTED BOLTZMANN MACHINES
    Barshan, Elnaz
    Fieguth, Paul
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2754 - 2758
  • [34] Wasserstein Training of Restricted Boltzmann Machines
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Cuturi, Marco
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [35] Gaussian Cardinality Restricted Boltzmann Machines
    Wan, Cheng
    Jin, Xiaoming
    Ding, Guiguang
    Shen, Dou
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3031 - 3037
  • [36] κ-Entropy Based Restricted Boltzmann Machines
    Passos, Leandro Aparecido
    Santana, Marcos Cleison
    Moreira, Thierry
    Papa, Joao Paulo
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [37] A topological insight into restricted Boltzmann machines
    Decebal Constantin Mocanu
    Elena Mocanu
    Phuong H. Nguyen
    Madeleine Gibescu
    Antonio Liotta
    Machine Learning, 2016, 104 : 243 - 270
  • [38] A topological insight into restricted Boltzmann machines
    Mocanu, Decebal Constantin
    Mocanu, Elena
    Nguyen, Phuong H.
    Gibescu, Madeleine
    Liotta, Antonio
    MACHINE LEARNING, 2016, 104 (2-3) : 243 - 270
  • [39] Multiview Graph Restricted Boltzmann Machines
    Zhang, Nan
    Sun, Shiliang
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (11) : 12414 - 12428
  • [40] Tensor Ring Restricted Boltzmann Machines
    Wang, Maolin
    Zhang, Chenbin
    Pan, Yu
    Xu, Jing
    Xu, Zenglin
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,