Learning Restricted Boltzmann Machines via Influence Maximization

被引:11
|
作者
Bresler, Guy [1 ,2 ]
Koehler, Frederic [3 ]
Moitra, Ankur [2 ,4 ]
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[2] IDSS, Cambridge, MA 02142 USA
[3] MIT, Dept Math, Cambridge, MA 02139 USA
[4] MIT, Dept Math, CSAIL, Cambridge, MA 02139 USA
关键词
Graphical models; Restricted Boltzmann Machines; submodularity; unsupervised learning; MODEL SELECTION;
D O I
10.1145/3313276.3316372
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graphical models are a rich language for describing high-dimensional distributions in terms of their dependence structure. While there are algorithms with provable guarantees for learning undirected graphical models in a variety of settings, there has been much less progress in the important scenario when there are latent variables. Herewe study Restricted Boltzmann Machines (or RBMs), which are a popular model with wide-ranging applications in dimensionality reduction, collaborative filtering, topic modeling, feature extraction and deep learning. The main message of our paper is a strong dichotomy in the feasibility of learning RBMs, depending on the nature of the interactions between variables: ferromagnetic models can be learned efficiently, while general models cannot. In particular, we give a simple greedy algorithm based on influence maximization to learn ferromagnetic RBMs with bounded degree. In fact, we learn a description of the distribution on the observed variables as a Markov Random Field. Our analysis is based on tools from mathematical physics that were developed to show the concavity of magnetization. Our algorithm extends straighforwardly to general ferromagnetic Ising models with latent variables. Conversely, we show that even for a contant number of latent variables with constant degree, without ferromagneticity the problem is as hard as sparse parity with noise. This hardness result is based on a sharp and surprising characterization of the representational power of bounded degree RBMs: the distribution on their observed variables can simulate any bounded order MRF. This result is of independent interest since RBMs are the building blocks of deep belief networks.
引用
收藏
页码:828 / 839
页数:12
相关论文
共 50 条
  • [1] Learning ensemble classifiers via restricted Boltzmann machines
    Zhang, Chun-Xia
    Zhang, Jiang-She
    Ji, Nan-Nan
    Guo, Gao
    [J]. PATTERN RECOGNITION LETTERS, 2014, 36 : 161 - 170
  • [2] SCALABLE LEARNING FOR RESTRICTED BOLTZMANN MACHINES
    Barshan, Elnaz
    Fieguth, Paul
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2754 - 2758
  • [3] Rate Distortion Via Restricted Boltzmann Machines
    Li, Qing
    Chen, Yang
    [J]. 2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 1052 - 1059
  • [4] Spectral dynamics of learning in restricted Boltzmann machines
    Decelle, A.
    Fissore, G.
    Furtlehner, C.
    [J]. EPL, 2017, 119 (06)
  • [5] Neurosymbolic Reasoning and Learning with Restricted Boltzmann Machines
    Tran, Son N.
    Garcez, Artur d'Avila
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 5, 2023, : 6558 - 6565
  • [6] Approximate Learning Algorithm for Restricted Boltzmann Machines
    Yasuda, Muneki
    Tanaka, Kazuyuki
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 692 - 697
  • [7] An Incremental Learning Approach for Restricted Boltzmann Machines
    Yu, Jongmin
    Gwak, Jeonghwan
    Lee, Sejeong
    Jeon, Moongu
    [J]. FOURTH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (CCAIS 2015), 2015, : 113 - 117
  • [8] Effective learning algorithm for restricted Boltzmann machines via spatial Monte Carlo integration
    Sekimoto, Kaiji
    Yasuda, Muneki
    [J]. IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2023, 14 (02): : 228 - 241
  • [9] Thermodynamics of Restricted Boltzmann Machines and Related Learning Dynamics
    A. Decelle
    G. Fissore
    C. Furtlehner
    [J]. Journal of Statistical Physics, 2018, 172 : 1576 - 1608
  • [10] LEARNING SPAM FEATURES USING RESTRICTED BOLTZMANN MACHINES
    da Silva, Luis Alexandre
    Pontara da Costa, Kelton Augusto
    Ribeiro, Patricia Bellin
    de Rosa, Gustavo Henrique
    Papa, Joao Paulo
    [J]. IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2016, 11 (01): : 99 - 114