Three learning stages and accuracy–efficiency tradeoff of restricted Boltzmann machines

Authors
Lennart Dabelow
Masahito Ueda
Affiliations
[1] RIKEN Center for Emergent Matter Science (CEMS)
[2] Department of Physics and Institute for Physics of Intelligence, Graduate School of Science, The University of Tokyo
Abstract
Restricted Boltzmann Machines (RBMs) offer a versatile architecture for unsupervised machine learning that can in principle approximate any target probability distribution with arbitrary accuracy. However, the RBM model is usually not directly accessible due to its computational complexity, and Markov-chain sampling is invoked to analyze the learned probability distribution. For training and eventual applications, it is thus desirable to have a sampler that is both accurate and efficient. We highlight that these two goals generally compete with each other and cannot be achieved simultaneously. More specifically, we identify and quantitatively characterize three regimes of RBM learning: independent learning, where the accuracy improves without losing efficiency; correlation learning, where higher accuracy entails lower efficiency; and degradation, where both accuracy and efficiency no longer improve or even deteriorate. These findings are based on numerical experiments and heuristic arguments.
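
The Markov-chain sampling mentioned in the abstract can be illustrated with block Gibbs sampling, a standard sampler for RBMs. The sketch below is not taken from the paper: it assumes binary units with values in {0, 1} and the common energy convention E(v, h) = -a·v - b·h - vᵀWh, and the names `gibbs_step`, `sample_chain`, `W`, `a`, and `b` are hypothetical.

```python
import math
import random


def sigmoid(x):
    # Numerically safe logistic function: 1 / (1 + exp(-x)).
    return 0.5 * (1.0 + math.tanh(0.5 * x))


def gibbs_step(v, W, a, b, rng):
    """One block-Gibbs update v -> h -> v' for a binary {0, 1} RBM.

    W[i][j] couples visible unit i to hidden unit j;
    a and b are the visible and hidden biases (assumed convention).
    """
    n_v, n_h = len(a), len(b)
    # Hidden units are conditionally independent given the visible layer.
    h = [
        1 if rng.random() < sigmoid(b[j] + sum(v[i] * W[i][j] for i in range(n_v))) else 0
        for j in range(n_h)
    ]
    # Visible units are conditionally independent given the hidden layer.
    v_new = [
        1 if rng.random() < sigmoid(a[i] + sum(W[i][j] * h[j] for j in range(n_h))) else 0
        for i in range(n_v)
    ]
    return v_new, h


def sample_chain(W, a, b, n_steps, seed=0):
    """Run a single Markov chain and return the visited visible states."""
    rng = random.Random(seed)
    v = [rng.randint(0, 1) for _ in a]  # random initial visible state
    samples = []
    for _ in range(n_steps):
        v, _h = gibbs_step(v, W, a, b, rng)
        samples.append(tuple(v))
    return samples
```

Successive states of such a chain are correlated, so the number of steps needed between effectively independent samples is one informal way to think about sampler efficiency. The precise accuracy and efficiency measures used by the authors are not reproduced here.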