Three learning stages and accuracy–efficiency tradeoff of restricted Boltzmann machines

Cited by: 0

Authors:

Lennart Dabelow

Masahito Ueda

Affiliations:

[1] RIKEN Center for Emergent Matter Science (CEMS)

[2] Department of Physics and Institute for Physics of Intelligence, Graduate School of Science, The University of Tokyo

Source:

Nature Communications, Volume 13

DOI:

Not available

Abstract:

Restricted Boltzmann Machines (RBMs) offer a versatile architecture for unsupervised machine learning that can in principle approximate any target probability distribution with arbitrary accuracy. However, the RBM model is usually not directly accessible due to its computational complexity, and Markov-chain sampling is invoked to analyze the learned probability distribution. For training and eventual applications, it is thus desirable to have a sampler that is both accurate and efficient. We highlight that these two goals generally compete with each other and cannot be achieved simultaneously. More specifically, we identify and quantitatively characterize three regimes of RBM learning: independent learning, where the accuracy improves without losing efficiency; correlation learning, where higher accuracy entails lower efficiency; and degradation, where both accuracy and efficiency no longer improve or even deteriorate. These findings are based on numerical experiments and heuristic arguments.
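
The Markov-chain sampling mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which sampler the authors use, so the code below assumes binary units and alternating (block) Gibbs sampling, a common choice for RBMs. The names `W`, `a_vis`, `b_hid` and `gibbs_chain`, as well as the small random parameters, are hypothetical and chosen only for illustration.

```python
import math
import random


def sigmoid(x):
    """Logistic function, written to avoid overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def sample_hidden(v, W, b_hid, rng):
    """Draw each hidden unit from p(h_j = 1 | v) = sigmoid(b_j + sum_i v_i W_ij)."""
    n_vis = len(v)
    return [
        1 if rng.random() < sigmoid(b_hid[j] + sum(v[i] * W[i][j] for i in range(n_vis))) else 0
        for j in range(len(b_hid))
    ]


def sample_visible(h, W, a_vis, rng):
    """Draw each visible unit from p(v_i = 1 | h) = sigmoid(a_i + sum_j W_ij h_j)."""
    n_hid = len(h)
    return [
        1 if rng.random() < sigmoid(a_vis[i] + sum(W[i][j] * h[j] for j in range(n_hid))) else 0
        for i in range(len(a_vis))
    ]


def gibbs_chain(v0, W, a_vis, b_hid, n_steps, rng):
    """Run n_steps of alternating Gibbs updates; return the visited visible states."""
    v = list(v0)
    states = []
    for _ in range(n_steps):
        h = sample_hidden(v, W, b_hid, rng)
        v = sample_visible(h, W, a_vis, rng)
        states.append(v)
    return states


rng = random.Random(0)
n_vis, n_hid = 4, 2
# Hypothetical small random parameters, not taken from the paper.
W = [[rng.gauss(0.0, 0.5) for _ in range(n_hid)] for _ in range(n_vis)]
a_vis = [0.0] * n_vis
b_hid = [0.0] * n_hid

chain = gibbs_chain([0] * n_vis, W, a_vis, b_hid, n_steps=1000, rng=rng)
# Empirical mean activation of each visible unit along the chain.
mean_v = [sum(s[i] for s in chain) / len(chain) for i in range(n_vis)]
```

In this sketch, the number of Gibbs steps needed before successive states become nearly uncorrelated is one possible proxy for the sampling efficiency the abstract refers to; how the authors quantify it is not stated here.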
