Learning with incomplete information in the committee machine

Authors
Urs M. Bergmann
Reimer Kühn
Ion-Olimpiu Stamatescu
Affiliations
[1] Universität Heidelberg, Institut für Theoretische Physik
[2] Frankfurt Institute for Advanced Studies, Department of Mathematics
[3] King’s College, FESt, Heidelberg and Institut für Theoretische Physik
[4] Universität Heidelberg
Source
Biological Cybernetics | 2009 / Volume 101
Keywords
Reinforcement learning; Online learning; Committee machine; Credit assignment; Coarse-grained analysis
DOI
Not available
Abstract
We study the problem of learning with incomplete information in a student–teacher setup for the committee machine. The learning algorithm combines unsupervised Hebbian learning of a series of associations with a delayed reinforcement step, in which the set of previously learnt associations is partly and indiscriminately unlearnt, to an extent that depends on the success rate of the student on these previously learnt associations. The relevant learning parameter λ represents the strength of Hebbian learning. A coarse-grained analysis of the system yields a set of differential equations for overlaps of student and teacher weight vectors, whose solutions provide a complete description of the learning behavior. It reveals complicated dynamics showing that perfect generalization can be obtained if the learning parameter exceeds a threshold λc, and if the initial value of the overlap between student and teacher weights is non-zero. In case of convergence, the generalization error exhibits a power law decay as a function of the number of examples used in training, with an exponent that depends on the parameter λ. An investigation of the system flow in a subspace with broken permutation symmetry between hidden units reveals a bifurcation point λ* above which perfect generalization does not depend on initial conditions. Finally, we demonstrate that cases of a complexity mismatch between student and teacher are optimally resolved in the sense that an over-complex student can emulate a less complex teacher rule, while an under-complex student reaches a state which realizes the minimal generalization error compatible with the complexity mismatch.
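The learning scheme described in the abstract can be illustrated with a minimal sketch. The abstract does not give the update equations, so everything concrete here is an assumption made for illustration: the function names, the ±1 input patterns, the majority-vote committee with an odd number K of hidden units, the scaling a/√N, and the reading of λ as the ratio of Hebbian strength to unlearning strength. It is not the authors' implementation.

```python
import numpy as np


def hidden_signs(weights, x):
    """Signs (+1/-1) of the K hidden-unit fields for one input pattern x."""
    return np.where(weights @ x >= 0, 1.0, -1.0)


def committee_output(weights, x):
    """Majority vote of the hidden units; K is assumed odd, so no ties."""
    return 1.0 if hidden_signs(weights, x).sum() > 0 else -1.0


def train_block(student, teacher, patterns, lam, a=1.0):
    """Process one block of patterns and return (new_student, error_rate).

    Assumed update rule (hypothetical, not taken from the abstract):
      1. Hebbian step per pattern, reinforcing the student's own hidden
         signs with strength lam * a / sqrt(N).
      2. One delayed step that unlearns the whole block indiscriminately,
         with strength a / sqrt(N) times the block error rate.
    """
    student = student.copy()
    n_inputs = student.shape[1]
    hebbian_sum = np.zeros_like(student)
    errors = 0
    for x in patterns:
        signs = hidden_signs(student, x)
        student_out = 1.0 if signs.sum() > 0 else -1.0
        if student_out != committee_output(teacher, x):
            errors += 1
        term = np.outer(signs, x) / np.sqrt(n_inputs)
        student += lam * a * term      # unsupervised Hebbian learning
        hebbian_sum += term
    error_rate = errors / len(patterns)
    student -= a * error_rate * hebbian_sum  # delayed reinforcement (unlearning)
    return student, error_rate


def generalization_error(student, teacher, rng, n_test=2000):
    """Monte Carlo estimate of the student/teacher disagreement probability."""
    x = rng.choice([-1.0, 1.0], size=(n_test, student.shape[1]))

    def vote(w):
        return np.where(x @ w.T >= 0, 1.0, -1.0).sum(axis=1) > 0

    return float(np.mean(vote(student) != vote(teacher)))


# Usage with arbitrary, illustrative sizes and an arbitrary value of lam.
rng = np.random.default_rng(0)
K, N = 3, 101
teacher = rng.standard_normal((K, N))
student = teacher + rng.standard_normal((K, N))  # non-zero initial overlap
for _ in range(50):
    block = rng.choice([-1.0, 1.0], size=(10, N))
    student, block_error = train_block(student, teacher, block, lam=0.5)
estimated_error = generalization_error(student, teacher, rng)
```

The student never sees the teacher's answer for an individual pattern in this sketch; only the error rate over the block enters the update. That is one way to read the "incomplete information" setting, and whether a given λ leads to convergence is left for the reader to explore.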
Pages: 401 - 410
Number of pages: 9