Learning with incomplete information in the committee machine

被引:0
|
作者
Urs M. Bergmann
Reimer Kühn
Ion-Olimpiu Stamatescu
机构
[1] Universität Heidelberg,Institut für Theoretische Physik
[2] Frankfurt Institute for Advanced Studies,Department of Mathematics
[3] King’s College,FESt, Heidelberg and Institut für Theoretische Physik
[4] Universität Heidelberg,undefined
来源
Biological Cybernetics | 2009年 / 101卷
关键词
Reinforcement learning; Online learning; Committee machine; Credit assignment; Coarsegrained analysis;
D O I
暂无
中图分类号
学科分类号
摘要
We study the problem of learning with incomplete information in a student–teacher setup for the committee machine. The learning algorithm combines unsupervised Hebbian learning of a series of associations with a delayed reinforcement step, in which the set of previously learnt associations is partly and indiscriminately unlearnt, to an extent that depends on the success rate of the student on these previously learnt associations. The relevant learning parameter λ represents the strength of Hebbian learning. A coarse-grained analysis of the system yields a set of differential equations for overlaps of student and teacher weight vectors, whose solutions provide a complete description of the learning behavior. It reveals complicated dynamics showing that perfect generalization can be obtained if the learning parameter exceeds a threshold λc, and if the initial value of the overlap between student and teacher weights is non-zero. In case of convergence, the generalization error exhibits a power law decay as a function of the number of examples used in training, with an exponent that depends on the parameter λ. An investigation of the system flow in a subspace with broken permutation symmetry between hidden units reveals a bifurcation point λ* above which perfect generalization does not depend on initial conditions. Finally, we demonstrate that cases of a complexity mismatch between student and teacher are optimally resolved in the sense that an over-complex student can emulate a less complex teacher rule, while an under-complex student reaches a state which realizes the minimal generalization error compatible with the complexity mismatch.
引用
收藏
页码:401 / 410
页数:9
相关论文
共 50 条
  • [1] Learning with incomplete information in the committee machine
    Bergmann, Urs M.
    Kuehn, Reimer
    Stamatescu, Ion-Olimpiu
    BIOLOGICAL CYBERNETICS, 2009, 101 (5-6) : 401 - 410
  • [2] Evaluating Fairness of Machine Learning Models Under Uncertain and Incomplete Information
    Awasthi, Pranjal
    Beutel, Alex
    Kleindessner, Matthäus
    Morgenstern, Jamie
    Wang, Xuezhi
    PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 206 - 214
  • [3] STSIIML: Study on token shuffling under incomplete information based on machine learning
    Wang, Yilei
    Li, Tao
    Liu, Ming
    Li, Chunmei
    Wang, Hui
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 11078 - 11100
  • [4] ONLINE LEARNING IN THE COMMITTEE MACHINE
    COPELLI, M
    CATICHA, N
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (06): : 1615 - 1625
  • [5] Approval-Based Committee Voting under Incomplete Information
    Imber, Aviram
    Israel, Jonas
    Brill, Markus
    Kimelfeld, Benny
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5076 - 5083
  • [6] Hybrid committee machine for incremental learning
    Yang, J
    Luo, SW
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 391 - 395
  • [7] On-line learning in the committee machine
    Copelli, M.
    Caticha, N.
    Journal of Physics A: Mathematical and General, 28 (06):
  • [8] Strategic learning in games with incomplete information
    Wang, MH
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 2047 - 2052
  • [9] NETWORK EVOLUTION WITH INCOMPLETE INFORMATION AND LEARNING
    Xu, Jie
    Zhang, Simpson
    van der Schaar, Mihaela
    2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 1163 - 1168
  • [10] Sequential Interdiction with Incomplete Information and Learning
    Borrero, Juan S.
    Prokopyev, Oleg A.
    Saure, Denis
    OPERATIONS RESEARCH, 2019, 67 (01) : 72 - 89