Learning with incomplete information and the mathematical structure behind it

被引:0
|
作者
Kuehn, Reimer
Stamatescu, Ion-Olimpiu [1 ]
机构
[1] Kings Coll London, Dept Math, London WC2R 2LS, England
[2] Univ Heidelberg, FESt, D-6900 Heidelberg, Germany
[3] Univ Heidelberg, Inst Theoret Phys, D-6900 Heidelberg, Germany
关键词
D O I
10.1007/s00422-007-0162-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate the problem of learning with incomplete information as exemplified by learning with delayed reinforcement. We study a two phase learning scenario in which a phase of Hebbian associative learning based on momentary internal representations is supplemented by an 'unlearning' phase depending on a graded reinforcement signal. The reinforcement signal quantifies the success-rate globally for a number of learning steps in phase one, and 'unlearning' is indiscriminate with respect to associations learnt in that phase. Learning according to this model is studied via simulations and analytically within a student-teacher scenario for both single layer networks and, for a committee machine. Success and speed of learning depend on the ratio lambda of the learning rates used for the associative Hebbian learning phase and for the unlearning-correction in response to the reinforcement signal, respectively. Asymptotically perfect generalization is possible only, if this ratio exceeds a critical value lambda (c) , in which case the generalization error exhibits a power law decay with the number of examples seen by the student, with an exponent that depends in a non-universal manner on the parameter lambda. We find these features to be robust against a wide spectrum of modifications of microscopic modelling details. Two illustrative applications-one of a robot learning to navigate a field containing obstacles, and the problem of identifying a specific component in a collection of stimuli-are also provided.
引用
收藏
页码:99 / 112
页数:14
相关论文
共 50 条
  • [21] Beliefs and engagement structures: behind the affective dimension of mathematical learning
    Goldin, Gerald A.
    Epstein, Yakov M.
    Schorr, Roberta Y.
    Warner, Lisa B.
    ZDM-MATHEMATICS EDUCATION, 2011, 43 (04): : 547 - 560
  • [22] Mathematical analysis behind occlusion handling in image with deep learning
    Chavan S.A.
    Chaudhari N.M.
    Ramteke R.J.
    Pawar U.B.
    International Journal of Information Technology, 2023, 15 (7) : 3741 - 3749
  • [23] Mathematical Logic Models for Decision Support Systems in the Case of Incomplete Information
    Baliuk, A. S.
    Peryazev, N. A.
    Gaidukov, A. I.
    Peryazeva, Yu. V.
    2015 XVIII International Conference on Soft Computing and Measurements (SCM), 2015, : 217 - 218
  • [24] The Structure of Online Information Behind Social Crises
    Cardenas, Juan Pablo
    Olivares, Gaston
    Vidal, Gerardo
    Urbina, Carolina
    Fuentes, Miguel
    FRONTIERS IN PHYSICS, 2021, 9
  • [25] Research on Mathematical Structure of Information Field
    Mu Yan
    Duan Rong
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1387 - 1391
  • [26] The Mathematical Structure of Information Bottleneck Methods
    Gedeon, Tomas
    Parker, Albert E.
    Dimitrov, Alexander G.
    ENTROPY, 2012, 14 (03) : 456 - 479
  • [27] The Mathematical Structure of Integrated Information Theory
    Kleiner, Johannes
    Tull, Sean
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2021, 6
  • [30] Investment Timing with Incomplete Information and Multiple Means of Learning
    Harrison, J. Michael
    Sunar, Nur
    OPERATIONS RESEARCH, 2015, 63 (02) : 442 - 457