Learning with incomplete information and the mathematical structure behind it

Authors
Kuehn, Reimer
Stamatescu, Ion-Olimpiu [1]
Affiliations
[1] Kings Coll London, Dept Math, London WC2R 2LS, England
[2] Univ Heidelberg, FESt, D-6900 Heidelberg, Germany
[3] Univ Heidelberg, Inst Theoret Phys, D-6900 Heidelberg, Germany
DOI
10.1007/s00422-007-0162-4
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
We investigate the problem of learning with incomplete information, as exemplified by learning with delayed reinforcement. We study a two-phase learning scenario in which a phase of Hebbian associative learning based on momentary internal representations is supplemented by an 'unlearning' phase that depends on a graded reinforcement signal. The reinforcement signal quantifies the success rate globally over a number of learning steps in phase one, and 'unlearning' is indiscriminate with respect to the associations learnt in that phase. Learning according to this model is studied via simulations and analytically within a student-teacher scenario, both for single-layer networks and for a committee machine. Success and speed of learning depend on the ratio λ of the learning rates used for the associative Hebbian learning phase and for the unlearning correction in response to the reinforcement signal, respectively. Asymptotically perfect generalization is possible only if this ratio exceeds a critical value λ_c, in which case the generalization error exhibits a power-law decay with the number of examples seen by the student, with an exponent that depends in a non-universal manner on the parameter λ. We find these features to be robust against a wide spectrum of modifications of microscopic modelling details. Two illustrative applications are also provided: a robot learning to navigate a field containing obstacles, and the problem of identifying a specific component in a collection of stimuli.
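The two-phase scheme described in the abstract can be illustrated with a small student-teacher perceptron simulation. This is only a sketch under stated assumptions: the abstract does not give the update rules, so the specific forms used here (a Hebbian step driven by the student's own output, followed by an unlearning step that subtracts the whole batch's Hebbian sum scaled by the batch error rate) are illustrative choices, and all names (`two_phase_round`, `a_hebb`, `a_unlearn`, `simulate`) are hypothetical. The ratio lambda of the two learning rates corresponds to `a_hebb / a_unlearn`.

```python
import math
import random


def sign(x):
    return 1.0 if x >= 0 else -1.0


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def generalization_error(student, teacher):
    """Angle between the weight vectors divided by pi.

    For isotropic (e.g. Gaussian) inputs this is the probability that two
    perceptrons disagree; for +/-1 inputs it is a large-N approximation.
    """
    norm = math.sqrt(dot(student, student) * dot(teacher, teacher))
    overlap = max(-1.0, min(1.0, dot(student, teacher) / norm))
    return math.acos(overlap) / math.pi


def two_phase_round(student, teacher, batch, a_hebb, a_unlearn):
    """One round of the assumed two-phase rule; updates `student` in place."""
    n = len(student)
    scale = 1.0 / math.sqrt(n)
    hebb_sum = [0.0] * n  # sum of s * xi over the batch
    errors = 0
    # Phase 1: Hebbian learning on the student's own output (no teacher label used).
    for xi in batch:
        s = sign(dot(student, xi))
        if s != sign(dot(teacher, xi)):
            errors += 1  # only counted; the student never sees which pattern failed
        for i in range(n):
            student[i] += a_hebb * scale * s * xi[i]
            hebb_sum[i] += s * xi[i]
    # Phase 2: graded, indiscriminate unlearning driven by the global error rate.
    e = errors / len(batch)
    for i in range(n):
        student[i] -= a_unlearn * scale * e * hebb_sum[i]
    return e


def simulate(n=50, batch_size=10, rounds=200, a_hebb=1.0, a_unlearn=2.0, seed=0):
    """Run several rounds on random +/-1 patterns and return the final error."""
    rng = random.Random(seed)
    teacher = [rng.gauss(0.0, 1.0) for _ in range(n)]
    student = [rng.gauss(0.0, 1.0) for _ in range(n)]
    for _ in range(rounds):
        batch = [[rng.choice((-1.0, 1.0)) for _ in range(n)]
                 for _ in range(batch_size)]
        two_phase_round(student, teacher, batch, a_hebb, a_unlearn)
    return generalization_error(student, teacher)


if __name__ == "__main__":
    for a_unlearn in (0.5, 2.0, 8.0):
        err = simulate(a_unlearn=a_unlearn)
        print(f"lambda = {1.0 / a_unlearn:.3f}  generalization error = {err:.3f}")
```

Varying `a_unlearn` at fixed `a_hebb` changes the ratio lambda, which is the quantity the abstract identifies as controlling the success and speed of learning. The sketch makes no claim about where the critical value lies for this particular choice of update rule.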
Pages: 99-112
Page count: 14
Related papers (50 in total)
  • [1] Learning with incomplete information and the mathematical structure behind it
    Reimer Kühn
    Ion-Olimpiu Stamatescu
    Biological Cybernetics, 2007, 97 : 99 - 112
  • [2] Strategic learning in games with incomplete information
    Wang, MH
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 2047 - 2052
  • [3] Learning with incomplete information in the committee machine
    Bergmann, Urs M.
    Kuehn, Reimer
    Stamatescu, Ion-Olimpiu
    BIOLOGICAL CYBERNETICS, 2009, 101 (5-6) : 401 - 410
  • [4] NETWORK EVOLUTION WITH INCOMPLETE INFORMATION AND LEARNING
    Xu, Jie
    Zhang, Simpson
    van der Schaar, Mihaela
    2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 1163 - 1168
  • [5] Learning with incomplete information in the committee machine
    Urs M. Bergmann
    Reimer Kühn
    Ion-Olimpiu Stamatescu
    Biological Cybernetics, 2009, 101 : 401 - 410
  • [6] Sequential Interdiction with Incomplete Information and Learning
    Borrero, Juan S.
    Prokopyev, Oleg A.
    Saure, Denis
    OPERATIONS RESEARCH, 2019, 67 (01) : 72 - 89
  • [7] Learning in Network Games with Incomplete Information
    Eksin, Ceyhun
    Molavi, Pooya
    Ribeiro, Alejandro
    Jadbabaie, Ali
    IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (03) : 30 - 42
  • [8] Incomplete Information: Structure, Inference, Complexity
    Jouni Järvinen
    Studia Logica, 2006, 84 (3) : 469 - 475
  • [9] Iterative Learning Control With Incomplete Information: A Survey
    Dong Shen
    IEEE/CAA Journal of Automatica Sinica, 2018, 5 (05) : 885 - 901
  • [10] Learning from Multimedia Data with Incomplete Information
    Tao, Renshuai
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4921 - 4922