Learning with incomplete information and the mathematical structure behind it

被引：0

作者：

Kuehn, Reimer

Stamatescu, Ion-Olimpiu ^{[1
]}

机构：

[1] Kings Coll London, Dept Math, London WC2R 2LS, England

[2] Univ Heidelberg, FESt, D-6900 Heidelberg, Germany

[3] Univ Heidelberg, Inst Theoret Phys, D-6900 Heidelberg, Germany

来源：

BIOLOGICAL CYBERNETICS | 2007年 / 97卷 / 01期

关键词：

D O I：

10.1007/s00422-007-0162-4

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

We investigate the problem of learning with incomplete information as exemplified by learning with delayed reinforcement. We study a two phase learning scenario in which a phase of Hebbian associative learning based on momentary internal representations is supplemented by an 'unlearning' phase depending on a graded reinforcement signal. The reinforcement signal quantifies the success-rate globally for a number of learning steps in phase one, and 'unlearning' is indiscriminate with respect to associations learnt in that phase. Learning according to this model is studied via simulations and analytically within a student-teacher scenario for both single layer networks and, for a committee machine. Success and speed of learning depend on the ratio lambda of the learning rates used for the associative Hebbian learning phase and for the unlearning-correction in response to the reinforcement signal, respectively. Asymptotically perfect generalization is possible only, if this ratio exceeds a critical value lambda (c) , in which case the generalization error exhibits a power law decay with the number of examples seen by the student, with an exponent that depends in a non-universal manner on the parameter lambda. We find these features to be robust against a wide spectrum of modifications of microscopic modelling details. Two illustrative applications-one of a robot learning to navigate a field containing obstacles, and the problem of identifying a specific component in a collection of stimuli-are also provided.

引用

页码：99 / 112

页数：14

共 50 条

[41] Learning the state of nature in repeated games with incomplete information and signals
Renault, J
Tomala, T
GAMES AND ECONOMIC BEHAVIOR, 2004, 47 (01) : 124 - 156
[42] LEARNING ALGORITHMS FOR REPEATED BIMATRIX NASH GAMES WITH INCOMPLETE INFORMATION
PAPAVASSILOPOULOS, GP
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1989, 62 (03) : 467 - 488
[43] Brave induction: a logical framework for learning from incomplete information
Chiaki Sakama
Katsumi Inoue
Machine Learning, 2009, 76 : 3 - 35
[44] Incomplete label distribution learning based on supervised neighborhood information
Xue-Qiang Zeng
Su-Fen Chen
Run Xiang
Guo-Zheng Li
Xue-Feng Fu
International Journal of Machine Learning and Cybernetics, 2020, 11 : 111 - 121
[45] A Fuzzy Learning Anti-Jamming Approach With Incomplete Information
Zhang, Yunpeng
Jia, Luliang
IEEE COMMUNICATIONS LETTERS, 2024, 28 (07) : 1514 - 1518
[46] Incomplete label distribution learning based on supervised neighborhood information
Zeng, Xue-Qiang
Chen, Su-Fen
Xiang, Run
Li, Guo-Zheng
Fu, Xue-Feng
International Journal of Machine Learning and Cybernetics, 2020, 11 (01): : 111 - 121
[47] A learning-based model of repeated games with incomplete information
Chong, Juin-Kuan
Camerer, Colin F.
Ho, Teck H.
GAMES AND ECONOMIC BEHAVIOR, 2006, 55 (02) : 340 - 371
[48] LEARNING-PROCESSES BASED ON INCOMPLETE IDENTIFICATION AND INFORMATION GENERATION
GUNJI, YP
SHINOHARA, S
KONNO, N
APPLIED MATHEMATICS AND COMPUTATION, 1993, 55 (2-3) : 219 - 253
[49] Incomplete label distribution learning based on supervised neighborhood information
Zeng, Xue-Qiang
Chen, Su-Fen
Xiang, Run
Li, Guo-Zheng
Fu, Xue-Feng
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (01) : 111 - 121
[50] Repeated Stackelberg security games: Learning with incomplete state information
Alcantara-Jimenez, Guillermo
Clempner, Julio B.
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 195

← 1 2 3 4 5 →