DISCRETIZED ESTIMATOR LEARNING AUTOMATA

被引：57

作者：

LANCTOT, JK ^{[1
]}

OOMMEN, BJ ^{[1
]}

机构：

[1] MITEL CORP,KANATA K2K 1X3,ON,CANADA

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS | 1992年 / 22卷 / 06期

基金：

加拿大自然科学与工程研究理事会;

关键词：

721.1 Computer Theory; Includes Computational Logic; Automata Theory; Switching Theory; Programming Theory - 723.1 Computer Programming - 723.4 Artificial Intelligence - 921.6 Numerical Methods;

D O I：

10.1109/21.199471

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning automata are stochastic automata interacting with an unknown random environment. The fundamental problem is that of learning, through feedback, the action that has the highest probability of being rewarded by the environment. A class of algorithms known as estimator algorithms are presently among the fastest known. They are characterized by the use of running estimates of the probabilities of each possible action being rewarded. The improvements gained by rendering the various estimator algorithms discrete are investigated. This is done by restricting the probability of selecting an action to a finite, and hence, discrete subset of [0,1]. This modification is proven to be epsilon-optimal in all stationary environments. In the paper, various discretized estimator algorithms (DEA's) are constructed. Subsequently, members of the family of DEA's will be shown to be epsilon-optimal by deriving two sufficient conditions required for the epsilon-optimality-the properties of monotonicity and moderation. There is a conjecture presented about the necessity of these conditions for epsilon-optimality too. Experimental results indicate that the discrete modifications improve the performance of these algorithms to the extent that these automata constitute the fastest converging and most accurate learning automata reported to date.

引用

页码：1473 / 1483

页数：11

共 50 条

[1] DISCRETIZED PURSUIT LEARNING AUTOMATA
OOMMEN, BJ
LANCTOT, JK
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1990, 20 (04): : 931 - 938
[2] ERGODIC DISCRETIZED ESTIMATOR LEARNING AUTOMATA WITH HIGH-ACCURACY AND HIGH ADAPTATION RATE FOR NONSTATIONARY ENVIRONMENTS
VASILAKOS, AV
PAPADIMITRIOU, GI
NEUROCOMPUTING, 1992, 4 (3-4) : 181 - 196
[3] Inertial Estimator Learning Automata
Zhang, Junqi
Ni, Lina
Xie, Chen
Gao, Shangce
Tang, Zheng
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (06) : 1041 - 1048
[4] IMPROVED ESTIMATOR FOR A DISCRETIZED LEARNING ROUTING ALGORITHM
WAN, TC
DOULIGERIS, C
ELECTRONICS LETTERS, 1994, 30 (02) : 108 - 110
[5] EPSILON-OPTIMAL DISCRETIZED PURSUIT LEARNING AUTOMATA
OOMMEN, BJ
LANCTOT, JK
1989 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-3: CONFERENCE PROCEEDINGS, 1989, : 6 - 12
[6] Virtual circuit routing algorithms using discretized estimator learning automata for high-speed packet-switched networks
Vasilakos, A.V.
Paximadis, C.T.
Papadimitriou, G.I.
Proceedings of the IFIP TC6/WG6.4 International Conference on High Speed Networking, 1991,
[7] A novel estimator based learning automata algorithm
Ge, Hao
Jiang, Wen
Li, Shenghong
Li, Jianhua
Wang, Yifan
Jing, Yuchun
APPLIED INTELLIGENCE, 2015, 42 (02) : 262 - 275
[8] A novel estimator based learning automata algorithm
Hao Ge
Wen Jiang
Shenghong Li
Jianhua Li
Yifan Wang
Yuchun Jing
Applied Intelligence, 2015, 42 : 262 - 275
[9] Fast and Epsilon-Optimal Discretized Pursuit Learning Automata
Zhang, JunQi
Wang, Cheng
Zhou, MengChu
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (10) : 2089 - 2099
[10] THE ASYMPTOTIC OPTIMALITY OF DISCRETIZED LINEAR REWARD INACTION LEARNING AUTOMATA
OOMMEN, BJ
HANSEN, E
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1984, 14 (03): : 542 - 545

← 1 2 3 4 5 →