DISCRETIZED ESTIMATOR LEARNING AUTOMATA

被引:57
|
作者
LANCTOT, JK [1 ]
OOMMEN, BJ [1 ]
机构
[1] MITEL CORP,KANATA K2K 1X3,ON,CANADA
来源
基金
加拿大自然科学与工程研究理事会;
关键词
721.1 Computer Theory; Includes Computational Logic; Automata Theory; Switching Theory; Programming Theory - 723.1 Computer Programming - 723.4 Artificial Intelligence - 921.6 Numerical Methods;
D O I
10.1109/21.199471
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Learning automata are stochastic automata interacting with an unknown random environment. The fundamental problem is that of learning, through feedback, the action that has the highest probability of being rewarded by the environment. A class of algorithms known as estimator algorithms are presently among the fastest known. They are characterized by the use of running estimates of the probabilities of each possible action being rewarded. The improvements gained by rendering the various estimator algorithms discrete are investigated. This is done by restricting the probability of selecting an action to a finite, and hence, discrete subset of [0,1]. This modification is proven to be epsilon-optimal in all stationary environments. In the paper, various discretized estimator algorithms (DEA's) are constructed. Subsequently, members of the family of DEA's will be shown to be epsilon-optimal by deriving two sufficient conditions required for the epsilon-optimality-the properties of monotonicity and moderation. There is a conjecture presented about the necessity of these conditions for epsilon-optimality too. Experimental results indicate that the discrete modifications improve the performance of these algorithms to the extent that these automata constitute the fastest converging and most accurate learning automata reported to date.
引用
收藏
页码:1473 / 1483
页数:11
相关论文
共 50 条
  • [1] DISCRETIZED PURSUIT LEARNING AUTOMATA
    OOMMEN, BJ
    LANCTOT, JK
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1990, 20 (04): : 931 - 938
  • [2] ERGODIC DISCRETIZED ESTIMATOR LEARNING AUTOMATA WITH HIGH-ACCURACY AND HIGH ADAPTATION RATE FOR NONSTATIONARY ENVIRONMENTS
    VASILAKOS, AV
    PAPADIMITRIOU, GI
    NEUROCOMPUTING, 1992, 4 (3-4) : 181 - 196
  • [3] Inertial Estimator Learning Automata
    Zhang, Junqi
    Ni, Lina
    Xie, Chen
    Gao, Shangce
    Tang, Zheng
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (06) : 1041 - 1048
  • [4] IMPROVED ESTIMATOR FOR A DISCRETIZED LEARNING ROUTING ALGORITHM
    WAN, TC
    DOULIGERIS, C
    ELECTRONICS LETTERS, 1994, 30 (02) : 108 - 110
  • [5] EPSILON-OPTIMAL DISCRETIZED PURSUIT LEARNING AUTOMATA
    OOMMEN, BJ
    LANCTOT, JK
    1989 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-3: CONFERENCE PROCEEDINGS, 1989, : 6 - 12
  • [6] Virtual circuit routing algorithms using discretized estimator learning automata for high-speed packet-switched networks
    Vasilakos, A.V.
    Paximadis, C.T.
    Papadimitriou, G.I.
    Proceedings of the IFIP TC6/WG6.4 International Conference on High Speed Networking, 1991,
  • [7] A novel estimator based learning automata algorithm
    Ge, Hao
    Jiang, Wen
    Li, Shenghong
    Li, Jianhua
    Wang, Yifan
    Jing, Yuchun
    APPLIED INTELLIGENCE, 2015, 42 (02) : 262 - 275
  • [8] A novel estimator based learning automata algorithm
    Hao Ge
    Wen Jiang
    Shenghong Li
    Jianhua Li
    Yifan Wang
    Yuchun Jing
    Applied Intelligence, 2015, 42 : 262 - 275
  • [9] Fast and Epsilon-Optimal Discretized Pursuit Learning Automata
    Zhang, JunQi
    Wang, Cheng
    Zhou, MengChu
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (10) : 2089 - 2099
  • [10] THE ASYMPTOTIC OPTIMALITY OF DISCRETIZED LINEAR REWARD INACTION LEARNING AUTOMATA
    OOMMEN, BJ
    HANSEN, E
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1984, 14 (03): : 542 - 545