A new prospective for Learning Automata: A machine learning approach

被引:8
|
作者
Jiang, Wen [1 ]
Li, Bin [2 ]
Li, Shenghong [1 ]
Tang, Yuanyan [3 ]
Chen, Chun Lung Philip [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[3] Univ Macau, Fac Sci & Technol, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
Learning Automata; epsilon-Optimal; Bayesian estimator; Maximum Likelihood Estimator; MULTITEACHER ENVIRONMENT; ALGORITHM; OPTIMALITY; SCHEMES;
D O I
10.1016/j.neucom.2015.04.125
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of Learning Automata (LA), how to design faster learning algorithms has always been a key issue. Among solutions reported in the literature, the stochastic estimator reward-inaction learning automaton (SERI), which belongs to the Maximum Likelihood estimator based LAs, has been recognized as the fastest epsilon-optimal LA. In this paper, we first point out the limitations of the traditional Maximum Likelihood Estimator (MLE) based LAs and then introduce Bayesian estimator based approach, which is demonstrated to be equivalent to Laplace smoothing of the traditional method, to overcome these limitations. The key idea is that the Bayesian estimator, which estimates the probability of selecting each action in the LA, aims to reconstruct Bernoulli distribution from sequential data, and is formalized based on exponential conjugate family so that the LA has a relatively simple format for easy implementation. In addition, we also indicate that this Bayesian estimator could be applied to update almost all existing MLE estimator based LAs. Based on the proposed Bayesian estimator, a new LA, known as Generalized Bayesian Stochastic Estimator (GBSE) LA, is presented and proved to be epsilon-optimal. Finally, extensive experimental results on benchmarks demonstrate that our proposed learning scheme is more efficient than the current best LA SERI. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:319 / 325
页数:7
相关论文
共 50 条
  • [1] A machine learning approach to synchronization of automata
    Podolak, Igor
    Roman, Adam
    Szykula, Marek
    Zielinski, Bartosz
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 97 : 357 - 371
  • [2] Automata, a powerful approach of machine learning
    Lu, Xiyan
    Zhang, Runtong
    [J]. Proceedings of 2006 International Conference on Artificial Intelligence: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 707 - 712
  • [3] A New Approach for Active Automata Learning Based on Apartness
    Vaandrager, Frits
    Garhewal, Bharat
    Rot, Jurriaan
    Wissmann, Thorsten
    [J]. TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, TACAS 2022, PT I, 2022, 13243 : 223 - 243
  • [4] A NEW APPROACH TO THE DESIGN OF REINFORCEMENT SCHEMES FOR LEARNING AUTOMATA
    THATHACHAR, MAL
    SASTRY, PS
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (01): : 168 - 175
  • [5] Machine-learning with Cellular Automata
    Povalej, P
    Kokol, P
    Druzovec, TW
    Stiglic, B
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VI, PROCEEDINGS, 2005, 3646 : 305 - 315
  • [6] Towards Machine Learning on the Automata Processor
    Tracy, Tommy, II
    Fu, Yao
    Roy, Indranil
    Jonas, Eric
    Glendenning, Paul
    [J]. HIGH PERFORMANCE COMPUTING, 2016, 9697 : 200 - 218
  • [7] Automata Learning: An Algebraic Approach
    Urbat, Henning
    Schroder, Lutz
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE (LICS 2020), 2020, : 900 - 914
  • [8] Mobile App Fingerprinting through Automata Learning and Machine Learning
    Marzani, Fatemeh
    Ghassemi, Fatemeh
    Sabahi-Kaviani, Zeynab
    van Ede, Thijs
    van Steen, Maarten
    [J]. 2023 IFIP NETWORKING CONFERENCE, IFIP NETWORKING, 2023,
  • [9] A NEW APPROACH TO THE DESIGN OF REINFORCEMENT SCHEMES FOR LEARNING AUTOMATA - STOCHASTIC ESTIMATOR LEARNING ALGORITHMS
    PAPADIMITRIOU, GI
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1994, 6 (04) : 649 - 654
  • [10] A NEW APPROACH TO THE DESIGN OF REINFORCEMENT SCHEMES FOR LEARNING AUTOMATA - STOCHASTIC ESTIMATOR LEARNING ALGORITHM
    VASILAKOS, AV
    PAPADIMITRIOU, GI
    [J]. NEUROCOMPUTING, 1995, 7 (03) : 275 - 297