A reinforcement learning algorithm with fuzzy approximation for semi Markov decision problems

Cited by: 3
Authors
Kula, Ufuk [1]
Ocaktan, Beyazit [2]
Affiliations
[1] Sakarya Univ, Dept Ind Engn, TR-54187 Sakarya, Turkey
[2] Balikesir Univ, Dept Ind Engn, Balikesir, Turkey
Keywords
Fuzzy approximation; ANFIS; reinforcement learning; SMDPs
DOI
10.3233/IFS-141460
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Real-life stochastic problems are generally large-scale and difficult to model, and therefore suffer from the curse of dimensionality. Such problems cannot be solved by classical optimization methods. This paper presents a reinforcement learning algorithm that uses a fuzzy inference system, ANFIS, to find an approximate solution for semi-Markov decision problems (SMDPs). The performance of the developed algorithm is measured and compared with that of a classical reinforcement learning algorithm, SMART, in a numerical example. The numerical examples show that the developed algorithm converges significantly faster as the problem size increases, and that the average cost calculated by the algorithm gets closer to that of SMART as the number of epochs used in the developed algorithm is increased.
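The abstract names SMART as the baseline but this record does not give its update rule. As a rough illustration only, the sketch below assumes the commonly cited tabular form of an average-cost SMDP update: the action value is moved toward (cost, minus the current average-cost estimate times the sojourn time, plus the best value in the next state), and the average-cost estimate is refreshed only on greedy steps. The two-state model `TOY_MODEL`, the step-size schedule, and the exploration rate are hypothetical and are not taken from the paper; the ANFIS approximation step is not reproduced here.

```python
import random

# Hypothetical toy SMDP (not from the paper): 2 states, 2 actions.
# Each entry: (probability of moving to state 1, cost, mean sojourn time).
TOY_MODEL = {
    (0, 0): (0.3, 1.0, 1.0),
    (0, 1): (0.8, 4.0, 2.0),
    (1, 0): (0.6, 6.0, 1.5),
    (1, 1): (0.1, 9.0, 3.0),
}
ACTIONS = (0, 1)


def step(state, action, rng):
    """Sample next state, immediate cost, and an exponential sojourn time."""
    p_to_1, cost, mean_time = TOY_MODEL[(state, action)]
    next_state = 1 if rng.random() < p_to_1 else 0
    sojourn = rng.expovariate(1.0 / mean_time)
    return next_state, cost, sojourn


def smart_sketch(num_steps=5000, seed=0, explore=0.1):
    """Tabular average-cost update in the spirit of SMART (assumed form)."""
    rng = random.Random(seed)
    q = {key: 0.0 for key in TOY_MODEL}
    rho = 0.0          # running estimate of average cost per unit time
    total_cost = 0.0   # accumulated over greedy steps only
    total_time = 0.0
    state = 0
    for n in range(1, num_steps + 1):
        alpha = 1.0 / (1.0 + n / 100.0)  # assumed decaying step size
        greedy = min(ACTIONS, key=lambda a: q[(state, a)])
        exploring = rng.random() < explore
        action = rng.choice(ACTIONS) if exploring else greedy
        next_state, cost, sojourn = step(state, action, rng)
        best_next = min(q[(next_state, b)] for b in ACTIONS)
        target = cost - rho * sojourn + best_next
        q[(state, action)] += alpha * (target - q[(state, action)])
        if not exploring:
            total_cost += cost
            total_time += sojourn
            if total_time > 0.0:
                rho = total_cost / total_time
        state = next_state
    return q, rho


if __name__ == "__main__":
    q_values, avg_cost = smart_sketch()
    print("estimated average cost per unit time:", avg_cost)
    for key in sorted(q_values):
        print(key, q_values[key])
```

A table like `q` grows with the number of state-action pairs, which is the scaling problem the abstract attributes to large problems; a function approximator such as ANFIS would stand in for the table.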
Pages: 1733-1744
Number of pages: 12
Related papers
50 in total
  • [21] A reinforcement learning method based on an immune network adapted to a semi-Markov decision process
    Kogawa N.
    Obayashi M.
    Kobayashi K.
    Kuremoto T.
    Artificial Life and Robotics, 2009, 13 (2) : 538 - 542
  • [22] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes
    Lakshmanan, K.
    Bhatnagar, Shalabh
    2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405
  • [23] Semi-Markov Offline Reinforcement Learning for Healthcare
    Fatemi, Mehdi
    Wu, Mary
    Petch, Jeremy
    Nelson, Walter
    Connolly, Stuart J.
    Benz, Alexander
    Carnicelli, Anthony
    Ghassemi, Marzyeh
    CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 119 - 137
  • [24] A NEW REINFORCEMENT LEARNING ALGORITHM WITH FIXED EXPLORATION FOR SEMI-MARKOV CONTROL IN PREVENTIVE MAINTENANCE
    Encapera, Angelo
    Gosavi, Abhijit
    PROCEEDINGS OF THE ASME 12TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE - 2017, VOL 3, 2017,
  • [25] Multi-Task Approach to Reinforcement Learning for Factored-State Markov Decision Problems
    Simm, Jaak
    Sugiyama, Masashi
    Hachiya, Hirotaka
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (10) : 2426 - 2437
  • [26] A sensitivity view of Markov decision processes and reinforcement learning
    Cao, XR
    MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
  • [27] A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action
    Watanabe, Takashi
    Sakuragawa, Takashi
    ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 51 - 55
  • [28] Continuous-state reinforcement learning with fuzzy approximation
    Busoniu, Lucian
    Ernst, Damien
    De Schutter, Bart
    Babuska, Robert
    ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS, 2008, 4865 : 27 - +
  • [29] A multi-agent reinforcement learning algorithm with fuzzy approximation for Distributed Stochastic Unit Commitment
    Ghorbani, Farzaneh
    Afsharchi, Mohsen
    Derhami, Vali
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (05) : 6613 - 6628
  • [30] Multivariate Decision Tree Function Approximation for Reinforcement Learning
    Saghezchi, Hossein Bashashati
    Asadpour, Masoud
    NEURAL INFORMATION PROCESSING: THEORY AND ALGORITHMS, PT I, 2010, 6443 : 687 - 694