A reinforcement learning algorithm with fuzzy approximation for semi Markov decision problems

被引：3

作者：

Kula, Ufuk ^{[1
]}

Ocaktan, Beyazit ^{[2
]}

机构：

[1] Sakarya Univ, Dept Ind Engn, TR-54187 Sakarya, Turkey

[2] Balikesir Univ, Dept Ind Engn, Balikesir, Turkey

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2015年 / 28卷 / 04期

关键词：

Fuzzy approximation; ANFIS; reinforcement learning; SMDPs; ANFIS;

D O I：

10.3233/IFS-141460

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Real life stochastic problems are generally large-scale, difficult to model, and therefore, suffer from the curses of dimensionality. Such problems cannot be solved by classical optimization methods. This paper presents a reinforcement learning algorithm using a fuzzy inference system, ANFIS to find an approximate solution for semi Markov decision problems (SMDPs). The performance of the developed algorithm is measured and compared to a classical reinforcement algorithm, SMART in a numerical example. Our numerical examples show that the developed algorithm converges significantly faster as the problem size increases and the average cost calculated by the algorithm gets closer to that of SMART as number of epochs used in the developed algorithm is increased.

引用

页码：1733 / 1744

页数：12

共 50 条

[21] A reinforcement learning method based on an immune network adapted to a semi-Markov decision process
Kogawa N.
Obayashi M.
Kobayashi K.
Kuremoto T.
Artificial Life and Robotics, 2009, 13 (2) : 538 - 542
[22] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes
Lakshmanan, K.
Bhatnagar, Shalabh
2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405
[23] Semi-Markov Offline Reinforcement Learning for Healthcare
Fatemi, Mehdi
Wu, Mary
Petch, Jeremy
Nelson, Walter
Connolly, Stuart J.
Benz, Alexander
Carnicelli, Anthony
Ghassemi, Marzyeh
CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 119 - 137
[24] A NEW REINFORCEMENT LEARNING ALGORITHM WITH FIXED EXPLORATION FOR SEMI-MARKOV CONTROL IN PREVENTIVE MAINTENANCE
Encapera, Angelo
Gosavi, Abhijit
PROCEEDINGS OF THE ASME 12TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE - 2017, VOL 3, 2017,
[25] Multi-Task Approach to Reinforcement Learning for Factored-State Markov Decision Problems
Simm, Jaak
Sugiyama, Masashi
Hachiya, Hirotaka
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (10) : 2426 - 2437
[26] A sensitivity view of Markov decision processes and reinforcement learning
Cao, XR
MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
[27] A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action
Watanabe, Takashi
Sakuragawa, Takashi
ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 51 - 55
[28] Continuous-state reinforcement learning with fuzzy approximation
Busoniu, Lucian
Ernst, Damien
De Schutter, Bart
Babuska, Robert
ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS, 2008, 4865 : 27 - +
[29] A multi-agent reinforcement learning algorithm with fuzzy approximation for Distributed Stochastic Unit Commitment
Ghorbani, Farzaneh
Afsharchi, Mohsen
Derhami, Vali
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (05) : 6613 - 6628
[30] Multivariate Decision Tree Function Approximation for Reinforcement Learning
Saghezchi, Hossein Bashashati
Asadpour, Masoud
NEURAL INFORMATION PROCESSING: THEORY AND ALGORITHMS, PT I, 2010, 6443 : 687 - 694

← 1 2 3 4 5 →