Fuzzy inference system learning by reinforcement methods

被引:229
|
作者
Jouffe, L [1 ]
机构
[1] Inst Natl Sci Appl, Dept Comp Sci, IRISA, SODALEC Elect, F-35043 Rennes, France
关键词
Dynamic Programming (DP); fuzzy logic; learning; Markovian Decision Problem (MDP); reinforcement;
D O I
10.1109/5326.704563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy Actor-Critic Learning (FACL) and Fuzzy Q-Learning (FQL) are reinforcement learning methods based on Dynamic Programming (DP) principles. In this paper, they are used to tune online the conclusion part of Fuzzy Inference Systems (FIS), The only information available for learning is the system feedback, which describes in terms of reward and punishment the task the fuzzy agent has to realize. At each time step, the agent receives a reinforcement signal according to the last action it has performed in the previous state. The problem involves optimizing not only the direct reinforcement, but also the total amount of reinforcements the agent can receive in the future. To illustrate the use of these two learning methods, we first applied them to a problem that involves finding a fuzzy controller to drive a boat from one bank to another, across a river with a strong nonlinear current. Then, we used the well-known Cart-Pole Balancing and Mountain-Car problems to be able to compare our methods to other reinforcement learning methods and focus on important characteristic aspects of FACL and FQL, We found that the genericity of our methods allows us to learn every kind of reinforcement learning problem (continuous states, discrete/continuous actions, various type of reinforcement functions). The experimental studies also show the superiority of these methods with respect to the other related methods we can find in the literature.
引用
收藏
页码:338 / 355
页数:18
相关论文
共 50 条
  • [1] Application of Fuzzy Inference System to Average Reward Reinforcement Learning
    Chen, Wei
    Zhai, Zhenkun
    Li, Xiong
    Guo, Jing
    2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 374 - 377
  • [2] An advanced robust integral reinforcement learning scheme with the fuzzy inference system
    Liu, Ao
    Wang, Ding
    Qiao, Junfei
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, : 11745 - 11759
  • [3] Genetic reinforcement learning of Fuzzy Inference System application to mobile robotic
    Nemra, Abdelkrim
    Rezine, Hacene
    Souici, Abdelkrim
    ICINCO 2007: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL ICSO: INTELLIGENT CONTROL SYSTEMS AND OPTIMIZATION, 2007, : 206 - 213
  • [4] An Investigation of Methods of Parameter Tuning For Q-Learning Fuzzy Inference System
    Al-Talabi, Ahmad A.
    Schwartz, Howard M.
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 2594 - 2601
  • [5] Reinforcement learning in the fuzzy classifier system
    Valenzuela-Rendon, M
    EXPERT SYSTEMS WITH APPLICATIONS, 1998, 14 (1-2) : 237 - 247
  • [6] Olfactory-Based Navigation via Model-Based Reinforcement Learning and Fuzzy Inference Methods
    Wang, Lingxiao
    Pang, Shuo
    Li, Jinlong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (10) : 3014 - 3027
  • [7] Online probabilistic learning for fuzzy inference system
    Oentaryo, Richard J.
    Er, Meng Joo
    Linn, San
    Li, Xiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (11) : 5082 - 5096
  • [8] Novel reinforcement learning approach for automatic generation of fuzzy inference systems
    Er, Meng Joo
    Zhou, Yi
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 100 - +
  • [9] A parallel fuzzy inference model with distributed prediction scheme for reinforcement learning
    Kuo, YH
    Hsu, JP
    Wang, CW
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (02): : 160 - 172
  • [10] Fuzzy Reinforcement Learning for System of Systems (SOS)
    Berenji, Hamid
    Jamshidi, Mo
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1689 - 1694