Fuzzy inference system learning by reinforcement methods

被引:229
|
作者
Jouffe, L [1 ]
机构
[1] Inst Natl Sci Appl, Dept Comp Sci, IRISA, SODALEC Elect, F-35043 Rennes, France
关键词
Dynamic Programming (DP); fuzzy logic; learning; Markovian Decision Problem (MDP); reinforcement;
D O I
10.1109/5326.704563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy Actor-Critic Learning (FACL) and Fuzzy Q-Learning (FQL) are reinforcement learning methods based on Dynamic Programming (DP) principles. In this paper, they are used to tune online the conclusion part of Fuzzy Inference Systems (FIS), The only information available for learning is the system feedback, which describes in terms of reward and punishment the task the fuzzy agent has to realize. At each time step, the agent receives a reinforcement signal according to the last action it has performed in the previous state. The problem involves optimizing not only the direct reinforcement, but also the total amount of reinforcements the agent can receive in the future. To illustrate the use of these two learning methods, we first applied them to a problem that involves finding a fuzzy controller to drive a boat from one bank to another, across a river with a strong nonlinear current. Then, we used the well-known Cart-Pole Balancing and Mountain-Car problems to be able to compare our methods to other reinforcement learning methods and focus on important characteristic aspects of FACL and FQL, We found that the genericity of our methods allows us to learn every kind of reinforcement learning problem (continuous states, discrete/continuous actions, various type of reinforcement functions). The experimental studies also show the superiority of these methods with respect to the other related methods we can find in the literature.
引用
收藏
页码:338 / 355
页数:18
相关论文
共 50 条
  • [41] An Adaptive Learning Method for the Generation of Fuzzy Inference System from Data
    ZHANG Li-Quan~(1
    自动化学报, 2008, (01) : 80 - 84
  • [42] Performance Evaluation of Learning Styles Based on Fuzzy Logic Inference System
    Ozdemir, Ali
    Alaybeyoglu, Aysegul
    Mulayim, Naciye
    Balbal, Kadriye Filiz
    COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2016, 24 (06) : 853 - 865
  • [43] A learning-automaton-based method for fuzzy inference system identification
    Chtourou, M
    BenJemaa, M
    Ketata, R
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1997, 28 (09) : 889 - 896
  • [44] Constraint learning using adaptive neural-fuzzy inference system
    Yazdi, Hadi Sadoghi
    Pourreza, Reza
    Yazdi, Mehri Sadoghi
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2010, 3 (02) : 257 - 278
  • [45] Additive Fuzzy Functional Inference Methods
    Seki, Hirosato
    Mizumoto, Masaharu
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [46] Energy-Efficient Gait Optimization of Snake-Like Modular Robots by Using Multiobjective Reinforcement Learning and a Fuzzy Inference System
    Singh, Akash
    Chiu, Wei-Yu
    Manoharan, Shri Harish
    Romanov, Alexey M.
    IEEE ACCESS, 2022, 10 : 86624 - 86635
  • [47] Energy-Efficient Gait Optimization of Snake-Like Modular Robots by Using Multiobjective Reinforcement Learning and a Fuzzy Inference System
    Singh, Akash
    Chiu, Wei-Yu
    Manoharan, Shri Harish
    Romanov, Alexey M.
    IEEE Access, 2022, 10 : 86624 - 86635
  • [48] Fractional Fuzzy Inference System: The New Generation of Fuzzy Inference Systems
    Mazandarani, Mehran
    Li, Xiu
    IEEE ACCESS, 2020, 8 : 126066 - 126082
  • [49] Designing a Fuzzy Q-Learning Power Energy System Using Reinforcement Learning
    J A.
    Konduru S.
    Kura V.
    NagaJyothi G.
    Dudi B.P.
    Mani Naidu S.
    International Journal of Fuzzy System Applications, 2022, 11 (03)
  • [50] Collaborative Fuzzy Rule Learning for Mamdani Type Fuzzy Inference System with Mapping of Cluster Centers
    Prasad, M.
    Chou, K. P.
    Saxena, A.
    Kawrtiya, O. P.
    Li, D. L.
    Lin, C. T.
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN CONTROL AND AUTOMATION (CICA), 2014, : 15 - 20