Efficient implementation of dynamic fuzzy Q-learning

被引:0
|
作者
Deng, C [1 ]
Er, MJ [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a Dynamic Fuzzy Q-Learning (DFQL) method that is capable of tuning the Fuzzy Inference Systems (FIS) online. On-line self-organizing learning is developed so that structure and parameters identification are accomplished automatically and simultaneously. Self-organizing fuzzy inference is introduced to calculate actions and Q-functions so as to enable us to deal with continuous-valued states and actions. We provide the conditions of the convergence of the algorithm. Furthermore, the learning methods based on bias component and eligibility traces for rapid reinforcement learning are discussed.
引用
收藏
页码:1854 / 1858
页数:5
相关论文
共 50 条
  • [31] Fuzzy Q-learning with the modified fuzzy ART neural network
    Ueda, H
    Hanada, N
    Kimoto, H
    Naraki, T
    Takahashi, K
    Miyahara, T
    [J]. 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 308 - 315
  • [32] Design of Dynamic Fuzzy Q-Learning Controller for Networked Wind Energy Conversion Systems
    Wanigasekara, Chathura
    Swain, Akshya
    Almakhles, Dhafer
    Zhou, Lv
    [J]. 2020 20TH IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2020 4TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2020,
  • [33] Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA
    Da Silva, Lucileide M. D.
    Torquato, Matheus F.
    Fernandes, Marcelo A. C.
    [J]. IEEE ACCESS, 2019, 7 : 2782 - 2798
  • [34] Dynamic Choice of State Abstraction in Q-Learning
    Tamassia, Marco
    Zambetta, Fabio
    Raffe, William L.
    Mueller, Florian 'Floyd'
    Li, Xiaodong
    [J]. ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 46 - 54
  • [35] Q-learning with Experience Replay in a Dynamic Environment
    Pieters, Mathijs
    Wiering, Marco A.
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [36] PENALIZED Q-LEARNING FOR DYNAMIC TREATMENT REGIMENS
    Song, Rui
    Wang, Weiwei
    Zeng, Donglin
    Kosorok, Michael R.
    [J]. STATISTICA SINICA, 2015, 25 (03) : 901 - 920
  • [37] Modeling and fuzzy Q-learning control of biped walking
    Meng Joo Er
    Yi Zhou
    [J]. Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 641 - 646
  • [38] Accuracy based fuzzy Q-learning for robot behaviours
    Gu, DB
    Hu, HS
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 1455 - 1460
  • [39] A Hybrid Fuzzy Q-Learning algorithm for robot navigation
    Gordon, Sean W.
    Reyes, Napoleon H.
    Barczak, Andre
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2625 - 2631
  • [40] Intelligent Fuzzy Q-Learning control of humanoid robots
    Er, MJ
    Zhou, Y
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 3, PROCEEDINGS, 2005, 3498 : 216 - 221