Efficient implementation of dynamic fuzzy Q-learning

被引：0

作者：

Deng, C ^{[1
]}

Er, MJ ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore

来源：

ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS | 2003年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a Dynamic Fuzzy Q-Learning (DFQL) method that is capable of tuning the Fuzzy Inference Systems (FIS) online. On-line self-organizing learning is developed so that structure and parameters identification are accomplished automatically and simultaneously. Self-organizing fuzzy inference is introduced to calculate actions and Q-functions so as to enable us to deal with continuous-valued states and actions. We provide the conditions of the convergence of the algorithm. Furthermore, the learning methods based on bias component and eligibility traces for rapid reinforcement learning are discussed.

引用

页码：1854 / 1858

页数：5

共 50 条

[31] Fuzzy Q-learning with the modified fuzzy ART neural network
Ueda, H
Hanada, N
Kimoto, H
Naraki, T
Takahashi, K
Miyahara, T
[J]. 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 308 - 315
[32] Design of Dynamic Fuzzy Q-Learning Controller for Networked Wind Energy Conversion Systems
Wanigasekara, Chathura
Swain, Akshya
Almakhles, Dhafer
Zhou, Lv
[J]. 2020 20TH IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2020 4TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2020,
[33] Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA
Da Silva, Lucileide M. D.
Torquato, Matheus F.
Fernandes, Marcelo A. C.
[J]. IEEE ACCESS, 2019, 7 : 2782 - 2798
[34] Dynamic Choice of State Abstraction in Q-Learning
Tamassia, Marco
Zambetta, Fabio
Raffe, William L.
Mueller, Florian 'Floyd'
Li, Xiaodong
[J]. ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 46 - 54
[35] Q-learning with Experience Replay in a Dynamic Environment
Pieters, Mathijs
Wiering, Marco A.
[J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
[36] PENALIZED Q-LEARNING FOR DYNAMIC TREATMENT REGIMENS
Song, Rui
Wang, Weiwei
Zeng, Donglin
Kosorok, Michael R.
[J]. STATISTICA SINICA, 2015, 25 (03) : 901 - 920
[37] Modeling and fuzzy Q-learning control of biped walking
Meng Joo Er
Yi Zhou
[J]. Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 641 - 646
[38] Accuracy based fuzzy Q-learning for robot behaviours
Gu, DB
Hu, HS
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 1455 - 1460
[39] A Hybrid Fuzzy Q-Learning algorithm for robot navigation
Gordon, Sean W.
Reyes, Napoleon H.
Barczak, Andre
[J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2625 - 2631
[40] Intelligent Fuzzy Q-Learning control of humanoid robots
Er, MJ
Zhou, Y
[J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 3, PROCEEDINGS, 2005, 3498 : 216 - 221

← 1 2 3 4 5 →