Convergence of Recurrent Neuro-Fuzzy Value-Gradient Learning With and Without an Actor

被引:2
|
作者
Al-Dabooni, Seaar [1 ,2 ]
Wunsch, Donald [3 ]
机构
[1] Appl Computat Intelligence Lab, Rolla, MO 65401 USA
[2] Basra Oil Co, Basra 21240, Iraq
[3] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65401 USA
基金
美国国家科学基金会;
关键词
Adaptive systems; Computer architecture; Dynamic programming; Mobile robots; Adaptive dynamic programming (ADP); convergence analysis; eligibility traces; mobile robot; recurrent neuro-fuzzy (RNF); Takagi-Sugeno (T-S) neuro-fuzzy; SYSTEMS; BACKPROPAGATION; IDENTIFICATION;
D O I
10.1109/TFUZZ.2019.2912349
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, a gradient of the n-step temporaldifference [TD(lambda)] learning has been developed to present an advanced adaptive dynamic programming (ADP) algorithm, called value-gradient learning [VGLl]. In this paper, we improve the VGLl architecture, which is called the "single adaptive actor network [SNVGLl]" because it has only a single approximator function network (critic) instead of dual networks (critic and actor) as in VGLl. Therefore, SNVGLl has lower computational requirements when compared to VGLl. Moreover, in this paper, a recurrent hybrid neuro-fuzzy (RNF) and a first-order Takagi-Sugeno RNF (TSRNF) are derived and implemented to build the critic and actor networks. Furthermore, we develop the novel study of the theoretical convergence proofs for both VGLl and SNVGLl under certain conditions. In this paper, mobile robot simulation model (model based) is used to solve the optimal control problem for affine nonlinear discrete-time systems. Mobile robot is exposed various noise levels to verify the performance and to validate the theoretical analysis.
引用
收藏
页码:658 / 672
页数:15
相关论文
共 50 条
  • [1] Value-Gradient Learning
    Fairbank, Michael
    Alonso, Eduardo
    [J]. 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [2] A modified gradient-based neuro-fuzzy learning algorithm and its convergence
    Wu, Wei
    Li, Long
    Yang, Jie
    Liu, Yan
    [J]. INFORMATION SCIENCES, 2010, 180 (09) : 1630 - 1642
  • [3] Mobile Robot Control Based on Hybrid Neuro-Fuzzy Value Gradient Reinforcement Learning
    Al-Dabooni, Seaar
    Wunsch, Donald
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2820 - 2827
  • [4] Recurrent neuro-fuzzy systems
    Isik, C
    Farrokhi, M
    [J]. 1997 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1997, : 362 - 366
  • [5] A hierarchical recurrent neuro-fuzzy system
    Nürnberger, A
    [J]. JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 1407 - 1412
  • [6] An Evolving Neuro-Fuzzy Recurrent Network
    Rubio Avila, Jose de Jesus
    Pacheco Martinez, Jaime
    Ferreyra Ramirez, Andres
    [J]. 2009 IEEE WORKSHOP ON EVOLVING AND SELF-DEVELOPING INTELLIGENT SYSTEMS, 2009, : 9 - 15
  • [7] Strong Convergence of Neuro-Fuzzy Learning With Adaptive Momentum for Complex System
    Liu, Yan
    Liu, Fang
    Li, Long
    [J]. IEEE ACCESS, 2019, 7 : 39362 - 39368
  • [8] Hardware neuro-fuzzy learning
    Li, Chunshien
    Tsai, Zhao-Yi
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2007, : 282 - +
  • [9] Dynamic Behavior recurrent neuro-fuzzy modeling by combining global learning and local learning
    Chang, XG
    Li, W
    [J]. PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 119 - 126
  • [10] Dynamic Behavior recurrent neuro-fuzzy modeling by combining global learning and local learning
    Chang, XG
    Li, W
    [J]. PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 962 - 965