Hierarchical fuzzy ART for Q-learning and its application in air combat simulation

被引:5
|
作者
Zhou Y. [1 ]
Ma Y. [1 ]
Song X. [1 ]
Gong G. [1 ]
机构
[1] School of Automation Science and Electrical Engineering, Beihang University, XueYuan Road No. 37, HaiDian District, Beijing
来源
| 1600年 / World Scientific卷 / 08期
关键词
air combat simulation; Fuzzy ART; Q-learning; value function approximation;
D O I
10.1142/S1793962317500520
中图分类号
学科分类号
摘要
Value function approximation plays an important role in reinforcement learning (RL) with continuous state space, which is widely used to build decision models in practice. Many traditional approaches require experienced designers to manually specify the formulization of the approximating function, leading to the rigid, non-adaptive representation of the value function. To address this problem, a novel Q-value function approximation method named 'Hierarchical fuzzy Adaptive Resonance Theory' (HiART) is proposed in this paper. HiART is based on the Fuzzy ART method and is an adaptive classification network that learns to segment the state space by classifying the training input automatically. HiART begins with a highly generalized structure where the number of the category nodes is limited, which is beneficial to speed up the learning process at the early stage. Then, the network is refined gradually by creating the attached sub-networks, and a layered network structure is formed during this process. Based on this adaptive structure, HiART alleviates the dependence on expert experience to design the network parameter. The effectiveness and adaptivity of HiART are demonstrated in the Mountain Car benchmark problem with both fast learning speed and low computation time. Finally, a simulation application example of the one versus one air combat decision problem illustrates the applicability of HiART. © 2017 World Scientific Publishing Company.
引用
收藏
相关论文
共 50 条
  • [21] Parameter specification for fuzzy clustering by Q-learning
    Oh, CH
    Ikeda, E
    Honda, K
    Ichihashi, H
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL IV, 2000, : 9 - 12
  • [22] Fuzzy Q-learning Control for Temperature Systems
    Chen, Yeong-Chin
    Hung, Lon-Chen
    Syamsudin, Mariana
    22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 148 - 151
  • [23] Decoupled Visual Servoing With Fuzzy Q-Learning
    Shi, Haobin
    Li, Xuesi
    Hwang, Kao-Shing
    Pan, Wei
    Xu, Genjiu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (01) : 241 - 252
  • [24] Extending Q-learning to fuzzy classifier systems
    Bonarini, A
    TOPICS IN ARTIFICIAL INTELLIGENCE, 1995, 992 : 25 - 36
  • [25] Efficient implementation of dynamic fuzzy Q-learning
    Deng, C
    Er, MJ
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1854 - 1858
  • [26] Implementation of fuzzy Q-learning for a soccer agent
    Nakashima, T
    Udo, M
    Ishibuchi, H
    PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 533 - 536
  • [27] Swarm Reinforcement Learning Method Based on Hierarchical Q-Learning
    Kuroe, Yasuaki
    Takeuchi, Kenya
    Maeda, Yutaka
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [28] Hybrid MDP based integrated hierarchical Q-learning
    TARN Tzyh-Jong
    Science China(Information Sciences), 2011, 54 (11) : 2279 - 2294
  • [29] Hybrid MDP based integrated hierarchical Q-learning
    ChunLin Chen
    DaoYi Dong
    Han-Xiong Li
    Tzyh-Jong Tarn
    Science China Information Sciences, 2011, 54 : 2279 - 2294
  • [30] Hybrid MDP based integrated hierarchical Q-learning
    Chen ChunLin
    Dong DaoYi
    Li Han-Xiong
    Tarn, Tzyh-Jong
    SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (11) : 2279 - 2294