Hierarchical fuzzy ART for Q-learning and its application in air combat simulation

Cited by: 5

Authors:

Zhou Y. [1]

Ma Y. [1]

Song X. [1]

Gong G. [1]

Affiliations:

[1] School of Automation Science and Electrical Engineering, Beihang University, XueYuan Road No. 37, HaiDian District, Beijing

Source:

Year: 1600 / Volume: World Scientific / Issue: 08

Keywords:

air combat simulation; Fuzzy ART; Q-learning; value function approximation

DOI:

10.1142/S1793962317500520

Abstract:

Value function approximation plays an important role in reinforcement learning (RL) with continuous state spaces, which is widely used to build decision models in practice. Many traditional approaches require experienced designers to manually specify the form of the approximating function, which leads to a rigid, non-adaptive representation of the value function. To address this problem, this paper proposes a novel Q-value function approximation method named 'Hierarchical fuzzy Adaptive Resonance Theory' (HiART). HiART is based on the Fuzzy ART method and is an adaptive classification network that learns to segment the state space by automatically classifying the training inputs. HiART begins with a highly generalized structure in which the number of category nodes is limited, which speeds up learning at the early stage. The network is then refined gradually by creating attached sub-networks, forming a layered network structure. Thanks to this adaptive structure, HiART reduces the dependence on expert experience when designing the network parameters. The effectiveness and adaptivity of HiART are demonstrated on the Mountain Car benchmark problem, where it achieves both fast learning and low computation time. Finally, a simulation example of a one-versus-one air combat decision problem illustrates the applicability of HiART. © 2017 World Scientific Publishing Company.
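To make the idea of "segmenting the state space by classifying inputs" concrete, the following is a minimal sketch of a standard, single-layer Fuzzy ART clusterer (complement coding, choice function, vigilance test, weight update). It is not the paper's HiART: the hierarchical sub-networks and the coupling to Q-values are not reproduced here. The class name `FuzzyART`, the method names, and the parameter values are illustrative assumptions.

```python
import numpy as np


class FuzzyART:
    """Standard single-layer Fuzzy ART (illustrative sketch, not HiART)."""

    def __init__(self, rho=0.75, alpha=0.001, beta=1.0):
        self.rho = rho      # vigilance: higher values yield finer categories
        self.alpha = alpha  # choice parameter (small positive constant)
        self.beta = beta    # learning rate; 1.0 corresponds to fast learning
        self.weights = []   # one weight vector per category node

    @staticmethod
    def complement_code(x):
        # Map x in [0, 1]^d to [x, 1 - x] so the input norm is constant (= d).
        x = np.asarray(x, dtype=float)
        return np.concatenate([x, 1.0 - x])

    def train(self, x):
        """Present one input in [0, 1]^d and return its category index."""
        i = self.complement_code(x)
        # Choice function: T_j = |I ^ w_j| / (alpha + |w_j|), ^ = elementwise min.
        scores = [np.minimum(i, w).sum() / (self.alpha + w.sum())
                  for w in self.weights]
        # Try categories from highest to lowest choice value.
        for j in np.argsort(scores)[::-1]:
            w = self.weights[j]
            match = np.minimum(i, w).sum() / i.sum()
            if match >= self.rho:  # vigilance test passed: resonance
                self.weights[j] = (self.beta * np.minimum(i, w)
                                   + (1.0 - self.beta) * w)
                return int(j)
        # No existing category matches well enough: commit a new node.
        self.weights.append(i.copy())
        return len(self.weights) - 1


# Hypothetical 2-D "states", already normalized to [0, 1].
art = FuzzyART(rho=0.75)
labels = [art.train(p) for p in [(0.1, 0.1), (0.12, 0.1), (0.9, 0.9)]]
```

In this sketch a low vigilance `rho` gives few, coarse categories, which loosely mirrors the "highly generalized" starting structure described in the abstract; how HiART actually attaches and refines sub-networks is specified in the paper itself.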

