A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game

Cited by: 0
Authors
Al-Talabi, Ahmad A. [1, 2]
Schwartz, Howard M. [1]
Affiliations
[1] Carleton Univ, Dept Syst & Comp Engn, 1125 Colonel By Dr, Ottawa, ON K1S 5B6, Canada
[2] Univ Baghdad, Al Khwarizmi Coll Engn, Mechatron Engn Dept, Baghdad, Iraq
Keywords
PARTICLE SWARM; FUZZY; CONTROLLERS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper addresses dual learning in the pursuit-evasion (PE) differential game and examines how quickly the players can learn their default control strategies. The players should learn their default control strategies simultaneously by interacting with each other, and each player's learning process depends on the rewards it receives from its environment. Learning is implemented with a two-stage algorithm that combines the particle swarm optimization (PSO)-based fuzzy logic control (FLC) algorithm with the Q-learning fuzzy inference system (QFIS) algorithm. The PSO algorithm is used as a global optimizer to autonomously tune the parameters of a fuzzy logic controller, whereas the QFIS algorithm is used as a local optimizer. The two-stage learning algorithm is compared through simulation with the default control strategy, the PSO-based FLC algorithm, and the QFIS algorithm. The simulation results show that the players are able to learn their default control strategies and that the two-stage learning algorithm outperforms both the PSO-based FLC algorithm and the QFIS algorithm with respect to learning time.
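The global-then-local structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the cost function `cost`, the parameter vector, and all numeric settings are hypothetical, and the second stage uses a simple random-perturbation hill climb as a plainly named stand-in for the QFIS algorithm, which the abstract does not specify in detail.

```python
import random

def cost(params):
    """Hypothetical stand-in for a controller-tuning cost: squared distance to a target."""
    target = [1.0, -2.0, 0.5]
    return sum((p - t) ** 2 for p, t in zip(params, target))

def pso(cost_fn, dim, n_particles=20, iters=15, seed=0):
    """Stage 1: basic global-best particle swarm optimization (coarse global search)."""
    rng = random.Random(seed)
    pos = [[rng.uniform(-5.0, 5.0) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_cost = [cost_fn(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_cost[i])
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]
    w, c1, c2 = 0.7, 1.5, 1.5  # assumed inertia and acceleration coefficients
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            c = cost_fn(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i][:], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i][:], c
    return gbest, gbest_cost

def local_refine(cost_fn, start, step=0.1, iters=200, seed=1):
    """Stage 2 stand-in (NOT QFIS): accept small random perturbations that lower the cost."""
    rng = random.Random(seed)
    best, best_cost = start[:], cost_fn(start)
    for _ in range(iters):
        cand = [x + rng.gauss(0.0, step) for x in best]
        c = cost_fn(cand)
        if c < best_cost:
            best, best_cost = cand, c
    return best, best_cost

def two_stage(cost_fn, dim):
    """Run the coarse global search, then refine its result locally."""
    coarse, coarse_cost = pso(cost_fn, dim)
    fine, fine_cost = local_refine(cost_fn, coarse)
    return coarse, coarse_cost, fine, fine_cost

if __name__ == "__main__":
    _, coarse_cost, fine, fine_cost = two_stage(cost, dim=3)
    print("cost after global stage:", coarse_cost)
    print("cost after local stage: ", fine_cost)
```

Because the local stage only accepts improvements, its final cost can never exceed the cost handed over by the global stage. In the paper's setting the tuned parameters would belong to a fuzzy logic controller and the second stage would be reward-driven Q-learning rather than a hill climb.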
Pages: 243-250
Number of pages: 8