A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game

Cited by: 0

Authors:

Al-Talabi, Ahmad A. [1, 2]

Schwartz, Howard M. [1]

Affiliations:

[1] Carleton Univ, Dept Syst & Comp Engn, 1125 Colonel By Dr, Ottawa, ON K1S 5B6, Canada

[2] Univ Baghdad, Al Khwarizmi Coll Engn, Mechatron Engn Dept, Baghdad, Iraq

Source:

2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL) | 2014

Keywords:

PARTICLE SWARM; FUZZY; CONTROLLERS;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper addresses dual learning in the pursuit-evasion (PE) differential game and examines how fast the players can learn their default control strategies. The players must learn these strategies simultaneously by interacting with each other, and each player's learning process depends on the rewards it receives from its environment. Learning is implemented with a two-stage algorithm that combines the particle swarm optimization (PSO)-based fuzzy logic control (FLC) algorithm with the Q-learning fuzzy inference system (QFIS) algorithm. The PSO algorithm serves as a global optimizer that autonomously tunes the parameters of a fuzzy logic controller, whereas the QFIS algorithm serves as a local optimizer. The two-stage learning algorithm is compared through simulation with the default control strategy, the PSO-based FLC algorithm, and the QFIS algorithm. Simulation results show that the players are able to learn their default control strategies. They also show that the two-stage learning algorithm outperforms both the PSO-based FLC algorithm and the QFIS algorithm with respect to learning time.
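The global-then-local structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the first stage is a standard global-best PSO, but the second stage uses a simple hill climber as a stand-in for QFIS, and `toy_cost`, `pso_minimize`, `local_refine`, and all parameter values are hypothetical names and settings chosen only for illustration. The record does not give the paper's fuzzy controller, reward, or game dynamics, so a toy quadratic cost takes their place.

```python
import random


def pso_minimize(cost, dim, bounds, n_particles=20, n_iters=10, seed=0,
                 inertia=0.7, c1=1.5, c2=1.5):
    """Stage 1 (global search): standard global-best PSO inside a box."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]            # personal best positions
    best_cost = [cost(p) for p in pos]        # personal best costs
    g = min(range(n_particles), key=lambda i: best_cost[i])
    g_pos, g_cost = best_pos[g][:], best_cost[g]   # swarm-wide best
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                # Clamp each coordinate to the search box.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            c = cost(pos[i])
            if c < best_cost[i]:
                best_pos[i], best_cost[i] = pos[i][:], c
                if c < g_cost:
                    g_pos, g_cost = pos[i][:], c
    return g_pos, g_cost


def local_refine(cost, start, step=0.1, n_iters=200, seed=1):
    """Stage 2 (local search): hill climbing, a stand-in for QFIS (assumption)."""
    rng = random.Random(seed)
    x, fx = start[:], cost(start)
    for _ in range(n_iters):
        cand = [xi + rng.gauss(0.0, step) for xi in x]
        fc = cost(cand)
        if fc < fx:                           # accept only improvements
            x, fx = cand, fc
    return x, fx


def toy_cost(params):
    """Hypothetical cost: squared distance to an arbitrary target vector."""
    target = [0.5, -1.0, 2.0]
    return sum((p - t) ** 2 for p, t in zip(params, target))


# A short PSO run gives a coarse solution; the local stage then refines it.
coarse, coarse_cost = pso_minimize(toy_cost, dim=3, bounds=(-5.0, 5.0))
fine, fine_cost = local_refine(toy_cost, coarse)
```

Because the local stage starts from the PSO result and accepts only improving candidates, `fine_cost` can never exceed `coarse_cost`. The sketch shows only this hand-off between stages and says nothing about the learning times reported in the paper.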


Pages: 243-250

Page count: 8
