A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game

Cited by: 0

Authors:

Al-Talabi, Ahmad A. [1,2]

Schwartz, Howard M. [1]

Affiliations:

[1] Carleton Univ, Dept Syst & Comp Engn, 1125 Colonel By Dr, Ottawa, ON K1S 5B6, Canada

[2] Univ Baghdad, Al Khwarizmi Coll Engn, Mechatron Engn Dept, Baghdad, Iraq

Source:

2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL) | 2014

Keywords:

PARTICLE SWARM; FUZZY; CONTROLLERS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper addresses dual learning in the pursuit-evasion (PE) differential game and examines how quickly the players can learn their default control strategies. The players must learn these strategies simultaneously by interacting with each other, and each player's learning process depends on the rewards it receives from its environment. Learning is implemented with a two-stage algorithm that combines the particle swarm optimization (PSO)-based fuzzy logic control (FLC) algorithm with the Q-learning fuzzy inference system (QFIS) algorithm. The PSO algorithm serves as a global optimizer that autonomously tunes the parameters of a fuzzy logic controller, whereas the QFIS algorithm serves as a local optimizer. The two-stage learning algorithm is compared through simulation with the default control strategy, the PSO-based FLC algorithm, and the QFIS algorithm. Simulation results show that the players are able to learn their default control strategies and that the two-stage learning algorithm outperforms both the PSO-based FLC algorithm and the QFIS algorithm with respect to learning time.
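The global-then-local structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the cost function `toy_cost`, the PSO coefficients, and the function names are all hypothetical, and the second stage uses simple random hill climbing as a stand-in for QFIS, which in the paper is a Q-learning method. The sketch only shows how a coarse PSO result might be handed to a local refinement step.

```python
import random

def pso_minimize(cost, dim, bounds, n_particles=20, n_iters=10, seed=0):
    """Stage 1 (global search): a basic particle swarm optimizer."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]           # each particle's best position so far
    best_cost = [cost(p) for p in pos]
    g = min(range(n_particles), key=lambda i: best_cost[i])
    g_pos, g_cost = best_pos[g][:], best_cost[g]   # swarm-wide best
    w, c1, c2 = 0.7, 1.5, 1.5  # inertia and acceleration weights (assumed values)
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                vel[i][d] = (w * vel[i][d]
                             + c1 * rng.random() * (best_pos[i][d] - pos[i][d])
                             + c2 * rng.random() * (g_pos[d] - pos[i][d]))
                # Keep each coordinate inside the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            c = cost(pos[i])
            if c < best_cost[i]:
                best_pos[i], best_cost[i] = pos[i][:], c
                if c < g_cost:
                    g_pos, g_cost = pos[i][:], c
    return g_pos, g_cost

def local_refine(cost, start, step=0.1, n_iters=200, seed=1):
    """Stage 2 (local search): random hill climbing, a stand-in for QFIS."""
    rng = random.Random(seed)
    best, best_cost = start[:], cost(start)
    for _ in range(n_iters):
        cand = [x + rng.gauss(0.0, step) for x in best]
        c = cost(cand)
        if c < best_cost:  # accept only improvements
            best, best_cost = cand, c
    return best, best_cost

def toy_cost(params):
    # Hypothetical surrogate for a controller-quality measure; not from the paper.
    return sum((x - 1.0) ** 2 for x in params)

# Run a short global stage, then refine its result locally.
coarse, coarse_cost = pso_minimize(toy_cost, dim=3, bounds=(-5.0, 5.0))
fine, fine_cost = local_refine(toy_cost, coarse)
```

Because the local stage accepts only improving candidates, `fine_cost` can never exceed `coarse_cost`. How much it improves depends on the toy cost and the assumed step size.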


Pages: 243-250

Page count: 8

