An Improved Approach towards Multi-Agent Pursuit-Evasion Game Decision-Making Using Deep Reinforcement Learning

被引：20

作者：

Wan, Kaifang ^{[1
]}

Wu, Dingwei ^{[1
]}

Zhai, Yiwei ^{[1
]}

Li, Bo ^{[1
]}

Gao, Xiaoguang ^{[1
]}

Hu, Zijian ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China

来源：

ENTROPY | 2021年 / 23卷 / 11期

基金：

中国国家自然科学基金;

关键词：

pursuit-evasion; multi-agent; deep reinforcement learning; decision-making; adversarial learning; MADDPG;

D O I：

10.3390/e23111433

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

A pursuit-evasion game is a classical maneuver confrontation problem in the multi-agent systems (MASs) domain. An online decision technique based on deep reinforcement learning (DRL) was developed in this paper to address the problem of environment sensing and decision-making in pursuit-evasion games. A control-oriented framework developed from the DRL-based multi-agent deep deterministic policy gradient (MADDPG) algorithm was built to implement multi-agent cooperative decision-making to overcome the limitation of the tedious state variables required for the traditionally complicated modeling process. To address the effects of errors between a model and a real scenario, this paper introduces adversarial disturbances. It also proposes a novel adversarial attack trick and adversarial learning MADDPG (A2-MADDPG) algorithm. By introducing an adversarial attack trick for the agents themselves, uncertainties of the real world are modeled, thereby optimizing robust training. During the training process, adversarial learning was incorporated into our algorithm to preprocess the actions of multiple agents, which enabled them to properly respond to uncertain dynamic changes in MASs. Experimental results verified that the proposed approach provides superior performance and effectiveness for pursuers and evaders, and both can learn the corresponding confrontational strategy during training.

引用

页数：22

共 50 条

[1] Using Cognitive Behavioral Learning in Multi-Agent Pursuit-Evasion Game
Kuo, Jong Yih
Liu, Chien-Hung
Lee, Fang-Wen
[J]. ASIA MODELLING SYMPOSIUM 2014 (AMS 2014), 2014, : 16 - 20
[2] Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits
Yu, Weizhuo
Liu, Chuang
Yue, Xiaokui
[J]. CONTROL ENGINEERING PRACTICE, 2024, 153
[3] Learning-Based Metareasoning for Decision Making in Multi-Agent Pursuit-Evasion Games
Namala, Prannoy
Herrmann, Jeffrey W.
[J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
[4] Transfer reinforcement learning for multi-agent pursuit-evasion differential game with obstacles in a continuous environment
Hu, Penglin
Pan, Quan
Zhao, Chunhui
Guo, Yaning
[J]. ASIAN JOURNAL OF CONTROL, 2024, 26 (04) : 2125 - 2140
[5] Pursuit-Evasion Games for Multi-agent Based on Reinforcement Learning with Obstacles
Hu, Penglin
Guo, Yaning
Hu, Jinwen
Pan, Quan
[J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1015 - 1024
[6] An Approach to Multi-Agent Pursuit Evasion Games Using Reinforcement Learning
Bilgin, Ahmet Tunc
Kadioglu-Urtis, Esra
[J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2015, : 164 - 169
[7] An Application of Continuous Deep Reinforcement Learning Approach to Pursuit-Evasion Differential Game
Wang, Maolin
Wang, Lixin
Yue, Ting
[J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1150 - 1155
[8] Multi-Agent Pursuit-Evasion Game Based on Organizational Architecture
Souidi, Mohammed El Habib
Siam, Abderrahim
Pei, Zhaoyi
Piao, Songhao
[J]. Journal of Computing and Information Technology, 2019, 27 (01): : 1 - 12
[9] Multi-agent pursuit and evasion games based on improved reinforcement learning
Xue, Ya-Li
Ye, Jin-Ze
Li, Han-Yan
[J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (08): : 1479 - 1486
[10] A simplified pursuit-evasion game with reinforcement learning
Paczolay, Gabor
Harmati, Istvan
[J]. Periodica polytechnica Electrical engineering and computer science, 2021, 65 (02): : 160 - 166

← 1 2 3 4 5 →