An Improved Approach towards Multi-Agent Pursuit-Evasion Game Decision-Making Using Deep Reinforcement Learning

被引:20
|
作者
Wan, Kaifang [1 ]
Wu, Dingwei [1 ]
Zhai, Yiwei [1 ]
Li, Bo [1 ]
Gao, Xiaoguang [1 ]
Hu, Zijian [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
pursuit-evasion; multi-agent; deep reinforcement learning; decision-making; adversarial learning; MADDPG;
D O I
10.3390/e23111433
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A pursuit-evasion game is a classical maneuver confrontation problem in the multi-agent systems (MASs) domain. An online decision technique based on deep reinforcement learning (DRL) was developed in this paper to address the problem of environment sensing and decision-making in pursuit-evasion games. A control-oriented framework developed from the DRL-based multi-agent deep deterministic policy gradient (MADDPG) algorithm was built to implement multi-agent cooperative decision-making to overcome the limitation of the tedious state variables required for the traditionally complicated modeling process. To address the effects of errors between a model and a real scenario, this paper introduces adversarial disturbances. It also proposes a novel adversarial attack trick and adversarial learning MADDPG (A2-MADDPG) algorithm. By introducing an adversarial attack trick for the agents themselves, uncertainties of the real world are modeled, thereby optimizing robust training. During the training process, adversarial learning was incorporated into our algorithm to preprocess the actions of multiple agents, which enabled them to properly respond to uncertain dynamic changes in MASs. Experimental results verified that the proposed approach provides superior performance and effectiveness for pursuers and evaders, and both can learn the corresponding confrontational strategy during training.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Using Cognitive Behavioral Learning in Multi-Agent Pursuit-Evasion Game
    Kuo, Jong Yih
    Liu, Chien-Hung
    Lee, Fang-Wen
    [J]. ASIA MODELLING SYMPOSIUM 2014 (AMS 2014), 2014, : 16 - 20
  • [2] Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits
    Yu, Weizhuo
    Liu, Chuang
    Yue, Xiaokui
    [J]. CONTROL ENGINEERING PRACTICE, 2024, 153
  • [3] Learning-Based Metareasoning for Decision Making in Multi-Agent Pursuit-Evasion Games
    Namala, Prannoy
    Herrmann, Jeffrey W.
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [4] Transfer reinforcement learning for multi-agent pursuit-evasion differential game with obstacles in a continuous environment
    Hu, Penglin
    Pan, Quan
    Zhao, Chunhui
    Guo, Yaning
    [J]. ASIAN JOURNAL OF CONTROL, 2024, 26 (04) : 2125 - 2140
  • [5] Pursuit-Evasion Games for Multi-agent Based on Reinforcement Learning with Obstacles
    Hu, Penglin
    Guo, Yaning
    Hu, Jinwen
    Pan, Quan
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1015 - 1024
  • [6] An Approach to Multi-Agent Pursuit Evasion Games Using Reinforcement Learning
    Bilgin, Ahmet Tunc
    Kadioglu-Urtis, Esra
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2015, : 164 - 169
  • [7] An Application of Continuous Deep Reinforcement Learning Approach to Pursuit-Evasion Differential Game
    Wang, Maolin
    Wang, Lixin
    Yue, Ting
    [J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1150 - 1155
  • [8] Multi-Agent Pursuit-Evasion Game Based on Organizational Architecture
    Souidi, Mohammed El Habib
    Siam, Abderrahim
    Pei, Zhaoyi
    Piao, Songhao
    [J]. Journal of Computing and Information Technology, 2019, 27 (01): : 1 - 12
  • [9] Multi-agent pursuit and evasion games based on improved reinforcement learning
    Xue, Ya-Li
    Ye, Jin-Ze
    Li, Han-Yan
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (08): : 1479 - 1486
  • [10] A simplified pursuit-evasion game with reinforcement learning
    Paczolay, Gabor
    Harmati, Istvan
    [J]. Periodica polytechnica Electrical engineering and computer science, 2021, 65 (02): : 160 - 166