Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment

被引:18
|
作者
Qu, Xiuqing [1 ]
Gan, Wenhao [1 ]
Song, Dalei [1 ,2 ]
Zhou, Liqin [1 ]
机构
[1] Ocean Univ China, Coll Engn, 238 Songling Rd, Qingdao 266100, Shandong, Peoples R China
[2] Ocean Univ China, Inst Adv Ocean Study, 238 Songling Rd, Qingdao 266100, Shandong, Peoples R China
关键词
Unmanned surface vehicles; Pursuit-evasion game; Deep reinforcement learning; Imitation learning; Obstacle avoidance;
D O I
10.1016/j.oceaneng.2023.114016
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Aiming at the confrontation game problems between pursuit-evasion unmanned surface vehicles under complex multi-obstacle environment, a pursuit-evasion game strategy is proposed. Firstly, the multi-obstacle environment is set up, and the gaming situation can be judged by the perception between pursuit-evasion USVs. For the pursuers, the model training is performed based on multi-agent deep reinforcement learning, so that they can quickly plan a reasonable obstacle avoidance and pursuit route, and form an effective encirclement posture before the evader approaches the target point. Meanwhile, the credit assignment problem among the members of the pursuing group is considered. For the evader, deep reinforcement learning is combined with imitation learning to train the escape model, so that it can reach the preset point in as short a time as possible and avoid the obstacles on the way. Finally, an adversarial-evolutionary game training method under multiple random scenarios is designed and combined with curriculum learning to iteratively update the pursuit and escape models. Through the detailed comparative analysis of the model training process and simulation experiments, it is proved that the proposed two types of models have higher convergence efficiency and stability, and they can have higher intelligence to pursue, escape and avoid obstacles respectively.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning
    Guo, Yunhe
    Jiang, Zijian
    Huang, Hanqiao
    Fan, Hongjia
    Weng, Weiye
    [J]. AEROSPACE, 2023, 10 (09)
  • [2] Intelligent Pursuit-Evasion Game Based on Deep Reinforcement Learning for Hypersonic Vehicles
    Gao, Mengjing
    Yan, Tian
    Li, Quancheng
    Fu, Wenxing
    Zhang, Jin
    [J]. AEROSPACE, 2023, 10 (01)
  • [3] A simplified pursuit-evasion game with reinforcement learning
    Paczolay, Gabor
    Harmati, Istvan
    [J]. Periodica polytechnica Electrical engineering and computer science, 2021, 65 (02): : 160 - 166
  • [4] Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning
    Xu, Can
    Zhang, Yin
    Wang, Weigang
    Dong, Ligang
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [5] Strategy solution of non-cooperative target pursuit-evasion game based on branching deep reinforcement learning
    Liu, Bingyan
    Ye, Xiongbing
    Gao, Yong
    Wang, Xinbo
    Ni, Lei
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2020, 41 (10):
  • [6] Collaborative pursuit-evasion game of multi-UAVs based on Apollonius circle in the environment with obstacle
    Liang, Xiao
    Zhou, Boran
    Jiang, Linping
    Meng, Guanglei
    Xiu, Yiwei
    [J]. CONNECTION SCIENCE, 2023, 35 (01)
  • [7] Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning
    Zhang, Ruilong
    Zong, Qun
    Zhang, Xiuyun
    Dou, Liqian
    Tian, Bailing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7900 - 7909
  • [8] An Application of Continuous Deep Reinforcement Learning Approach to Pursuit-Evasion Differential Game
    Wang, Maolin
    Wang, Lixin
    Yue, Ting
    [J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1150 - 1155
  • [9] A Deep Reinforcement Learning-Based Intelligent Maneuvering Strategy for the High-Speed UAV Pursuit-Evasion Game
    Yan, Tian
    Liu, Can
    Gao, Mengjing
    Jiang, Zijian
    Li, Tong
    [J]. DRONES, 2024, 8 (07)
  • [10] Transfer reinforcement learning for multi-agent pursuit-evasion differential game with obstacles in a continuous environment
    Hu, Penglin
    Pan, Quan
    Zhao, Chunhui
    Guo, Yaning
    [J]. ASIAN JOURNAL OF CONTROL, 2024, 26 (04) : 2125 - 2140