Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning

被引:10
|
作者
Xia, Jiawei [1 ]
Luo, Yasong [1 ]
Liu, Zhikun [1 ]
Zhang, Yalun [2 ]
Shi, Haoran [1 ]
Liu, Zhong [1 ]
机构
[1] Naval Univ Engn, Coll Weaponry Engn, Wuhan 430033, Peoples R China
[2] Naval Univ Engn, Inst Vibrat & Noise, Wuhan 430033, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned surface vehicles; Multi-agent deep reinforcement learning; Cooperative hunting; Feature embedding; Proximal policy optimization; PURSUIT-EVASION GAME; DECISION-MAKING; GO;
D O I
10.1016/j.dt.2022.09.014
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
To solve the problem of multi-target hunting by an unmanned surface vehicle (USV) fleet, a hunting algorithm based on multi-agent reinforcement learning is proposed. Firstly, the hunting environment and kinematic model without boundary constraints are built, and the criteria for successful target capture are given. Then, the cooperative hunting problem of a USV fleet is modeled as a decentralized partially observable Markov decision process (Dec-POMDP), and a distributed partially observable multitarget hunting Proximal Policy Optimization (DPOMH-PPO) algorithm applicable to USVs is proposed. In addition, an observation model, a reward function and the action space applicable to multi-target hunting tasks are designed. To deal with the dynamic change of observational feature dimension input by partially observable systems, a feature embedding block is proposed. By combining the two feature compression methods of column-wise max pooling (CMP) and column-wise average-pooling (CAP), observational feature encoding is established. Finally, the centralized training and decentralized execution framework is adopted to complete the training of hunting strategy. Each USV in the fleet shares the same policy and perform actions independently. Simulation experiments have verified the effectiveness of the DPOMH-PPO algorithm in the test scenarios with different numbers of USVs. Moreover, the advantages of the proposed model are comprehensively analyzed from the aspects of algorithm performance, migration effect in task scenarios and self-organization capability after being damaged, the potential deployment and application of DPOMH-PPO in the real environment is verified. (c) 2023 China Ordnance Society. Publishing services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页码:80 / 94
页数:15
相关论文
共 50 条
  • [1] Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning
    Jiawei Xia
    Yasong Luo
    Zhikun Liu
    Yalun Zhang
    Haoran Shi
    Zhong Liu
    [J]. Defence Technology, 2023, 29 (11) : 80 - 94
  • [2] Research on cooperative hunting method of unmanned surface vehicle based on multi-agent reinforcement learning
    Xia, Jia-Wei
    Zhu, Xu-Fang
    Zhang, Jian-Qiang
    Luo, Ya-Song
    Liu, Zhong
    [J]. Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1438 - 1447
  • [3] Multi-Agent and Multi-Target Reinforcement Learning for Satellite Sensor Tasking
    Saeed, Amir K.
    Holguin, Francisco
    Yasin, Alhassan S.
    Johnson, Benjamin A.
    Rodriguez, Benjamin M.
    [J]. 2024 IEEE AEROSPACE CONFERENCE, 2024,
  • [4] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Zhou, Wenhong
    LI, Jie
    Liu, Zhihong
    Shen, Lincheng
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (07) : 100 - 112
  • [5] Improving Cooperative Multi-Target Tracking Control for UAV Swarm Using Multi-Agent Reinforcement Learning
    Yue, Longfei
    Lv, Maolong
    Yan, Mengda
    Zhao, Xiaoru
    Wu, Ao
    Li, Leyan
    Zuo, Jialiang
    [J]. 2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, : 179 - 186
  • [6] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Wenhong ZHOU
    Jie LI
    Zhihong LIU
    Lincheng SHEN
    [J]. Chinese Journal of Aeronautics., 2022, 35 (07) - 112
  • [7] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Wenhong ZHOU
    Jie LI
    Zhihong LIU
    Lincheng SHEN
    [J]. Chinese Journal of Aeronautics, 2022, (07) : 100 - 112
  • [8] Multi-Agent Deep Reinforcement Learning for Cooperative Connected Vehicles
    Kwon, Dohyun
    Kim, Joongheon
    [J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [9] A Data Enhancement Strategy for Multi-Agent Cooperative Hunting based on Deep Reinforcement Learning
    Gao, Zhenkun
    Dai, Xiaoyan
    Yao, Meibao
    Xiao, Xueming
    [J]. 2023 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2023,
  • [10] Multi-Agent Reinforcement Learning and Chimpanzee Hunting
    Sauter, Michael Z.
    Shi, Dongqing
    Kralik, Jerald D.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4, 2009, : 622 - 626