Hierarchical Multi-Agent Training Based on Reinforcement Learning

Cited by: 1
Authors
Wang, Guanghua [1]
Li, Wenjie [2]
Wu, Zhanghua [3]
Guo, Xian [1]
Affiliations
[1] Nankai Univ, Inst Robot & Automat Informat Syst, Tianjin, Peoples R China
[2] State Grid Tianjin Elect Power Co, Tianjin, Peoples R China
[3] Jiangsu Automat Res Inst, Lianyungang, Jiangsu, Peoples R China
Keywords
Multi-Agent Systems; Reinforcement Learning; Multi-Agent Proximal Policy Optimization Algorithm; Formation Confrontation
DOI
10.1109/ACIRS62330.2024.10684909
Chinese Library Classification
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Current multi-UAV adversarial games suffer from two problems: distributed strategies are unstable and difficult to learn, and the UAVs within a formation lack coordination. This paper proposes a hierarchical multi-agent training framework to address both problems. The framework divides a UAV formation into two types of agents: virtual centroid agents and the UAVs within the formation. A centroid agent controls the overall movement of its formation, while the UAVs within the formation flexibly adjust their speed and heading on that basis. The framework is validated experimentally in a confrontation scenario involving multiple formations and multiple types of UAVs. Against UAVs controlled by rule-based strategies, it reaches an average win rate of 97% and produces both formation variations and tactical evolution.
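The two-level control described in the abstract can be sketched roughly as follows. This is an illustration only, not the authors' implementation: the abstract does not specify the action spaces, so the `Command` type, the `compose` and `wrap_angle` helpers, the additive speed and heading offsets, and the `max_speed` limit are all assumptions made for this example.

```python
import math
from dataclasses import dataclass


@dataclass
class Command:
    """Hypothetical motion command: speed in m/s, heading in radians."""
    speed: float
    heading: float


def wrap_angle(angle: float) -> float:
    """Wrap an angle to the interval (-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))


def compose(centroid: Command, d_speed: float, d_heading: float,
            max_speed: float = 30.0) -> Command:
    """Combine the centroid agent's formation-level command with one UAV's
    individual adjustment (assumed here to be additive offsets)."""
    speed = min(max(centroid.speed + d_speed, 0.0), max_speed)
    heading = wrap_angle(centroid.heading + d_heading)
    return Command(speed, heading)


def step_formation(centroid: Command, adjustments):
    """Apply each UAV's (d_speed, d_heading) adjustment to the shared
    centroid command and return one command per UAV."""
    return [compose(centroid, ds, dh) for ds, dh in adjustments]


# Example: the centroid command moves the formation, two UAVs deviate slightly.
formation_cmd = Command(speed=20.0, heading=0.0)
uav_cmds = step_formation(formation_cmd, [(2.0, 0.1), (-3.0, -0.2)])
```

In this sketch the centroid command would come from the high-level policy and each adjustment from a UAV-level policy; how the paper actually couples the two levels is not stated in the abstract.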
Pages: 11-18
Number of pages: 8