A Deep Reinforcement Learning Method based on Deterministic Policy Gradient for Multi-Agent Cooperative Competition

被引:0
|
作者
Zuo, Xuan [1 ]
Xue, Hui-Feng [2 ]
Wang, Xiao-Yin [2 ]
Du, Wan-Ru [2 ]
Tian, Tao [2 ]
Gao, Shan [1 ]
Zhang, Pu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] China Aerosp Acad Syst Sci & Engn, Beijing 100048, Peoples R China
来源
关键词
Machine learning; reinforcement learning; multi-agent; cooperative competition; artificial intelligence; GO; ALGORITHM; GAME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning in multi-agent scenario is important for real-world applications but presents challenges beyond those seen in single agent settings. This paper proposes a method to train a team of multiple types of agents to cooperate against another team of agents. Furthermore, this paper studies how to train multiple types of agents to collaborate better on their team tasks, and analyses the influence of various factors on agents' policy. In the computer experiments, agents are divided into attacking agents and defending agents. The results show that attacking agents which play the roles of deceivers can attract most of defending agents and help the other attacking agents to reach their targets successfully. Choosing appropriate length of training could help agents learn better action policy. The experiments results reveal that the number of agents has an effect on the performance of our proposed method. Increasing the number of deceivers in attacking agents can significantly increase the mission success of attacking team, but the computational complexity will rise and more episodes are needed to train agents.
引用
收藏
页码:88 / 98
页数:11
相关论文
共 50 条
  • [41] Decentralized multi-agent control of a three-tank hybrid system based on twin delayed deep deterministic policy gradient reinforcement learning algorithm
    Rajasekhar, N.
    Radhakrishnan, T. K.
    Samsudeen, N.
    INTERNATIONAL JOURNAL OF DYNAMICS AND CONTROL, 2023, 12 (4) : 1098 - 1115
  • [42] Multi-Agent Distributed Deep Deterministic Policy Gradient for Partially Observable Tracking
    Fan, Dongyu
    Shen, Haikuo
    Dong, Lijing
    ACTUATORS, 2021, 10 (10)
  • [43] A Data Enhancement Strategy for Multi-Agent Cooperative Hunting based on Deep Reinforcement Learning
    Gao, Zhenkun
    Dai, Xiaoyan
    Yao, Meibao
    Xiao, Xueming
    2023 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2023,
  • [44] Deep Multi-Agent Reinforcement Learning Based Cooperative Edge Caching in Wireless Networks
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [45] Ship cooperative collision avoidance strategy based on multi-agent deep reinforcement learning
    Sui L.-R.
    Gao S.
    He W.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1395 - 1402
  • [46] Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
    Zhao, Xutong
    Pan, Yangchen
    Xiao, Chenjun
    Chandar, Sarath
    Rajendran, Janarthanan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2529 - 2540
  • [47] Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning
    Bacchiani, Giulio
    Molinari, Daniele
    Patander, Marco
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1547 - 1555
  • [48] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [49] Independent Deep Deterministic Policy Gradient Reinforcement Learning in Cooperative Multiagent Pursuit Games
    Zhou, Shiyang
    Ren, Weiya
    Ren, Xiaoguang
    Wang, Yanzhen
    Yi, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 625 - 637
  • [50] Deterministic Policy Gradient Based Formation Control for Multi-Agent Systems
    Hong, Zhiying
    Wang, Qingling
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 4349 - 4354