A Deep Reinforcement Learning Method based on Deterministic Policy Gradient for Multi-Agent Cooperative Competition

Authors
Zuo, Xuan [1]
Xue, Hui-Feng [2]
Wang, Xiao-Yin [2]
Du, Wan-Ru [2]
Tian, Tao [2]
Gao, Shan [1]
Zhang, Pu [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] China Aerosp Acad Syst Sci & Engn, Beijing 100048, Peoples R China
Source
Keywords
Machine learning; reinforcement learning; multi-agent; cooperative competition; artificial intelligence; GO; ALGORITHM; GAME;
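DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract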
Deep reinforcement learning in multi-agent scenarios is important for real-world applications but presents challenges beyond those seen in single-agent settings. This paper proposes a method for training a team composed of multiple types of agents to cooperate against another team of agents. It also studies how to train multiple types of agents to collaborate better on their team tasks, and analyses the influence of various factors on the agents' policies. In the computer experiments, agents are divided into attacking agents and defending agents. The results show that attacking agents acting as deceivers can attract most of the defending agents and help the other attacking agents reach their targets. Choosing an appropriate training length helps agents learn a better action policy. The experimental results also show that the number of agents affects the performance of the proposed method. Increasing the number of deceivers among the attacking agents can significantly increase the mission success of the attacking team, but the computational complexity rises and more episodes are needed to train the agents.
Pages: 88-98
Page count: 11
Related papers
50 in total
  • [31] Multi-agent deep deterministic policy gradient algorithm via prioritized experience selected method
    He M.
    Zhang B.
    Liu Q.
    Chen X.-L.
    Yang C.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (01): : 68 - 74
  • [32] QDN: An Efficient Value Decomposition Method for Cooperative Multi-agent Deep Reinforcement Learning
    Xie, Zaipeng
    Zhang, Yufeng
    Shao, Pengfei
    Zhao, Weiyi
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1204 - 1211
  • [33] Decentralized Deterministic Multi-Agent Reinforcement Learning
    Grosnit, Antoine
    Cai, Desmond
    Wynter, Laura
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1548 - 1553
  • [34] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
    Liu, Hui
    Zhang, Zhen
    Wang, Dongqing
    IEEE ACCESS, 2020, 8 : 216320 - 216331
  • [36] Effective credit assignment deep policy gradient multi-agent reinforcement learning for vehicle dispatch
    Huang, Xiaohui
    Zhang, Xiong
    Ling, Jiahao
    Cheng, Xuebo
    APPLIED INTELLIGENCE, 2023, 53 (20) : 23457 - 23469
  • [37] Decentralized reinforcement social learning based on cooperative policy exploration in multi-agent systems
    Wang, Chi
    Chen, Xin
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1575 - 1580
  • [38] DeCOM: Decomposed Policy for Constrained Cooperative Multi-Agent Reinforcement Learning
    Yang, Zhaoxing
    Jin, Haiming
    Ding, Rong
    You, Haoyi
    Fan, Guiyun
    Wang, Xinbing
    Zhou, Chenghu
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10861 - 10870
  • [39] QDAP: Downsizing adaptive policy for cooperative multi-agent reinforcement learning
    Zhao, Zhitong
    Zhang, Ya
    Wang, Siying
    Zhang, Fan
    Zhang, Malu
    Chen, Wenyu
    KNOWLEDGE-BASED SYSTEMS, 2024, 294
  • [40] Decentralized multi-agent control of a three-tank hybrid system based on twin delayed deep deterministic policy gradient reinforcement learning algorithm
    N. Rajasekhar
    T. K. Radhakrishnan
    N. Samsudeen
    International Journal of Dynamics and Control, 2024, 12 : 1098 - 1115