A Deep Reinforcement Learning Method based on Deterministic Policy Gradient for Multi-Agent Cooperative Competition

被引:0
|
作者
Zuo, Xuan [1 ]
Xue, Hui-Feng [2 ]
Wang, Xiao-Yin [2 ]
Du, Wan-Ru [2 ]
Tian, Tao [2 ]
Gao, Shan [1 ]
Zhang, Pu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] China Aerosp Acad Syst Sci & Engn, Beijing 100048, Peoples R China
来源
关键词
Machine learning; reinforcement learning; multi-agent; cooperative competition; artificial intelligence; GO; ALGORITHM; GAME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning in multi-agent scenario is important for real-world applications but presents challenges beyond those seen in single agent settings. This paper proposes a method to train a team of multiple types of agents to cooperate against another team of agents. Furthermore, this paper studies how to train multiple types of agents to collaborate better on their team tasks, and analyses the influence of various factors on agents' policy. In the computer experiments, agents are divided into attacking agents and defending agents. The results show that attacking agents which play the roles of deceivers can attract most of defending agents and help the other attacking agents to reach their targets successfully. Choosing appropriate length of training could help agents learn better action policy. The experiments results reveal that the number of agents has an effect on the performance of our proposed method. Increasing the number of deceivers in attacking agents can significantly increase the mission success of attacking team, but the computational complexity will rise and more episodes are needed to train agents.
引用
收藏
页码:88 / 98
页数:11
相关论文
共 50 条
  • [1] Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient
    Li, Shihui
    Wu, Yi
    Cui, Xinyue
    Dong, Honghua
    Fang, Fei
    Russell, Stuart
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4213 - 4220
  • [2] Multi-Agent Deep Deterministic Policy Gradient Method Based on Double Critics
    Ding S.
    Du W.
    Guo L.
    Zhang J.
    Xu X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (10): : 2394 - 2404
  • [3] Multi-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient
    Li B.
    Yue K.-Q.
    Gan Z.-G.
    Gao P.-X.
    Yuhang Xuebao/Journal of Astronautics, 2021, 42 (06): : 757 - 765
  • [4] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning
    Rehman, Hafiz Muhammad Raza Ur
    On, Byung-Won
    Ningombam, Devarani Devi
    Yi, Sungwon
    Choi, Gyu Sang
    IEEE ACCESS, 2021, 9 : 129728 - 129741
  • [5] MTMA-DDPG: A Deep Deterministic Policy Gradient Reinforcement Learning for Multi-task Multi-agent Environments
    Hamadeh, Karim
    El Zini, Julia
    Hajar, Joudi
    Awad, Mariette
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART I, 2022, 646 : 270 - 281
  • [6] A distributed adaptive policy gradient method based on momentum for multi-agent reinforcement learning
    Shi, Junru
    Wang, Xin
    Zhang, Mingchuan
    Liu, Muhua
    Zhu, Junlong
    Wu, Qingtao
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 7297 - 7310
  • [7] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient
    Zhan, Mengying
    Chen, Jinchao
    Du, Chenglie
    Duan, Yuxin
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 48 - 52
  • [8] A review of cooperative multi-agent deep reinforcement learning
    Afshin Oroojlooy
    Davood Hajinezhad
    Applied Intelligence, 2023, 53 : 13677 - 13722
  • [9] Asynchronous Methods for Multi-agent Deep Deterministic Policy Gradient
    Jiang, Xuesong
    Li, Zhipeng
    Wei, Xiumei
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 711 - 721
  • [10] A review of cooperative multi-agent deep reinforcement learning
    Oroojlooy, Afshin
    Hajinezhad, Davood
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13677 - 13722