Strategic Interaction Multi-Agent Deep Reinforcement Learning

被引:0
|
作者
Zhou, Wenhong [1 ]
Li, Jie [1 ]
Chen, Yiting [1 ]
Shen, Lin-Cheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Multi-agent deep reinforcement learning; scalability; local interaction; large scale;
D O I
10.1109/ACCESS.2020.3005734
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the proliferation of multi-agent deep reinforcement learning (MADRL), most existing typical methods do not scale well to the dynamics of agent populations. And as the population increases, the dimensional explosion of joint state-action and the complex interaction between agents make learning extremely cumbersome, which poses the scalability challenge for MADRL. This paper focuses on the scalability issue of MADRL with homogeneous agents. In a natural population, local interaction is a more feasible mode of interplay rather than global interaction. And inspired by the strategic interaction model in economics, we decompose the value function of each agent into the sum of the expected cumulative rewards of the interaction between the agent and each neighbor. This novel value function is decentralized and decomposable, which enables it to scale well to the dynamic changes in the number of large-scale agents. Hereby, the corresponding strategic interaction reinforcement learning algorithm (SIQ), is proposed to learn the optimal policy of each agent, wherein a neural network is employed to estimate the expected cumulative reward for the interaction between the agent and one of its neighbors. We test the validity of the proposed method in a mixed cooperative-competitive confrontation game through numerical experiments. Furthermore, the scalability comparison experiments illustrate that the scalability of the SIQ algorithm outperforms the independent learning and mean field reinforcement learning algorithms in multiple scenarios with different and dynamically changing numbers.
引用
收藏
页码:119000 / 119009
页数:10
相关论文
共 50 条
  • [21] Competitive Evolution Multi-Agent Deep Reinforcement Learning
    Zhou, Wenhong
    Chen, Yiting
    Li, Jie
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [22] Multi-Agent Deep Reinforcement Learning in Vehicular OCC
    Islam, Amirul
    Musavian, Leila
    Thomos, Nikolaos
    [J]. 2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [23] Teaching on a Budget in Multi-Agent Deep Reinforcement Learning
    Ilhan, Ercument
    Gow, Jeremy
    Perez-Liebana, Diego
    [J]. 2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [24] A Hybrid Interaction Model for Multi-Agent Reinforcement Learning
    Guisi, Douglas M.
    Ribeiro, Richardson
    Teixeira, Marcelo
    Borges, Andre P.
    Dosciatti, Eden R.
    Enembreck, Fabricio
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 2 (ICEIS), 2016, : 54 - 61
  • [25] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [26] Multi-Agent Deep Reinforcement Learning for Multi-Object Tracker
    Jiang, Mingxin
    Hai, Tao
    Pan, Zhigeng
    Wang, Haiyan
    Jia, Yinjie
    Deng, Chao
    [J]. IEEE ACCESS, 2019, 7 : 32400 - 32407
  • [27] Learning multi-agent communication with double attentional deep reinforcement learning
    Mao, Hangyu
    Zhang, Zhengchao
    Xiao, Zhen
    Gong, Zhibo
    Ni, Yan
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)
  • [28] Learning to school in dense configurations with multi-agent deep reinforcement learning
    Zhu, Yi
    Pang, Jian-Hua
    Gao, Tong
    Tian, Fang-Bao
    [J]. BIOINSPIRATION & BIOMIMETICS, 2023, 18 (01)
  • [29] Learning multi-agent communication with double attentional deep reinforcement learning
    Hangyu Mao
    Zhengchao Zhang
    Zhen Xiao
    Zhibo Gong
    Yan Ni
    [J]. Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [30] Strategic maneuver and disruption with reinforcement learning approaches for multi-agent coordination
    Asher, Derrik E.
    Basak, Anjon
    Fernandez, Rolando
    Sharma, Piyush K.
    Zaroukian, Erin G.
    Hsu, Christopher D.
    Dorothy, Michael R.
    Mahre, Thomas
    Galindo, Gerardo
    Frerichs, Luke
    Rogers, John
    Fossaceca, John
    [J]. JOURNAL OF DEFENSE MODELING AND SIMULATION-APPLICATIONS METHODOLOGY TECHNOLOGY-JDMS, 2023, 20 (04): : 509 - 526