Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

被引:0
|
作者
Fu, Qingxu [1 ,2 ]
Qiu, Tenghai [1 ]
Yi, Jianqiang [1 ,2 ]
Pu, Zhiqiang [1 ,2 ]
Wu, Shiguang [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When dealing with a series of imminent issues, humans can naturally concentrate on a subset of these concerning issues by prioritizing them according to their contributions to motivational indices, e.g., the probability of winning a game. This idea of concentration offers insights into reinforcement learning of sophisticated Large-scale Multi-Agent Systems (LMAS) participated by hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities considering several motivational indices, e.g., expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of observed entities to extract features. Second, distinct from the well-known attention mechanism, ConcNet has a unique motivational subnetwork to explicitly consider the motivational indices when scoring the observed entities. Furthermore, we present a concentration policy gradient architecture that can learn effective policies in LMAS from scratch. Extensive experiments demonstrate that the presented architecture has excellent scalability and flexibility, and significantly outperforms existing methods on LMAS benchmarks.
引用
收藏
页码:9341 / 9349
页数:9
相关论文
共 50 条
  • [41] A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control
    Li, Xuesi
    Li, Jingchen
    Shi, Haobin
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21433 - 21447
  • [42] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
    Zhao, Bocheng
    Huo, Mingying
    Li, Zheng
    Yu, Ze
    Qi, Naiming
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [43] A Large-Scale Multi-Agent Deep Reinforcement Learning Method for Cooperative Output Voltage Control of PEMFCs
    Li, Jiawen
    Cui, Haoyang
    Jiang, Wei
    Yu, Hengwen
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (01): : 78 - 94
  • [44] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [45] Large-Scale Computation Offloading Using a Multi-Agent Reinforcement Learning in Heterogeneous Multi-Access Edge Computing
    Gao, Zhen
    Yang, Lei
    Dai, Yu
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (06) : 3425 - 3443
  • [46] Boolean Network Models of Collective Dynamics of Open and Closed Large-Scale Multi-agent Systems
    Tosic, Predrag T.
    Ordonez, Carlos
    INDUSTRIAL APPLICATIONS OF HOLONIC AND MULTI-AGENT SYSTEMS, 2017, 10444 : 95 - 110
  • [47] Large-scale multi-agent transportation simulations
    Cetin, N
    Nagel, K
    Raney, B
    Voellmy, A
    COMPUTER PHYSICS COMMUNICATIONS, 2002, 147 (1-2) : 559 - 564
  • [48] Large-Scale Multi-Agent Deep FBSDEs
    Chen, Tianrong
    Wang, Ziyi
    Exarchos, Ioannis
    Theodorou, Evangelos A.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [49] Multi-Agent Learning Automata for Online Adaptive Control of Large-Scale Traffic Signal Systems
    Hou, Xuewei
    Chen, Lixing
    Tang, Junhua
    Li, Jianhua
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1497 - 1502
  • [50] Dynamic and adaptive replication for large-scale reliable multi-agent systems
    Guessoum, Z
    Briot, JP
    Marin, O
    Hamel, A
    Sens, P
    SOFTWARE ENGINEERING FOR LARGE-SCALE MULTI-AGENT SYSTEMS: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2003, 2603 : 182 - 198