Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

Citations: 0
Authors
Fu, Qingxu [1, 2]
Qiu, Tenghai [1]
Yi, Jianqiang [1, 2]
Pu, Zhiqiang [1, 2]
Wu, Shiguang [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
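Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract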
When facing a series of pressing issues, humans naturally concentrate on a subset of them, prioritizing each issue by its contribution to motivational indices such as the probability of winning a game. This idea of concentration offers insights into reinforcement learning for sophisticated Large-scale Multi-Agent Systems (LMAS) involving hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities with respect to several motivational indices, e.g., the expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of the observed entities to extract features. Second, unlike the well-known attention mechanism, ConcNet has a dedicated motivational subnetwork that explicitly considers the motivational indices when scoring the observed entities. Furthermore, we present a concentration policy gradient architecture that can learn effective policies in LMAS from scratch. Extensive experiments demonstrate that the presented architecture has excellent scalability and flexibility, and significantly outperforms existing methods on LMAS benchmarks.
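The score, rank, prune, and aggregate steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' ConcNet implementation: the function name `concentrate`, the parameter `k`, and the softmax-weighted average used for aggregation are all assumptions made for illustration, and the scores are passed in directly rather than produced by a motivational subnetwork.

```python
import math
from typing import List, Sequence, Tuple


def concentrate(
    encodings: Sequence[Sequence[float]],
    scores: Sequence[float],
    k: int,
) -> Tuple[List[float], List[int]]:
    """Rank entities by score, keep the top k, and aggregate their encodings.

    Hypothetical helper: the scores stand in for the output of a scoring
    network, and the softmax-weighted mean is an assumed aggregation rule.
    """
    if len(encodings) != len(scores):
        raise ValueError("encodings and scores must have the same length")
    if not encodings:
        raise ValueError("at least one observed entity is required")

    # Clamp k so that at least one and at most all entities are kept.
    k = max(1, min(k, len(encodings)))

    # Rank: entity indices sorted by descending score (ties keep input order).
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    # Prune: discard everything below the k-th ranked entity.
    kept = order[:k]

    # Aggregate: numerically stable softmax over the kept scores.
    top = max(scores[i] for i in kept)
    weights = [math.exp(scores[i] - top) for i in kept]
    total = sum(weights)

    dim = len(encodings[0])
    feature = [0.0] * dim
    for w, i in zip(weights, kept):
        for d in range(dim):
            feature[d] += (w / total) * encodings[i][d]
    return feature, kept


if __name__ == "__main__":
    entity_encodings = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0]]
    entity_scores = [2.0, 2.0, -10.0]
    feature, kept = concentrate(entity_encodings, entity_scores, k=2)
    print(kept, feature)
```

Because pruning happens before aggregation, the cost of the aggregation step depends on `k` rather than on the total number of observed entities, which is the kind of saving the abstract attributes to concentrating on a subset.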
Pages: 9341-9349
Number of pages: 9
Related papers
50 in total
  • [21] Organizational Metamodel for Large-Scale Multi-Agent Systems
    Duric, Bogdan Okresa
    TRENDS IN PRACTICAL APPLICATIONS OF SCALABLE MULTI-AGENT SYSTEMS, THE PAAMS COLLECTION, 2016, 473 : 387 - 390
  • [22] Requirements engineering for large-scale multi-agent systems
    Cysneiros, LM
    Yu, E
    SOFTWARE ENGINEERING FOR LARGE-SCALE MULTI-AGENT SYSTEMS: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2003, 2603 : 39 - 56
  • [23] Macroscopic Observation of Large-scale Multi-agent Systems
    Lamarche-Perrin, Robin
    Demazeau, Yves
    Vincent, Jean-Marc
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 121 - 127
  • [24] Towards reliable large-scale multi-agent systems
    Guessoum, Z
    Faci, N
    MULTI-AGENT SYSTEMS AND APPLICATIONS IV, PROCEEDINGS, 2005, 3690 : 430 - 439
  • [25] Intelligent planning for large-scale multi-agent systems
    Ma, Hang
    AI MAGAZINE, 2022, 43 (04) : 376 - 382
  • [26] Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network
    Kim, Juhyeon
    Kim, Kihyun
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 990 - 995
  • [27] Multi-Agent Reinforcement Learning for Resource Allocation in Large-Scale Robotic Warehouse Sortation Centers
    Shen, Yi
    McClosky, Benjamin
    Durham, Joseph W.
    Zavlanos, Michael M.
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7137 - 7143
  • [28] Multi-Agent Deep Reinforcement Learning for Large-scale Platoon Coordination with Partial Information at Hubs
    Wei, Dixiao
    Yi, Peng
    Lei, Jinlong
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6242 - 6248
  • [29] Large-Scale Multi-Agent Reinforcement Learning Using Image-Based State Representation
    Chu, Tianshu
    Qu, Shuhui
    Wang, Jie
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 7592 - 7597
  • [30] Multi-Agent Reinforcement Learning Based on Representational Communication for Large-Scale Traffic Signal Control
    Bokade, Rohit
    Jin, Xiaoning
    Amato, Christopher
    IEEE ACCESS, 2023, 11 : 47646 - 47658