Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

Cited by: 0

Authors:

Fu, Qingxu ^{[1,2]}

Qiu, Tenghai ^{[1]}

Yi, Jianqiang ^{[1,2]}

Pu, Zhiqiang ^{[1,2]}

Wu, Shiguang ^{[1,2]}

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

Source:

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

When dealing with a series of imminent issues, humans can naturally concentrate on a subset of these issues by prioritizing them according to their contributions to motivational indices, e.g., the probability of winning a game. This idea of concentration offers insights into reinforcement learning of sophisticated Large-scale Multi-Agent Systems (LMAS) involving hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities considering several motivational indices, e.g., expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of observed entities to extract features. Second, distinct from the well-known attention mechanism, ConcNet has a unique motivational subnetwork to explicitly consider the motivational indices when scoring the observed entities. Furthermore, we present a concentration policy gradient architecture that can learn effective policies in LMAS from scratch. Extensive experiments demonstrate that the presented architecture has excellent scalability and flexibility, and significantly outperforms existing methods on LMAS benchmarks.
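The score, rank, prune, and aggregate steps named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' ConcNet: the record gives no architectural details, so the linear scorer `linear_scores` (a stand-in for the learned motivational subnetwork), the softmax-weighted pooling, and all function and parameter names here are assumptions made only for illustration.

```python
import math
from typing import List, Sequence, Tuple


def linear_scores(encodings: Sequence[Sequence[float]],
                  w: Sequence[float]) -> List[float]:
    """Hypothetical scorer: dot product of each entity encoding with w.

    Stands in for a learned scoring network; an assumption, not ConcNet.
    """
    return [sum(x * wi for x, wi in zip(enc, w)) for enc in encodings]


def concentrate(encodings: Sequence[Sequence[float]],
                scores: Sequence[float],
                k: int) -> Tuple[List[int], List[float]]:
    """Rank entities by score, keep the top k, and pool their encodings.

    Returns the kept entity indices (highest score first) and the pooled
    feature vector, a softmax-weighted sum of the kept encodings.
    """
    if not encodings or len(encodings) != len(scores):
        raise ValueError("need one score per entity and at least one entity")
    k = max(1, min(k, len(encodings)))

    # Rank: indices sorted by descending score (ties keep input order).
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    # Prune: discard everything beyond the top k.
    kept = order[:k]

    # Aggregate: numerically stable softmax over the kept scores.
    top = max(scores[i] for i in kept)
    exps = [math.exp(scores[i] - top) for i in kept]
    total = sum(exps)
    weights = [e / total for e in exps]

    dim = len(encodings[0])
    pooled = [sum(wt * encodings[i][d] for wt, i in zip(weights, kept))
              for d in range(dim)]
    return kept, pooled


if __name__ == "__main__":
    entity_encodings = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0]]
    entity_scores = linear_scores(entity_encodings, [1.0, 1.0])
    print(concentrate(entity_encodings, entity_scores, k=2))
```

Pruning before pooling keeps the aggregation cost tied to `k` rather than to the number of observed entities, which is the scalability argument the abstract makes against aggregating over every entity.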


Pages: 9341-9349

Number of pages: 9
