Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

Cited by: 0
Authors
Fu, Qingxu [1 ,2 ]
Qiu, Tenghai [1 ]
Yi, Jianqiang [1 ,2 ]
Pu, Zhiqiang [1 ,2 ]
Wu, Shiguang [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
When facing a series of imminent issues, humans naturally concentrate on a subset of them by prioritizing the issues according to their contributions to motivational indices, e.g., the probability of winning a game. This idea of concentration offers insights into reinforcement learning for sophisticated Large-scale Multi-Agent Systems (LMAS) involving hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities according to several motivational indices, e.g., the expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of the observed entities to extract features. Second, unlike the well-known attention mechanism, ConcNet has a dedicated motivational subnetwork that explicitly considers the motivational indices when scoring the observed entities. Furthermore, we present a concentration policy gradient architecture that can learn effective policies in LMAS from scratch. Extensive experiments demonstrate that the presented architecture has excellent scalability and flexibility, and significantly outperforms existing methods on LMAS benchmarks.
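The score, rank, prune, and aggregate steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract describes a learned motivational subnetwork, whereas this sketch assumes a fixed linear scorer over the motivational indices and a softmax-weighted sum for aggregation. The function name `concentrate` and the parameters `index_weights` and `k` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def concentrate(
    encodings: Sequence[Sequence[float]],
    indices: Sequence[Sequence[float]],
    index_weights: Sequence[float],
    k: int,
) -> Tuple[List[int], List[float]]:
    """Score, rank, prune, and aggregate entity encodings.

    encodings:     one feature vector per observed entity
    indices:       one vector of motivational indices per entity
                   (e.g. expected survival time, state value)
    index_weights: ASSUMPTION: a fixed linear scorer standing in for
                   the learned motivational subnetwork
    k:             number of entities kept after pruning
    """
    if not encodings or len(encodings) != len(indices):
        raise ValueError("encodings and indices must be non-empty and aligned")
    if k < 1:
        raise ValueError("k must be at least 1")

    # Score: weighted sum of each entity's motivational indices.
    scores = [sum(w * m for w, m in zip(index_weights, row)) for row in indices]

    # Rank: entity positions sorted by descending score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    # Prune: keep only the k highest-scoring entities.
    kept = order[:k]

    # Aggregate: softmax over the kept scores (ASSUMPTION), then a
    # weighted sum of the kept encodings.
    top = max(scores[i] for i in kept)
    exps = [math.exp(scores[i] - top) for i in kept]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(encodings[0])
    feature = [
        sum(w * encodings[i][d] for w, i in zip(weights, kept))
        for d in range(dim)
    ]
    return kept, feature


if __name__ == "__main__":
    enc = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    idx = [[0.1, 0.2], [0.9, 0.8], [0.5, 0.5]]
    kept, feature = concentrate(enc, idx, index_weights=[1.0, 1.0], k=2)
    print(kept, feature)
```

Because pruning fixes the number of aggregated entities at `k`, the aggregation cost in this sketch does not grow with the number of observed entities; only the scoring and sorting steps do.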
Pages: 9341 - 9349
Page count: 9
Related papers
50 in total
  • [31] HELSA: Hierarchical Reinforcement Learning with Spatiotemporal Abstraction for Large-Scale Multi-Agent Path Finding
    Song, Zhaoyi
    Zhang, Rongqing
    Cheng, Xiang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7318 - 7325
  • [32] Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning
    Zhang, Chi
    Odonkor, Philip
    Zheng, Shuai
    Khorasgani, Hamed
    Serita, Susumu
    Gupta, Chetan
    Wang, Haiyan
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1436 - 1441
  • [33] Coverage Optimization for Large-Scale Mobile Networks With Digital Twin and Multi-Agent Reinforcement Learning
    Liu, Haoqiang
    Li, Tong
    Jiang, Fenyu
    Su, Weikang
    Wang, Zhaocheng
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) : 18316 - 18330
  • [34] A Coordination Mechanism to Replicate Large-Scale Multi-Agent Systems
    Ductor, Sylvain
    Guessoum, Zahia
    2018 IEEE/ACM 13TH INTERNATIONAL SYMPOSIUM ON SOFTWARE ENGINEERING FOR ADAPTIVE AND SELF-MANAGING SYSTEMS (SEAMS), 2018, : 130 - 136
  • [35] Decentralized Multi-agent Reinforcement Learning for Large-scale Mobile Wireless Sensor Network Control Using Mean Field Games
    Zhou, Zejian
    Qian, Lijun
    Xu, Hao
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
  • [36] Multi-agent Reinforcement Learning in Network Management
    Bagnasco, Ricardo
    Serrat, Joan
    SCALABILITY OF NETWORKS AND SERVICES, PROCEEDINGS, 2009, 5637 : 199 - 202
  • [37] SCM network with multi-agent reinforcement learning
    Zhao, Gang
    Sun, Ruoying
    FIFTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS 1-3, 2006, : 1294 - 1300
  • [38] Addressing deadlock in large-scale, complex rail networks via multi-agent deep reinforcement learning
    Bretas, A. M. C.
    Mendes, A.
    Chalup, S.
    Jackson, M.
    Clement, R.
    Sanhueza, C.
    EXPERT SYSTEMS, 2025, 42 (01)
  • [39] A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control
    Li, Xuesi
    Li, Jingchen
    Shi, Haobin
    Applied Intelligence, 2023, 53 : 21433 - 21447
  • [40] Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning
    Yamada, Jun
    Shawe-Taylor, John
    Fountas, Zafeirios
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,