The Implementation of Asynchronous Advantage Actor-Critic with Stigmergy in Network-assisted Multi-agent System

被引:0
|
作者
Chen, Kun [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [2 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
multi-agent system; stigmergy mechanism; digital pheromones; deep reinforcement learning; KHEPERA IV robots;
D O I
10.1109/wcsp49889.2020.9299839
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-agent system (MAS) needs to mobilize multiple simple agents to complete complex tasks. However, it is difficult to coherently coordinate distributed agents by means of limited local information. In this paper, we propose a decentralized collaboration method named as "stigmergy" in network-assisted MAS, by exploiting digital pheromones (DP) as an indirect medium of communication and utilizing deep reinforcement learning (DRL) on top. Correspondingly, we implement an experimental platform, where KHEPERA IV robots form targeted specific shapes in a decentralized manner. Experimental results demonstrate the effectiveness and efficiency of the proposed method. Our platform could be conveniently extended to investigate the impact of network factors (e.g., latency, data rate, etc) on the level of collective intelligence.
引用
收藏
页码:1082 / 1087
页数:6
相关论文
共 50 条
  • [21] Asynchronous Advantage Actor-Critic (A3C) Learning for Cognitive Network Security
    Muhati, Eric
    Rawat, Danda B.
    [J]. 2021 THIRD IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2021), 2021, : 106 - 113
  • [22] Actor-Critic for Multi-Agent Reinforcement Learning with Self-Attention
    Zhao, Juan
    Zhu, Tong
    Xiao, Shuo
    Gao, Zongqian
    Sun, Hao
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (09)
  • [23] Multi-agent reinforcement learning by the actor-critic model with an attention interface
    Zhang, Lixiang
    Li, Jingchen
    Zhu, Yi'an
    Shi, Haobin
    Hwang, Kao-Shing
    [J]. NEUROCOMPUTING, 2022, 471 : 275 - 284
  • [24] Structural relational inference actor-critic for multi-agent reinforcement learning
    Zhang, Xianjie
    Liu, Yu
    Xu, Xiujuan
    Huang, Qiong
    Mao, Hangyu
    Carie, Anil
    [J]. NEUROCOMPUTING, 2021, 459 : 383 - 394
  • [25] Capacity-Limited Decentralized Actor-Critic for Multi-Agent Games
    Malloy, Tyler
    Sims, Chris R.
    Klinger, Tim
    Liu, Miao
    Riemer, Matthew
    Tesauro, Gerald
    [J]. 2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 332 - 339
  • [26] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
    Lowe, Ryan
    Wu, Yi
    Tamar, Aviv
    Harb, Jean
    Abbeel, Pieter
    Mordatch, Igor
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [27] Bi-level Multi-Agent Actor-Critic Methods with Transformers
    Wan, Tianjiao
    Mi, Haibo
    Gao, Zijian
    Zhai, Yuanzhao
    Ding, Bo
    Feng, Dawei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING, JCC, 2023, : 9 - 16
  • [28] Multi-Agent Actor-Critic for Cooperative Resource Allocation in Vehicular Networks
    Hammami, Nessrine
    Nguyen, Kim Khoa
    [J]. PROCEEDINGS OF THE 2022 14TH IFIP WIRELESS AND MOBILE NETWORKING CONFERENCE (WMNC 2022), 2022, : 93 - 100
  • [29] Cooperative multi-agent actor-critic control of traffic network flow based on edge computing
    Zhang, Yongnan
    Zhou, Yonghua
    Lu, Huapu
    Fujita, Hamido
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 128 - 141
  • [30] Towards Understanding Asynchronous Advantage Actor-Critic: Convergence and Linear Speedup
    Shen, Han
    Zhang, Kaiqing
    Hong, Mingyi
    Chen, Tianyi
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 2579 - 2594