The Implementation of Asynchronous Advantage Actor-Critic with Stigmergy in Network-assisted Multi-agent System

被引:0
|
作者
Chen, Kun [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [2 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
multi-agent system; stigmergy mechanism; digital pheromones; deep reinforcement learning; KHEPERA IV robots;
D O I
10.1109/wcsp49889.2020.9299839
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-agent system (MAS) needs to mobilize multiple simple agents to complete complex tasks. However, it is difficult to coherently coordinate distributed agents by means of limited local information. In this paper, we propose a decentralized collaboration method named as "stigmergy" in network-assisted MAS, by exploiting digital pheromones (DP) as an indirect medium of communication and utilizing deep reinforcement learning (DRL) on top. Correspondingly, we implement an experimental platform, where KHEPERA IV robots form targeted specific shapes in a decentralized manner. Experimental results demonstrate the effectiveness and efficiency of the proposed method. Our platform could be conveniently extended to investigate the impact of network factors (e.g., latency, data rate, etc) on the level of collective intelligence.
引用
收藏
页码:1082 / 1087
页数:6
相关论文
共 50 条
  • [31] Adversarial retraining attack of asynchronous advantage actor-critic based pathfinding
    Chen Tong
    Liu Jiqiang
    Xiang Yingxiao
    Niu Wenjia
    Tong Endong
    Wang Shuoru
    Li He
    Chang Liang
    Li Gang
    Alfred, Chen Qi
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (05) : 2323 - 2346
  • [32] Workflow scheduling based on asynchronous advantage actor-critic algorithm in multi-cloud environment
    Tang, Xuhao
    Liu, Fagui
    Wang, Bin
    Xu, Dishi
    Jiang, Jun
    Wu, Qingbo
    Chen, C. L. Philip
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [33] Traffic signal control method based on asynchronous advantage actor-critic
    Ye, Baolin
    Sun, Ruitao
    Wu, Weimin
    Chen, Bin
    Yao, Qing
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (08): : 1671 - 1680
  • [34] A new noise network and gradient parallelisation-based asynchronous advantage actor-critic algorithm
    Fei, Zhengshun
    Wang, Yanping
    Wang, Jinglong
    Liu, Kangling
    Huang, Bingqiang
    Tan, Ping
    [J]. IET CYBER-SYSTEMS AND ROBOTICS, 2022, 4 (03) : 175 - 188
  • [35] Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization
    Karn, Sanjeev Kumar
    Liu, Ning
    Schuetze, Hinrich
    Farri, Oladimeji
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1542 - 1553
  • [36] Multi-agent Attention Actor-Critic Algorithm for Load Balancing in Cellular Networks
    Kang, Jikun
    Wu, Di
    Wang, Ju
    Hossain, Ekram
    Liu, Xue
    Dedek, Gregory
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5160 - 5165
  • [37] Toward Resilient Multi-Agent Actor-Critic Algorithms for Distributed Reinforcement Learning
    Lin, Yixuan
    Gade, Shripad
    Sandhu, Romeil
    Liu, Ji
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3953 - 3958
  • [38] Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning
    Chen, Dingyang
    Zhang, Qi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [39] Caching Transient Content for IoT Sensing: Multi-Agent Soft Actor-Critic
    Wu, Xiongwei
    Li, Xiuhua
    Li, Jun
    Ching, P. C.
    Leung, Victor C. M.
    Poor, H. Vincent
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (09) : 5886 - 5901
  • [40] Approximate Dynamic Programming Solutions of Multi-Agent Graphical Games Using Actor-Critic Network Structures
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,