The Implementation of Asynchronous Advantage Actor-Critic with Stigmergy in Network-assisted Multi-agent System

被引：0

作者：

Chen, Kun ^{[1
]}

Li, Rongpeng ^{[1
]}

Zhao, Zhifeng ^{[2
]}

Zhang, Honggang ^{[1
]}

机构：

[1] Zhejiang Univ, Hangzhou, Peoples R China

[2] Zhejiang Lab, Hangzhou, Peoples R China

来源：

2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP) | 2020年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

multi-agent system; stigmergy mechanism; digital pheromones; deep reinforcement learning; KHEPERA IV robots;

D O I：

10.1109/wcsp49889.2020.9299839

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-agent system (MAS) needs to mobilize multiple simple agents to complete complex tasks. However, it is difficult to coherently coordinate distributed agents by means of limited local information. In this paper, we propose a decentralized collaboration method named as "stigmergy" in network-assisted MAS, by exploiting digital pheromones (DP) as an indirect medium of communication and utilizing deep reinforcement learning (DRL) on top. Correspondingly, we implement an experimental platform, where KHEPERA IV robots form targeted specific shapes in a decentralized manner. Experimental results demonstrate the effectiveness and efficiency of the proposed method. Our platform could be conveniently extended to investigate the impact of network factors (e.g., latency, data rate, etc) on the level of collective intelligence.

引用

页码：1082 / 1087

页数：6

共 50 条

[31] Adversarial retraining attack of asynchronous advantage actor-critic based pathfinding
Chen Tong
Liu Jiqiang
Xiang Yingxiao
Niu Wenjia
Tong Endong
Wang Shuoru
Li He
Chang Liang
Li Gang
Alfred, Chen Qi
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (05) : 2323 - 2346
[32] Workflow scheduling based on asynchronous advantage actor-critic algorithm in multi-cloud environment
Tang, Xuhao
Liu, Fagui
Wang, Bin
Xu, Dishi
Jiang, Jun
Wu, Qingbo
Chen, C. L. Philip
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[33] Traffic signal control method based on asynchronous advantage actor-critic
Ye, Baolin
Sun, Ruitao
Wu, Weimin
Chen, Bin
Yao, Qing
[J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (08): : 1671 - 1680
[34] A new noise network and gradient parallelisation-based asynchronous advantage actor-critic algorithm
Fei, Zhengshun
Wang, Yanping
Wang, Jinglong
Liu, Kangling
Huang, Bingqiang
Tan, Ping
[J]. IET CYBER-SYSTEMS AND ROBOTICS, 2022, 4 (03) : 175 - 188
[35] Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization
Karn, Sanjeev Kumar
Liu, Ning
Schuetze, Hinrich
Farri, Oladimeji
[J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1542 - 1553
[36] Multi-agent Attention Actor-Critic Algorithm for Load Balancing in Cellular Networks
Kang, Jikun
Wu, Di
Wang, Ju
Hossain, Ekram
Liu, Xue
Dedek, Gregory
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5160 - 5165
[37] Toward Resilient Multi-Agent Actor-Critic Algorithms for Distributed Reinforcement Learning
Lin, Yixuan
Gade, Shripad
Sandhu, Romeil
Liu, Ji
[J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3953 - 3958
[38] Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning
Chen, Dingyang
Zhang, Qi
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[39] Caching Transient Content for IoT Sensing: Multi-Agent Soft Actor-Critic
Wu, Xiongwei
Li, Xiuhua
Li, Jun
Ching, P. C.
Leung, Victor C. M.
Poor, H. Vincent
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (09) : 5886 - 5901
[40] Approximate Dynamic Programming Solutions of Multi-Agent Graphical Games Using Actor-Critic Network Structures
Abouheaf, Mohammed I.
Lewis, Frank L.
[J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,

← 1 2 3 4 5 →