Stigmergic Independent Reinforcement Learning for Multiagent Collaboration

Cited by: 16

Authors:

Xu, Xing [1]

Li, Rongpeng [1]

Zhao, Zhifeng [2]

Zhang, Honggang [1]

Affiliations:

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

[2] Zhejiang Lab, Hangzhou, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2022 / Vol. 33 / Issue 09

Funding:

National Natural Science Foundation of China;

Keywords:

Collaboration; Training; Shape; Insects; Task analysis; Intelligent agents; Wireless communication; Artificial intelligence; collective intelligence; multiagent collaboration; reinforcement learning; stigmergy; ANT COLONY; COORDINATION; ENVIRONMENT;

DOI:

10.1109/TNNLS.2021.3056418

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

With the rapid evolution of wireless mobile devices, there emerges an increased need to design effective collaboration mechanisms between intelligent agents to gradually approach the final collective objective by continuously learning from the environment based on their individual observations. In this regard, independent reinforcement learning (IRL) is often deployed in multiagent collaboration to alleviate the problem of a nonstationary learning environment. However, behavioral strategies of intelligent agents in IRL can be formulated only upon their local individual observations of the global environment, and appropriate communication mechanisms must be introduced to reduce their behavioral localities. In this article, we address the problem of communication between intelligent agents in IRL by jointly adopting mechanisms with two different scales. For the large scale, we introduce the stigmergy mechanism as an indirect communication bridge between independent learning agents, and carefully design a mathematical method to indicate the impact of digital pheromone. For the small scale, we propose a conflict-avoidance mechanism between adjacent agents by implementing an additionally embedded neural network to provide more opportunities for participants with higher action priorities. In addition, we present a federal training method to effectively optimize the neural network of each agent in a decentralized manner. Finally, we establish a simulation scenario in which a number of mobile agents in a certain area move automatically to form a specified target shape. Extensive simulations demonstrate the effectiveness of our proposed method.


Pages: 4285 - 4299

Page count: 15

50 items in total

[1] Trustable Policy Collaboration Scheme for Multi-Agent Stigmergic Reinforcement Learning
Xu, Xing
Li, Rongpeng
Zhao, Zhifeng
Zhang, Honggang
[J]. IEEE COMMUNICATIONS LETTERS, 2022, 26 (04) : 823 - 827
[2] Satisficing Paths and Independent Multiagent Reinforcement Learning in Stochastic Games
Yongacoglu, Bora
Arslan, Gurdal
Yuksel, Serdar
[J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2023, 5 (03) : 745 - 773
[3] Multiagent Collaboration for Emergency Evacuation Using Reinforcement Learning for Transportation Systems
Yang, Yupeng
Yu, Jiahao
Liu, Dahai
Lee, Sang-A
Namilae, Sirish
Islam, Sabique
Gou, Huaxing
Park, Hyoshin
Song, Houbing
[J]. IEEE Journal on Miniaturization for Air and Space Systems, 2022, 3 (04) : 232 - 241
[4] Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem
Zhang, Chengwei
Jin, Shan
Xue, Wanli
Xie, Xiaofei
Chen, Shengyong
Chen, Rong
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7426 - 7436
[5] A multiagent reinforcement learning algorithm to solve the maximum independent set problem
Alipour, Mir Mohammad
Abdolhosseinzadeh, Mohsen
[J]. MULTIAGENT AND GRID SYSTEMS, 2020, 16 (01) : 101 - 115
[6] Asymmetric multiagent reinforcement learning
Könönen, V
[J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
[7] Lateral Transfer Learning for Multiagent Reinforcement Learning
Shi, Haobin
Li, Jingchen
Mao, Jiahui
Hwang, Kao-Shing
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
[8] Learning to Teach in Cooperative Multiagent Reinforcement Learning
Omidshafiei, Shayegan
Kim, Dong-Ki
Liu, Miao
Tesauro, Gerald
Riemer, Matthew
Amato, Christopher
Campbell, Murray
How, Jonathan P.
[J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
[9] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
da Silva, Felipe Leno
Glatt, Ruben
Reali Costa, Anna Helena
[J]. AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
[10] Learning Cooperative Behaviours in Multiagent Reinforcement Learning
Phon-Amnuaisuk, Somnuk
[J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 570 - 579
