Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment

被引：5

作者：

Jiang, Wei-Cheng ^{[1
,2
]}

Narayanan, Vignesh ^{[2
]}

Li, Jr-Shin ^{[2
]}

机构：

[1] Tunghai Univ, Dept Elect Engn, Taichung 40704, Taiwan

[2] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63130 USA

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2021年 / 51卷 / 12期

基金：

美国国家卫生研究院;

关键词：

Stochastic processes; Task analysis; Computational modeling; Clustering algorithms; Numerical models; Multi-agent systems; Fuses; Knowledge sharing; model learning; multiagent system; reinforcement learning (RL); sample efficiency; REINFORCEMENT; AGENTS;

D O I：

10.1109/TCYB.2019.2958912

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

An imposing task for a reinforcement learning agent in an uncertain environment is to expeditiously learn a policy or a sequence of actions, with which it can achieve the desired goal. In this article, we present an incremental model learning scheme to reconstruct the model of a stochastic environment. In the proposed learning scheme, we introduce a clustering algorithm to assimilate the model information and estimate the probability for each state transition. In addition, utilizing the reconstructed model, we present an experience replay strategy to create virtual interactive experiences by incorporating a balance between exploration and exploitation, which greatly accelerates learning and enables planning. Furthermore, we extend the proposed learning scheme for a multiagent framework to decrease the effort required for exploration and to reduce the learning time in a large environment. In this multiagent framework, we introduce a knowledge-sharing algorithm to share the reconstructed model information among the different agents, as needed, and develop a computationally efficient knowledge fusing mechanism to fuse the knowledge acquired using the agents' own experience with the knowledge received from its teammates. Finally, the simulation results with comparative analysis are provided to demonstrate the efficacy of the proposed methods in the complex learning tasks.

引用

页码：5717 / 5727

页数：11

共 50 条

[1] Model Learning and Knowledge Sharing for a Multiagent System With Dyna-Q Learning
Hwang, Kao-Shing
Jiang, Wei-Cheng
Chen, Yu-Jen
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (05) : 964 - 976
[2] A layered design model for knowledge and information sharing cooperative systems
Nabuco, O
Drira, K
Dantas, E
[J]. PROCEEDINGS OF THE TENTH IEEE INTERNATIONAL WORKSHOPS ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES, 2001, : 305 - 310
[3] Learning coordination strategies for cooperative multiagent systems
Ho, F
Kamel, M
[J]. MACHINE LEARNING, 1998, 33 (2-3) : 155 - 177
[4] The dynamics of reinforcement learning in cooperative multiagent systems
Claus, C
Boutilier, C
[J]. FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 746 - 752
[5] Learning Coordination Strategies for Cooperative Multiagent Systems
F. Ho
M. Kamel
[J]. Machine Learning, 1998, 33 : 155 - 177
[6] Model Predictive Cooperative Control With ISM for Multiagent Systems Under Stochastic Communication Protocol
Yuan, Yuan
Guo, Lei
Liu, Huaping
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (12) : 6004 - 6016
[7] Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems
Hao, Jianye
Leung, Ho-Fung
Ming, Zhong
[J]. ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)
[8] A Model Learning Based Multiagent Flocking Collaborative Control Method for Stochastic Communication Environment
Xiao, Jian
Huang, Chongjun
Yuan, Guohui
Wang, Yaoting
Jia, Honyu
Wang, Zhuoran
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8896 - 8906
[9] Cooperative multiagent learning
Plaza, E
Ontañón, S
[J]. ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS: ADAPTATION AND MULTI-AGENT LEARNING, 2003, 2636 : 1 - 17
[10] COOPERATIVE LEARNING IN MULTIAGENT SYSTEMS FROM INTERMITTENT MEASUREMENTS
Leonard, Naomi Ehrich
Olshevsky, Alex
[J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (01) : 1 - 29

← 1 2 3 4 5 →