Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment

被引:5
|
作者
Jiang, Wei-Cheng [1 ,2 ]
Narayanan, Vignesh [2 ]
Li, Jr-Shin [2 ]
机构
[1] Tunghai Univ, Dept Elect Engn, Taichung 40704, Taiwan
[2] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63130 USA
基金
美国国家卫生研究院;
关键词
Stochastic processes; Task analysis; Computational modeling; Clustering algorithms; Numerical models; Multi-agent systems; Fuses; Knowledge sharing; model learning; multiagent system; reinforcement learning (RL); sample efficiency; REINFORCEMENT; AGENTS;
D O I
10.1109/TCYB.2019.2958912
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An imposing task for a reinforcement learning agent in an uncertain environment is to expeditiously learn a policy or a sequence of actions, with which it can achieve the desired goal. In this article, we present an incremental model learning scheme to reconstruct the model of a stochastic environment. In the proposed learning scheme, we introduce a clustering algorithm to assimilate the model information and estimate the probability for each state transition. In addition, utilizing the reconstructed model, we present an experience replay strategy to create virtual interactive experiences by incorporating a balance between exploration and exploitation, which greatly accelerates learning and enables planning. Furthermore, we extend the proposed learning scheme for a multiagent framework to decrease the effort required for exploration and to reduce the learning time in a large environment. In this multiagent framework, we introduce a knowledge-sharing algorithm to share the reconstructed model information among the different agents, as needed, and develop a computationally efficient knowledge fusing mechanism to fuse the knowledge acquired using the agents' own experience with the knowledge received from its teammates. Finally, the simulation results with comparative analysis are provided to demonstrate the efficacy of the proposed methods in the complex learning tasks.
引用
收藏
页码:5717 / 5727
页数:11
相关论文
共 50 条
  • [1] Model Learning and Knowledge Sharing for a Multiagent System With Dyna-Q Learning
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (05) : 964 - 976
  • [2] A layered design model for knowledge and information sharing cooperative systems
    Nabuco, O
    Drira, K
    Dantas, E
    [J]. PROCEEDINGS OF THE TENTH IEEE INTERNATIONAL WORKSHOPS ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES, 2001, : 305 - 310
  • [3] Learning coordination strategies for cooperative multiagent systems
    Ho, F
    Kamel, M
    [J]. MACHINE LEARNING, 1998, 33 (2-3) : 155 - 177
  • [4] The dynamics of reinforcement learning in cooperative multiagent systems
    Claus, C
    Boutilier, C
    [J]. FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 746 - 752
  • [5] Learning Coordination Strategies for Cooperative Multiagent Systems
    F. Ho
    M. Kamel
    [J]. Machine Learning, 1998, 33 : 155 - 177
  • [6] Model Predictive Cooperative Control With ISM for Multiagent Systems Under Stochastic Communication Protocol
    Yuan, Yuan
    Guo, Lei
    Liu, Huaping
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (12) : 6004 - 6016
  • [7] Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems
    Hao, Jianye
    Leung, Ho-Fung
    Ming, Zhong
    [J]. ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)
  • [8] A Model Learning Based Multiagent Flocking Collaborative Control Method for Stochastic Communication Environment
    Xiao, Jian
    Huang, Chongjun
    Yuan, Guohui
    Wang, Yaoting
    Jia, Honyu
    Wang, Zhuoran
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8896 - 8906
  • [9] Cooperative multiagent learning
    Plaza, E
    Ontañón, S
    [J]. ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS: ADAPTATION AND MULTI-AGENT LEARNING, 2003, 2636 : 1 - 17
  • [10] COOPERATIVE LEARNING IN MULTIAGENT SYSTEMS FROM INTERMITTENT MEASUREMENTS
    Leonard, Naomi Ehrich
    Olshevsky, Alex
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (01) : 1 - 29