A Model-Based Reinforcement Learning Algorithm for Multi-Agent Cooperation Nash Equilibrium With Unstable Communication

被引:1
|
作者
Jiang, Yuannan [1 ,2 ]
Jiang, Shengming [1 ]
Wang, Xiaofeng [1 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[2] East China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
GRAPHICAL GAMES; CONSENSUS;
D O I
10.1109/TCSII.2023.3263297
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Solving Nash equilibrium is important to multi-agent systems, in which the communication is an important factor. However, many proposed reinforcement learning (RL) based algorithms take into account the communication factors by assuming stable communication conditions, which does not hold in the real environment. In this brief, we analyze the effect of a typical RL algorithm in the cases of unstable communication and communication failure, which causes information loss between agents, leading to isolation of agents and affecting algorithm convergence. Then, we propose a model-based RL algorithm to solve Nash equilibrium for multi-agent systems when agents are isolated, and prove its convergence and rationality through mathematical proofs. The simulations results show the effectiveness of the proposed algorithm.
引用
收藏
页码:4743 / 4747
页数:5
相关论文
共 50 条
  • [41] Partial Communication Model based on the Gain of Q-value in Multi-agent Reinforcement Learning
    Xu, Jie
    Wei, Wei
    Zhang, Ya
    Cui, Peng
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 69 - 74
  • [42] Multi-agent Reinforcement Learning Based on K-Means Algorithm
    Liu Changan
    Liu Fei
    Liu Chunyang
    Wu Hua
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 414 - 418
  • [43] Multi-agent reinforcement learning clustering algorithm based on silhouette coefficient
    Du, Peng
    Li, Fenglian
    Shao, Jianli
    NEUROCOMPUTING, 2024, 596
  • [44] Knowledge-guided communication preference learning model for multi-agent cooperation
    Zhang, Han
    Yu, Hang
    Wang, Xiaoming
    Wang, Mengke
    Zhang, Zhenyu
    Li, Yang
    Xie, Shaorong
    Luo, Xiangfeng
    INFORMATION SCIENCES, 2024, 667
  • [45] ACM: Learning Dynamic Multi-agent Cooperation via Attentional Communication Model
    Han, Xue
    Yan, Hongping
    Zhang, Junge
    Wang, Lingfeng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 219 - 229
  • [46] Efficient multi-agent cooperation: Scalable reinforcement learning with heterogeneous graph networks and limited communication
    Li, Z.
    Yang, Y.
    Cheng, H.
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [47] Model-Based Self-Advising for Multi-Agent Learning
    Ye, Dayong
    Zhu, Tianqing
    Zhu, Congcong
    Zhou, Wanlei
    Yu, Philip S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7934 - 7945
  • [48] Sequence to Sequence Multi-agent Reinforcement Learning Algorithm
    Shi T.
    Wang L.
    Huang Z.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 206 - 213
  • [49] A new accelerating algorithm for multi-agent reinforcement learning
    张汝波
    仲宇
    顾国昌
    Journal of Harbin Institute of Technology, 2005, (01) : 48 - 51
  • [50] Model-based learning of interaction strategies in multi-agent systems
    Carmel, D
    Markovitch, S
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1998, 10 (03) : 309 - 332