A Model-Based Reinforcement Learning Algorithm for Multi-Agent Cooperation Nash Equilibrium With Unstable Communication

被引：1

作者：

Jiang, Yuannan ^{[1
,2
]}

Jiang, Shengming ^{[1
]}

Wang, Xiaofeng ^{[1
]}

机构：

[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China

[2] East China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | 2024年 / 71卷 / 11期

基金：

中国国家自然科学基金;

关键词：

GRAPHICAL GAMES; CONSENSUS;

D O I：

10.1109/TCSII.2023.3263297

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Solving Nash equilibrium is important to multi-agent systems, in which the communication is an important factor. However, many proposed reinforcement learning (RL) based algorithms take into account the communication factors by assuming stable communication conditions, which does not hold in the real environment. In this brief, we analyze the effect of a typical RL algorithm in the cases of unstable communication and communication failure, which causes information loss between agents, leading to isolation of agents and affecting algorithm convergence. Then, we propose a model-based RL algorithm to solve Nash equilibrium for multi-agent systems when agents are isolated, and prove its convergence and rationality through mathematical proofs. The simulations results show the effectiveness of the proposed algorithm.

引用

页码：4743 / 4747

页数：5

共 50 条

[41] Partial Communication Model based on the Gain of Q-value in Multi-agent Reinforcement Learning
Xu, Jie
Wei, Wei
Zhang, Ya
Cui, Peng
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 69 - 74
[42] Multi-agent Reinforcement Learning Based on K-Means Algorithm
Liu Changan
Liu Fei
Liu Chunyang
Wu Hua
CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 414 - 418
[43] Multi-agent reinforcement learning clustering algorithm based on silhouette coefficient
Du, Peng
Li, Fenglian
Shao, Jianli
NEUROCOMPUTING, 2024, 596
[44] Knowledge-guided communication preference learning model for multi-agent cooperation
Zhang, Han
Yu, Hang
Wang, Xiaoming
Wang, Mengke
Zhang, Zhenyu
Li, Yang
Xie, Shaorong
Luo, Xiangfeng
INFORMATION SCIENCES, 2024, 667
[45] ACM: Learning Dynamic Multi-agent Cooperation via Attentional Communication Model
Han, Xue
Yan, Hongping
Zhang, Junge
Wang, Lingfeng
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 219 - 229
[46] Efficient multi-agent cooperation: Scalable reinforcement learning with heterogeneous graph networks and limited communication
Li, Z.
Yang, Y.
Cheng, H.
KNOWLEDGE-BASED SYSTEMS, 2024, 300
[47] Model-Based Self-Advising for Multi-Agent Learning
Ye, Dayong
Zhu, Tianqing
Zhu, Congcong
Zhou, Wanlei
Yu, Philip S.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7934 - 7945
[48] Sequence to Sequence Multi-agent Reinforcement Learning Algorithm
Shi T.
Wang L.
Huang Z.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 206 - 213
[49] A new accelerating algorithm for multi-agent reinforcement learning
张汝波
仲宇
顾国昌
Journal of Harbin Institute of Technology, 2005, (01) : 48 - 51
[50] Model-based learning of interaction strategies in multi-agent systems
Carmel, D
Markovitch, S
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1998, 10 (03) : 309 - 332

← 1 2 3 4 5 →