Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions

被引：0

作者：

Li, Jinna ^{[1
]}

Yuan, Lin ^{[1
]}

Cheng, Weiran ^{[1
]}

Chai, Tianyou ^{[2
]}

Lewis, Frank L. ^{[3
]}

机构：

[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Liaoning, Peoples R China

[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2024年

基金：

中国国家自然科学基金;

关键词：

Synchronization; Protocols; Heuristic algorithms; Decision making; Nash equilibrium; Multi-agent systems; Games; Data-driven control; distributed control; multiagent systems (MASs); reinforcement learning (RL); synchronization; GRAPHICAL GAMES;

D O I：

10.1109/TCYB.2024.3440333

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article dedicates to investigating a methodology for enhancing adaptability to environmental changes of reinforcement learning (RL) techniques with data efficiency, by which a joint control protocol is learned using only data for multiagent systems (MASs). Thus, all followers are able to synchronize themselves with the leader and minimize their individual performance. To this end, an optimal synchronization problem of heterogeneous MASs is first formulated, and then an arbitration RL mechanism is developed for well addressing key challenges faced by the current RL techniques, that is, insufficient data and environmental changes. In the developed mechanism, an improved Q -function with an arbitration factor is designed for accommodating the fact that control protocols tend to be made by historic experiences and instinctive decision-making, such that the degree of control over agents' behaviors can be adaptively allocated by on-policy and off-policy RL techniques for the optimal multiagent synchronization problem. Finally, an arbitration RL algorithm with critic-only neural networks is proposed, and theoretical analysis and proofs of synchronization and performance optimality are provided. Simulation results verify the effectiveness of the proposed method.

引用

页码：6545 / 6558

页数：14

共 50 条

[41] Dynamic Leader-Follower Output Containment Control of Heterogeneous Multiagent Systems Using Reinforcement Learning
Zhang, Huaipin
Zhao, Wei
Xie, Xiangpeng
Yue, Dong
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (09): : 5307 - 5316
[42] An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning
Ishiwaka, Y
Sato, T
Kakazu, Y
ROBOTICS AND AUTONOMOUS SYSTEMS, 2003, 43 (04) : 245 - 256
[43] Adaptive Output Synchronization With Designated Convergence Rate of Multiagent Systems Based on Off-Policy Reinforcement Learning
Huang, Chengjie
Chen, Ci
Xie, Kan
Li, Zhenni
Xie, Shengli
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (08): : 4667 - 4678
[44] Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning
Mu, Chaoxu
Zhao, Qian
Gao, Zhongke
Sun, Changyin
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2019, 356 (13): : 6946 - 6967
[45] Reinforcement Learning applied to Network Synchronization Systems
Destro, Alessandro
Giorgi, Giada
2022 IEEE INTERNATIONAL SYMPOSIUM ON MEASUREMENTS & NETWORKING (M&N 2022), 2022,
[46] Adaptive Individual Q-Learning-A Multiagent Reinforcement Learning Method for Coordination Optimization
Zhang, Zhen
Wang, Dongqing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
[47] Multiagent reinforcement learning through merging individually learned value functions
张化祥
黄上腾
Journal of Harbin Institute of Technology(New series), 2005, (03) : 346 - 350
[48] SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning
Yao, Xinghu
Wen, Chao
Wang, Yuhui
Tan, Xiaoyang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 52 - 63
[49] The dynamics of reinforcement social learning in networked cooperative multiagent systems
Hao, Jianye
Huang, Dongping
Cai, Yi
Leung, Ho-fung
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 111 - 122
[50] Adaptive Fuzzy Leader-Follower Synchronization of Constrained Heterogeneous Multiagent Systems
Yang, Yongliang
Xu, Cheng-Zhong
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (01) : 205 - 219

← 1 2 3 4 5 →