Q-learning algorithm in solving consensusability problem of discrete-time multi-agent systems

被引：12

作者：

Feng, Tao ^{[1
,2
]}

Zhang, Jilie ^{[1
]}

Tong, Yin ^{[1
]}

Zhang, Huaguang ^{[3
]}

机构：

[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Sichuan, Peoples R China

[2] Natl Engn Lab Integrated Transportat Big Data App, Chengdu, Peoples R China

[3] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China

来源：

AUTOMATICA | 2021年 / 128卷

基金：

中国国家自然科学基金;

关键词：

Consensusability; Consensus region; Linear quadratic regulator (LQR); Q-learning; STABILITY MARGINS; GRAPHICAL GAMES; SYNCHRONIZATION;

D O I：

10.1016/j.automatica.2021.109576

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper solves the consensusability problem for the single-input discrete-time multi-agent system (MAS) over directed graphs by the linear quadratic regulator (LQR) design method. It is proved that the maximum consensus region is exactly the largest gain margin (GM) of LQR. Based on this, the necessary and sufficient condition on consensusability is derived by solving a standard algebraic Riccati equation (ARE). The developed framework permits that the consensusability problem can be solved when the agents' models are completely unavailable. Q-learning algorithm is employed to compute the maximum consensus region and implement the consensus protocol design. The algorithm runs only on a single agent rather than the intercommunicating MAS hence the unattainable initial admissible protocols are not required. A numerical example is given to illustrate the effectiveness of the developed methods. (c) 2021 Elsevier Ltd. All rights reserved.

引用

页数：7

共 50 条

[31] Consensus analysis of multi-agent discrete-time systems
[J]. Huang, Q.-Z. (qinzhenhuang2@gmail.com), 1600, Science Press (38):
[32] Stability of leaderless discrete-time multi-agent systems
David Angeli
Pierre-Alexandre Bliman
[J]. Mathematics of Control, Signals and Systems, 2006, 18 : 293 - 322
[33] Cooperative behavior acquisition for multi-agent systems by Q-learning
Xie, M. C.
Tachibana, A.
[J]. 2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 424 - +
[34] Stability of leaderless discrete-time multi-agent systems
Angeli, David
Bliman, Pierre-Alexandre
[J]. MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 2006, 18 (04) : 293 - 322
[35] On the cluster consensus of discrete-time multi-agent systems
Chen, Yao
Lu, Jinhu
Han, Fengling
Yu, Xinghuo
[J]. SYSTEMS & CONTROL LETTERS, 2011, 60 (07) : 517 - 523
[36] Group controllability of discrete-time multi-agent systems
Liu, Bo
Han, Yue
Jiang, Fangcui
Su, Housheng
Zou, Jietao
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2016, 353 (14): : 3524 - 3559
[37] Extending Q-Learning to general adaptive multi-agent systems
Tesauro, G
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 871 - 878
[38] Minimax fuzzy Q-learning in cooperative multi-agent systems
Kilic, A
Arslan, A
[J]. ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 264 - 272
[39] Efficient off-policy Q-learning for multi-agent systems by solving dual games
Wang, Yan
Xue, Huiwen
Wen, Jiwei
Liu, Jinfeng
Luan, Xiaoli
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (06) : 4193 - 4212
[40] Iterative learning control algorithm of consensus for discrete-time heterogeneous multi-agent systems with independent topologies
Liu, Xinxin
Li, Junmin
He, Chao
[J]. PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 284 - 289

← 1 2 3 4 5 →