Q-learning algorithm in solving consensusability problem of discrete-time multi-agent systems

被引:12
|
作者
Feng, Tao [1 ,2 ]
Zhang, Jilie [1 ]
Tong, Yin [1 ]
Zhang, Huaguang [3 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Sichuan, Peoples R China
[2] Natl Engn Lab Integrated Transportat Big Data App, Chengdu, Peoples R China
[3] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
Consensusability; Consensus region; Linear quadratic regulator (LQR); Q-learning; STABILITY MARGINS; GRAPHICAL GAMES; SYNCHRONIZATION;
D O I
10.1016/j.automatica.2021.109576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper solves the consensusability problem for the single-input discrete-time multi-agent system (MAS) over directed graphs by the linear quadratic regulator (LQR) design method. It is proved that the maximum consensus region is exactly the largest gain margin (GM) of LQR. Based on this, the necessary and sufficient condition on consensusability is derived by solving a standard algebraic Riccati equation (ARE). The developed framework permits that the consensusability problem can be solved when the agents' models are completely unavailable. Q-learning algorithm is employed to compute the maximum consensus region and implement the consensus protocol design. The algorithm runs only on a single agent rather than the intercommunicating MAS hence the unattainable initial admissible protocols are not required. A numerical example is given to illustrate the effectiveness of the developed methods. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Consensus analysis of multi-agent discrete-time systems
    [J]. Huang, Q.-Z. (qinzhenhuang2@gmail.com), 1600, Science Press (38):
  • [32] Stability of leaderless discrete-time multi-agent systems
    David Angeli
    Pierre-Alexandre Bliman
    [J]. Mathematics of Control, Signals and Systems, 2006, 18 : 293 - 322
  • [33] Cooperative behavior acquisition for multi-agent systems by Q-learning
    Xie, M. C.
    Tachibana, A.
    [J]. 2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 424 - +
  • [34] Stability of leaderless discrete-time multi-agent systems
    Angeli, David
    Bliman, Pierre-Alexandre
    [J]. MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 2006, 18 (04) : 293 - 322
  • [35] On the cluster consensus of discrete-time multi-agent systems
    Chen, Yao
    Lu, Jinhu
    Han, Fengling
    Yu, Xinghuo
    [J]. SYSTEMS & CONTROL LETTERS, 2011, 60 (07) : 517 - 523
  • [36] Group controllability of discrete-time multi-agent systems
    Liu, Bo
    Han, Yue
    Jiang, Fangcui
    Su, Housheng
    Zou, Jietao
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2016, 353 (14): : 3524 - 3559
  • [37] Extending Q-Learning to general adaptive multi-agent systems
    Tesauro, G
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 871 - 878
  • [38] Minimax fuzzy Q-learning in cooperative multi-agent systems
    Kilic, A
    Arslan, A
    [J]. ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 264 - 272
  • [39] Efficient off-policy Q-learning for multi-agent systems by solving dual games
    Wang, Yan
    Xue, Huiwen
    Wen, Jiwei
    Liu, Jinfeng
    Luan, Xiaoli
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (06) : 4193 - 4212
  • [40] Iterative learning control algorithm of consensus for discrete-time heterogeneous multi-agent systems with independent topologies
    Liu, Xinxin
    Li, Junmin
    He, Chao
    [J]. PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 284 - 289