Optimal couple-group tracking control for the heterogeneous multi-agent systems with cooperative-competitive interactions via reinforcement learning method

被引:8
|
作者
Li, Jun [1 ,2 ]
Ji, Lianghao [1 ,2 ]
Zhang, Cuijuan [1 ,2 ]
Li, Huaqing [3 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China
[3] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning (RL); Cooperative-competitive interaction; Optimal couple-group tracking control (OCGTC); Heterogeneous multi-agent systems (HeMASs); OUTPUT SYNCHRONIZATION; CLUSTER CONSENSUS; TIME NETWORKS; GAMES; ALGORITHM;
D O I
10.1016/j.ins.2022.07.181
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study a class of optimal couple-group tracking control (OCGTC) problems for heterogeneous multi-agent systems (HeMASs) based on reinforcement learning (RL) method, whose goal is to minimize the local tracking errors (states) and control inputs (ac-tions) of followers by learning the dynamic knowledge of a single leader. The weakly con-nected multi-agent network is randomly divided into coupled sub-networks, and each agent in the same sub-network cooperates to accomplish tracking control such that the positions and velocities of all the agents converge to the same value, while the agents from different subgroups compete with each other to dissimilar tracking goals. In particular, in the discussed HeMASs, we consider agents with unknown dynamics of first-order and second-order. To solve the algebraic Riccati equation (ARE), an policy-value-based actor -critic technique is applied. Using the Lyapunov-like theorem, we verify that the local track-ing error and the estimated weights of actor-critic neural networks are deduced to be uni-formly ultimately bounded. Eventually, several simulations demonstrate the correctness of the retrieved theoretical results. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:401 / 424
页数:24
相关论文
共 50 条
  • [21] Demand response model: A cooperative-competitive multi-agent reinforcement learning approach
    Salazar, Eduardo J.
    Rosero, Veronica
    Gabrielski, Jawana
    Samper, Mauricio E.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [22] Complex group consensus of multi-agent systems with cooperative-competitive mechanisms: acyclic partition method
    Yao, Lingling
    Xie, Dongmei
    Wang, Han
    APPLIED MATHEMATICS AND COMPUTATION, 2023, 443
  • [23] Exponential finite-time couple-group consensus for agents in cooperative-competitive networks via pinning method
    Ji, Lianghao
    Wu, Rong
    Zhang, Cuijuan
    Yang, Shasha
    Li, Huaqing
    ASIAN JOURNAL OF CONTROL, 2023, 25 (03) : 2139 - 2152
  • [24] Bias Estimation Correction in Multi-Agent Reinforcement Learning for Mixed Cooperative-Competitive Environments
    Sarkar T.
    Kalita S.
    SN Computer Science, 5 (1)
  • [25] Hierarchical relationship modeling in multi-agent reinforcement learning for mixed cooperative-competitive environments
    Xie, Shaorong
    Li, Yang
    Wang, Xinzhi
    Zhang, Han
    Zhang, Zhenyu
    Luo, Xiangfeng
    Yu, Hang
    INFORMATION FUSION, 2024, 108
  • [26] Optimal output anti-synchronisation of cooperative-competitive multi-agent systems via distributed observer
    Wu, Yanzhi
    Zhao, Yiyi
    Hu, Jiangping
    IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (13): : 2029 - 2038
  • [27] Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning
    Meng, Hao
    Pang, Denghao
    Cao, Jinde
    Guo, Yechen
    Niazi, Azmat Ullah Khan
    APPLIED MATHEMATICS AND COMPUTATION, 2024, 476
  • [28] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [29] Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS II: ADAPTATION AND MULTI-AGENT LEARNING, 2005, 3394 : 119 - 131
  • [30] Quasi-consensus of Markovian switching heterogeneous multi-agent systems with cooperative-competitive interactions and parameter uncertainties
    Yang, Yuhan
    Shen, Jiarui
    Zhang, Jingxiang
    Chen, Jing
    Hu, Aihua
    Hu, Manfeng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2023, 237 (02) : 284 - 293