Data-Driven Fault-Tolerant Reinforcement Learning Containment Control for Nonlinear Multiagent Systems

被引:2
|
作者
Wang, Xin [1 ]
Zhao, Chen [1 ]
Huang, Tingwen [2 ]
机构
[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing Key Lab Nonlinear Circuits & Intelligent, Chongqing 400075, Peoples R China
[2] TM Univ Qatar, Doha 23874, Qatar
基金
中国国家自然科学基金;
关键词
Mutiagent systems; containment control; reinforcement learning; data-driven; actuator faults; ADAPTIVE-CONTROL; CONSENSUS; TRACKING;
D O I
10.1109/TETCI.2023.3303252
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article concentrates on the data-driven containment problem for a class of nonlinear discrete-time multiagent systems via reinforcement learning. A novel two-layer control architecture is designed. In the first layer, a reference model is introduced with which all signals of the multiagent systems will reach synchronization. On account of the critic-actor neural network architecture, an adaptive neural network controller with a multigradient recursive reinforcement learning algorithm and less learning parameters method is designed to tackle the tracking issues and actuator faults. Then in the distributed control layer, the virtual containment control input is developed via policy iteration with critic-actor neural networks such that the containment error will converge to a small neighborhood of the origin. Note that the proposed method makes the solution of optimal containment control problem independent of system dynamics and takes energy costs into consideration. Besides, the semiglobally uniformly ultimately bounded property of signals in the closed-loop system and the policy iteration convergence are guaranteed. Finally, some numerical illustrations are attached to consolidate the effectiveness of our proposed mechanism.
引用
收藏
页码:416 / 426
页数:11
相关论文
共 50 条
  • [41] Fault-tolerant containment control of uncertain nonlinear systems in strict-feedback form
    Wang, Wei
    Wang, Dan
    Peng, Zhouhua
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2017, 27 (03) : 497 - 511
  • [42] Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning
    Roshanravan, Sajad
    Shamaghdari, Saeed
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 569 - 580
  • [43] Data-driven Second-order Sliding Mode Fault-Tolerant Control for a Class of Discrete-time Nonlinear Systems
    Han, Yaozhen
    Hou, Mingdong
    European Journal of Control, 2024, 76
  • [44] Multiple Environment Integral Reinforcement Learning-Based Fault-Tolerant Control for Affine Nonlinear Systems
    Ma, Hong-Jun
    Xu, Lin-Xing
    Yang, Guang-Hong
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (04) : 1913 - 1928
  • [45] Data-driven Second-order Sliding Mode Fault-Tolerant Control for a Class of Discrete-time Nonlinear Systems
    Han, Yaozhen
    Hou, Mingdong
    EUROPEAN JOURNAL OF CONTROL, 2024, 76
  • [46] Data-Driven Dynamic Event-Triggered Fault-Tolerant Platooning Control
    Yue, Bai-Fan
    Che, Wei-Wei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (07) : 8418 - 8426
  • [47] A data-driven sensor fault-tolerant control scheme based on subspace identification
    Salim, Mina
    Ahmed, Saeed
    Khosrowjerdi, Mohammad Javad
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (15) : 6991 - 7006
  • [48] A Novel Data-Driven Fault-Tolerant Control Method Based on Convex Optimization
    Wang, Siqing
    Fan, Quan-Yong
    Li, Jiaxuan
    Zhang, Naizong
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 374 - 379
  • [49] Data-Driven Fault-Tolerant Platooning Control Under Aperiodic DoS Attacks
    Yue, Bai-Fan
    Che, Wei-Wei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12166 - 12179
  • [50] Cooperative Fault-Tolerant Output Regulation for Multiagent Systems by Distributed Learning Control Approach
    Deng, Chao
    Che, Wei-Wei
    Shi, Peng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4831 - 4841