Game-Based Backstepping Design for Strict-Feedback Nonlinear Multi-Agent Systems Based on Reinforcement Learning

被引:28
|
作者
Long, Jia [1 ]
Yu, Dengxiu [1 ]
Wen, Guoxing [2 ,3 ]
Li, Li [4 ]
Wang, Zhen [5 ]
Chen, C. L. Philip [6 ,7 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Binzhou Univ, Coll Sci, Binzhou 256600, Peoples R China
[3] Qilu Univ Technol, Sch Math & Stat, Jinan 250353, Peoples R China
[4] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China
[5] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect, Xian 710072, Peoples R China
[6] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China
[7] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Games; Backstepping; Artificial neural networks; Nonlinear dynamical systems; Multi-agent systems; Mathematical models; Optimal control; Game-based backstepping; high-order multi-agent system; neural network (NN); reinforcement learning (RL); tracking game; SMOOTH TRANSITION; SWARM CONTROL; TRACKING; DYNAMICS;
D O I
10.1109/TNNLS.2022.3177461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the game-based backstepping control method is proposed for the high-order nonlinear multi-agent system with unknown dynamic and input saturation. Reinforcement learning (RL) is employed to get the saddle point solution of the tracking game between each agent and the reference signal for achieving robust control. Specifically, the approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by policy iteration for each subsystem, and the single network adaptive critic (SNAC) architecture is used to reduce the computational burden. In addition, based on the separation operation of the error term from the derivative of the value function, we achieve the different proportions of the two agents in the game to realize the regulation of the final equilibrium point. Different from the general use of the neural network for system identification, the unknown nonlinear dynamic term is approximated based on the state difference obtained by the command filter. Furthermore, a sufficient condition is established to guarantee that the whole system and each subsystem included are uniformly ultimately bounded. Finally, simulation results are given to show the effectiveness of the proposed method.
引用
收藏
页码:817 / 830
页数:14
相关论文
共 50 条
  • [1] Game-based Optimized Backstepping Control for Strict-feedback Systems With Input Constraints
    Zhang, Liuliu
    Jing, Hailong
    Qian, Cheng
    Hua, Changchun
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (08) : 2472 - 2482
  • [2] Reinforcement learning-based optimized backstepping control for strict-feedback nonlinear systems subject to external disturbances
    Qin, Yan
    Cao, Liang
    Lu, Qing
    Pan, Yingnan
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (05): : 2724 - 2743
  • [3] Nonsmooth Backstepping Design for a Class of Parametric Strict-Feedback Nonlinear Systems Based on Lipschitz Feedback
    Zheng, Kai
    Shen, Tielong
    He, Fenghua
    Yang, Ming
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 6351 - 6356
  • [4] Optimized backstepping consensus control using adaptive observer-critic-actor reinforcement learning for strict-feedback multi-agent systems
    Zhu, Jiahao
    Wen, Guoxing
    Veluvolu, Kalyana C.
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (06):
  • [5] Leader-Following Consensus of Nonlinear Strict-Feedback Multi-agent Systems
    康剑灵
    於玲玲
    Journal of Donghua University(English Edition), 2019, 36 (01) : 1 - 7
  • [6] Optimized Backstepping Consensus Control Using Reinforcement Learning for a Class of Nonlinear Strict-Feedback-Dynamic Multi-Agent Systems
    Wen, Guoxing
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1524 - 1536
  • [7] Adaptive Iterative Learning Backstepping Control for Nonlinear Strict-feedback Systems
    Chen, Jianyong
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 1054 - 1059
  • [8] Game-based coalescence in multi-agent systems
    Ma, Jingying
    Zheng, Yuanshi
    Zhou, Likai
    SYSTEMS & CONTROL LETTERS, 2021, 148
  • [9] Observer-Based Output Consensus Control Scheme for Strict-Feedback Nonlinear Multi-Agent Systems With Disturbances
    Zhu, Fanglai
    Zhao, Younan
    Fu, Yuhang
    Dinh, Thach Ngoc
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (03): : 2621 - 2631
  • [10] Adaptive containment control for nonlinear strict-feedback multi-agent systems with dynamic leaders
    Cui, Yang
    Liu, Xiaoping
    Deng, Xin
    Wang, Lidong
    INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (06) : 1658 - 1667