Game-Based Backstepping Design for Strict-Feedback Nonlinear Multi-Agent Systems Based on Reinforcement Learning

被引:28
|
作者
Long, Jia [1 ]
Yu, Dengxiu [1 ]
Wen, Guoxing [2 ,3 ]
Li, Li [4 ]
Wang, Zhen [5 ]
Chen, C. L. Philip [6 ,7 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Binzhou Univ, Coll Sci, Binzhou 256600, Peoples R China
[3] Qilu Univ Technol, Sch Math & Stat, Jinan 250353, Peoples R China
[4] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China
[5] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect, Xian 710072, Peoples R China
[6] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China
[7] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Games; Backstepping; Artificial neural networks; Nonlinear dynamical systems; Multi-agent systems; Mathematical models; Optimal control; Game-based backstepping; high-order multi-agent system; neural network (NN); reinforcement learning (RL); tracking game; SMOOTH TRANSITION; SWARM CONTROL; TRACKING; DYNAMICS;
D O I
10.1109/TNNLS.2022.3177461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the game-based backstepping control method is proposed for the high-order nonlinear multi-agent system with unknown dynamic and input saturation. Reinforcement learning (RL) is employed to get the saddle point solution of the tracking game between each agent and the reference signal for achieving robust control. Specifically, the approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by policy iteration for each subsystem, and the single network adaptive critic (SNAC) architecture is used to reduce the computational burden. In addition, based on the separation operation of the error term from the derivative of the value function, we achieve the different proportions of the two agents in the game to realize the regulation of the final equilibrium point. Different from the general use of the neural network for system identification, the unknown nonlinear dynamic term is approximated based on the state difference obtained by the command filter. Furthermore, a sufficient condition is established to guarantee that the whole system and each subsystem included are uniformly ultimately bounded. Finally, simulation results are given to show the effectiveness of the proposed method.
引用
收藏
页码:817 / 830
页数:14
相关论文
共 50 条
  • [41] Eavesdropping Game Based on Multi-Agent Deep Reinforcement Learning
    Guo, Delin
    Tang, Lan
    Yang, Lvxi
    Liang, Ying-Chang
    2022 IEEE 23RD INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATION (SPAWC), 2022,
  • [42] Eavesdropping Game Based on Multi-Agent Deep Reinforcement Learning
    Guo, Delin
    Tang, Lan
    Yang, Lvxi
    Liang, Ying-Chang
    IEEE Workshop on Signal Processing Advances in Wireless Communications, SPAWC, 2022, 2022-July
  • [43] Fuzzy Backstepping Control for Strict-Feedback Nonlinear Systems with Mismatched Uncertainties
    Xu Zibin
    Min Jianqing
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 5054 - 5057
  • [44] A Multi-agent Reinforcement Learning Algorithm Based on Stackelberg Game
    Cheng, Chi
    Zhu, Zhangqing
    Xin, Bo
    Chen, Chunlin
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 727 - 732
  • [45] Adaptive Neural Control for Strict-Feedback Nonlinear Systems Without Backstepping
    Park, Jang-Hyun
    Kim, Seong-Hwan
    Moon, Chae-Joo
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (07): : 1204 - 1209
  • [46] Controllability of game-based multi-agent system
    Junhao GUO
    Zhijian JI
    Yungang LIU
    Science China(Information Sciences), 2023, 66 (12) : 107 - 120
  • [47] Controllability of game-based multi-agent system
    Guo, Junhao
    Ji, Zhijian
    Liu, Yungang
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (12)
  • [48] DSC-Backstepping based Robust Adaptive Fuzzy Control for a Class of Strict-Feedback Nonlinear Systems
    Li, Tieshan
    Feng, Gang
    Zou, Zaojian
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 1276 - +
  • [49] Observer-based adaptive fuzzy backstepping control of MIMO stochastic nonlinear strict-feedback systems
    Yue Li
    Shaocheng Tong
    Yongming Li
    Nonlinear Dynamics, 2012, 67 : 1579 - 1593
  • [50] Robust adaptive fuzzy tracking control for a class of strict-feedback nonlinear systems based on backstepping technique
    Wang M.
    Wang X.
    Chen B.
    Tong S.
    Journal of Control Theory and Applications, 2007, 5 (3): : 317 - 322