Optimized Backstepping Consensus Control Using Reinforcement Learning for a Class of Nonlinear Strict-Feedback-Dynamic Multi-Agent Systems

被引:65
|
作者
Wen, Guoxing [1 ,2 ]
Chen, C. L. Philip [3 ,4 ]
机构
[1] Binzhou Univ, Coll Sci, Binzhou 256600, Peoples R China
[2] Qilu Univ Technol, Sch Math & Stat, Jinan 250353, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Peoples R China
[4] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal control; Backstepping; Artificial neural networks; Performance analysis; Nonlinear dynamical systems; Consensus control; Mathematical model; Critic-actor architecture; high-order multi-agent system (MAS); neural network (NN); optimal control; reinforcement learning (RL); ADAPTIVE OPTIMAL-CONTROL; CONTAINMENT CONTROL; TRACKING CONTROL;
D O I
10.1109/TNNLS.2021.3105548
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, an optimized leader-following consensus control scheme is proposed for the nonlinear strict-feedback-dynamic multi-agent system by learning from the controlling idea of optimized backstepping technique, which designs the virtual and actual controls of backstepping to be the optimized solution of corresponding subsystems so that the entire backstepping control is optimized. Since this control needs to not only ensure the optimizing system performance but also synchronize the multiple system state variables, it is an interesting and challenging topic. In order to achieve this optimized control, the neural network approximation-based reinforcement learning (RL) is performed under critic-actor architecture. In most of the existing RL-based optimal controls, since both the critic and actor RL updating laws are derived from the negative gradient of square of the Hamilton-Jacobi-Bellman (HJB) equation's approximation, which contains multiple nonlinear terms, their algorithm are inevitably intricate. However, the proposed optimized control derives the RL updating laws from the negative gradient of a simple positive function, which is correlated with the HJB equation; hence, it can be significantly simple in the algorithm. Meanwhile, it can also release two general conditions, known dynamic and persistence excitation, which are required in most of the RL-based optimal controls. Therefore, the proposed optimized scheme can be a natural selection for the high-order nonlinear multi-agent control. Finally, the effectiveness is demonstrated by both theory and simulation.
引用
收藏
页码:1524 / 1536
页数:13
相关论文
共 50 条
  • [21] Adaptive Consensus via Dynamic Feedback Control for Lipschitz Nonlinear Multi-Agent Systems
    Li, Lin
    Wang, Heyang
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2016, 2 (04): : 226 - 229
  • [22] Leader-Following Consensus of Nonlinear Strict-Feedback Multi-agent Systems
    康剑灵
    於玲玲
    Journal of Donghua University(English Edition), 2019, 36 (01) : 1 - 7
  • [23] Consensus Algorithms for Second-Order Nonlinear Multi-Agent Systems using Backstepping Control
    Wang, Yinqiu
    Gao, Zhe
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 3505 - 3510
  • [24] Distributed adaptive iterative learning exact consensus for nonlinear strict-feedback multi-agent systems with unknown control directions
    Liang, M. D.
    Li, J. M.
    IRANIAN JOURNAL OF FUZZY SYSTEMS, 2023, 20 (04): : 57 - 74
  • [25] Optimized distributed formation control using identifier-critic-actor reinforcement learning for a class of stochastic nonlinear multi-agent systems☆
    Wen, Guoxing
    Niu, Ben
    ISA TRANSACTIONS, 2024, 155 : 1 - 10
  • [26] Reinforcement learning-based optimized backstepping control of nonlinear strict feedback system with unknown control gain function
    Zhou, Ranran
    Wen, Guoxing
    Li, Bin
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (05): : 1358 - 1378
  • [27] Distributed Consensus Control Using Neural Network for a Class of Nonlinear Multi-agent Systems
    Wen, Guo-Xing
    Chen, C. L. Philip
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2591 - 2595
  • [28] Adaptive neural control for a class of non-strict feedback nonlinear multi-agent systems with input quantization
    Shang Yun
    Chen Bing
    Lin Chong
    Cheng Zunshui
    Xin Youming
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 19 - 24
  • [29] Adaptive Neural Network Optimal Backstepping Control of Strict Feedback Nonlinear Systems via Reinforcement Learning
    Zhong, Mei
    Cao, Jinde
    Liu, Heng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 832 - 847
  • [30] Observer-Based Output Consensus Control Scheme for Strict-Feedback Nonlinear Multi-Agent Systems With Disturbances
    Zhu, Fanglai
    Zhao, Younan
    Fu, Yuhang
    Dinh, Thach Ngoc
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (03): : 2621 - 2631