A data-driven online ADP control method for nonlinear system based on policy iteration and nonlinear MIMO decoupling ADRC

被引:6
|
作者
Huang, Zhijian [1 ,3 ]
Zhang, Cheng [1 ]
Zhang, Yanyan [2 ]
Zhang, Guichen [1 ]
机构
[1] Shanghai Maritime Univ, Lab Intelligent Control & Computat, Shanghai 201306, Peoples R China
[2] Tongji Univ, Peoples Hosp 10, Shanghai 200072, Peoples R China
[3] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
关键词
Data driven; Online; Approximate dynamic programming; Linear quadratic function; Least square method; Assumed neural network; ADRC; DISTURBANCE REJECTION CONTROL; TIME LINEAR-SYSTEMS; NEURAL-NETWORK; REINFORCEMENT; MANAGEMENT; STABILITY;
D O I
10.1016/j.neucom.2018.04.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The action-critic approximate dynamic programming (ADP) depends on its network structure and training algorithm. Since there are some inherent shortcomings of the neural network, this paper proposes a data-driven nonlinear online ADP control method without the neural network. Firstly, a multi-input multi-output (MIMO) policy iteration method is utilized for the proposed ADP. For its policy evaluation, the cost function is approximated with a quadratic function and least square method; for its policy improvement, the optimal control is approximated by solving the quadratic function linearly. In this way, an optimal control equation in form of variable coefficients and system states is deduced. Secondly, a nonlinear MIMO decoupling Active Disturbance Rejection Control method is used to obtain the variable coefficients in real time, which endows the ADP method with a nonlinear performance during its policy improvement. Once the variable coefficients are determined, the data-driven nonlinear ADP control method is deduced. Finally, the examples of an under-actuated nonlinear system and a real application are taken to demonstrate the optimal control effect. Compared with some published methods and their simulation, this method and its simulation excel in the method of policy improvement, nonlinear ability and control performance etc. Thus, the proposed method explores a new way to the ADP, and overcomes the shortcomings of the neural-network-based ADP. Since it enables to work like a PID controller and does not require data collecting, training or extra learning, this proposed ADP is a real data-driven non-linear online optimal control method. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 50 条
  • [1] Data-Driven Policy Iteration for Nonlinear Optimal Control Problems
    Possieri, Corrado
    Sassano, Mario
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7365 - 7376
  • [2] Optimization-based adaptive control for MIMO nonlinear systems: A data-driven method
    San, Yang
    Hui, Yu
    Cai, Kaiquan
    Meng, Deyuan
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (03) : 1522 - 1540
  • [3] A novel dynamic decoupling control method for MIMO nonlinear system
    Duan, C
    Xie, SS
    Wei, XK
    [J]. ISTM/2005: 6TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-9, CONFERENCE PROCEEDINGS, 2005, : 5188 - 5191
  • [4] Data-driven nonlinear MIMO modeling for turbofan aeroengine control system of autonomous aircraft
    Zhang, Xiaobo
    Zhu, Jianming
    Tang, Wei
    Yuan, Zhijie
    Wang, Zhanxue
    [J]. CONTROL ENGINEERING PRACTICE, 2023, 138
  • [5] Data-driven control of nonlinear systems: An online sequential approach
    Vu, Minh
    Huang, Yunshen
    Zeng, Shen
    [J]. Systems and Control Letters, 2024, 193
  • [6] Adaptive Sliding Mode Decoupling Control with Data-Driven Sliding Surface for Unknown MIMO Nonlinear Discrete Systems
    Yongpeng Weng
    Xianwen Gao
    [J]. Circuits, Systems, and Signal Processing, 2017, 36 : 969 - 997
  • [7] A data-driven indirect method for nonlinear optimal control
    Tang, Gao
    Hauser, Kris
    [J]. ASTRODYNAMICS, 2019, 3 (04) : 345 - 359
  • [8] A data-driven indirect method for nonlinear optimal control
    Gao Tang
    Kris Hauser
    [J]. Astrodynamics, 2019, 3 : 345 - 359
  • [9] Adaptive Sliding Mode Decoupling Control with Data-Driven Sliding Surface for Unknown MIMO Nonlinear Discrete Systems
    Weng, Yongpeng
    Gao, Xianwen
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (03) : 969 - 997
  • [10] A Data-driven Indirect Method for Nonlinear Optimal Control
    Tang, Gao
    Hauser, Kris
    [J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4854 - 4861