Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming☆

被引:0
|
作者
Xie, Kedi [1 ,2 ]
Zheng, Yiwei [1 ]
Jiang, Yi [3 ,4 ]
Lan, Weiyao [1 ]
Yu, Xiao [5 ,6 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[2] Beijing Inst Technol, Sch Automat, Beijing, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[4] City Univ Hong Kong, Ctr Complex & Complex Networks, Hong Kong, Peoples R China
[5] Xiamen Univ, Inst Artificial Intelligence, Xiamen 361005, Peoples R China
[6] Minist Educ China, Key Lab Multimedia Trusted Percept & Efficient Com, Xiamen 361005, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Adaptive dynamic programming; Dynamic output feedback control; Linear quadratic regulation; Value iteration;
D O I
10.1016/j.automatica.2024.111601
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present an approximate optimal dynamic output feedback control learning algorithm to solve the linear quadratic regulation problem for unknown linear continuous -time systems. First, a dynamic output feedback controller is designed by constructing the internal state. Then, an adaptive dynamic programming based learning algorithm is proposed to estimate the optimal feedback control gain by only accessing the input and output data. By adding a constructed virtual observer error into the iterative learning equation, the proposed learning algorithm with the new iterative learning equation is immune to the observer error. In addition, the value iteration based learning equation is established without storing a series of past data, which could lead to a reduction of demands on the usage of memory storage. Besides, the proposed algorithm eliminates the requirement of repeated finite window integrals, which may reduce the computational load. Moreover, the convergence analysis shows that the estimated control policy converges to the optimal control policy. Finally, a physical experiment on an unmanned quadrotor is given to illustrate the effectiveness of the proposed approach. (c) 2024 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Output feedback control of Markov jump linear systems in continuous-time
    de Farias, DP
    Geromel, JC
    do Val, JBR
    Costa, OLV
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (05) : 944 - 949
  • [42] Dynamic Output Feedback H2 Control for Continuous-time Polytopic LPV Systems
    Cai Guang-Bin
    Yin Bao-Juan
    Han Xiao-Jun
    Hu Chang-Hua
    He Hua-Feng
    [J]. 2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 3603 - 3608
  • [43] Dynamic output-feedback H control for continuous-time singular Markovian jump systems
    Park, Chan-eun
    Kwon, Nam Kyu
    Park, PooGyeon
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, 28 (11) : 3521 - 3531
  • [44] Optimal Adaptive Control of Nonlinear Continuous-time Systems in Strict Feedback Form with Unknown Internal Dynamics
    Zargarzadeh, H.
    Dierks, T.
    Jagannathan, S.
    [J]. 2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 4127 - 4132
  • [45] Adaptive Output Feedback Control with an Adaptive Predictive Feedforward Input for Continuous-Time Systems
    Mizumoto, Ikuro
    Makimoto, Yusuke
    Masuda, Shiro
    [J]. IFAC PAPERSONLINE, 2019, 52 (29): : 216 - 221
  • [46] Dynamic Output-feedback Controller Design for Continuous-time Linear Systems with Actuator and Sensor Quantization
    Ferrante, Francesco
    Gouaisbaut, Frederic
    Tarbouriech, Sophie
    [J]. 2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 1663 - 1668
  • [47] Output Feedback Tracking Control of A Class of Continuous Nonlinear Systems via Adaptive Dynamic Programming Approach
    Yang, Yang
    Yue, Dong
    Shi, Jing
    [J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 1647 - 1652
  • [48] Optimal Output Regulation for General Linear Systems via Adaptive Dynamic Programming
    Wu, Yanzhi
    Liang, Qingpeng
    Hu, Jiangping
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (11) : 11916 - 11926
  • [49] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [50] THE LINEAR-QUADRATIC OPTIMAL REGULATOR FOR CONTINUOUS-TIME DESCRIPTOR SYSTEMS - A DYNAMIC-PROGRAMMING APPROACH
    XU, H
    MIZUKAMI, K
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1994, 25 (11) : 1889 - 1898