Policy Iteration-based Indirect Adaptive Optimal Control for Completely Unknown Continuous-Time LTI Systems

被引:0
|
作者
Jha, Sumit Kumar [1 ]
Roy, Sayan Basu [1 ]
Bhasin, Shubhendu [1 ]
机构
[1] Indian Inst Technol Delhi, Dept Elect Engn, New Delhi 110016, India
关键词
Adaptive optimal control; system identification; ARE; policy iteration; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; STABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel indirect adaptive optimal controller (AOC) for completely unknown continuous-time (CT) linear time invariant (LTI) systems using the policy iteration (PI) technique. The algorithm builds on the Kleinman's method of iteratively solving the algebraic Riccati equation (ARE). However, the actual system and control matrices information, required by the Kleinman's algorithm, is replaced by their CT online estimates using uniform sampling. A gradient-based online system identifier is developed using a low pass filter, which strategically eliminates the need for state derivative information, while the system identifier exponentially converges to the actual plant-parameter vector under the assumption of persistence of excitation (PE). The proposed online identifier based Kleinman's algorithm is shown to converge to the optimal control policy while preserving the stabilizability of the intermediate policies for the unknown CT LTI systems as validated through simulation studies on multi-input-multi-output (MIMO) LTI systems. The designed indirect AOC is argued to be computationally less intricate as compared to the past literature on direct AOC.
引用
收藏
页码:448 / 454
页数:7
相关论文
共 50 条
  • [1] Adaptive optimal control for continuous-time linear systems based on policy iteration
    Vrabie, D.
    Pastravanu, O.
    Abu-Khalaf, M.
    Lewis, F. L.
    [J]. AUTOMATICA, 2009, 45 (02) : 477 - 484
  • [2] Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
    Vrabie, D.
    Lewis, F. L.
    [J]. 47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 73 - 79
  • [3] Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics
    Lv, Yongfeng
    Na, Jing
    Yang, Qinmin
    Wu, Xing
    Guo, Yu
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2016, 89 (01) : 99 - 112
  • [4] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    Jiang, Yu
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2012, 48 (10) : 2699 - 2704
  • [5] Bias-policy iteration based optimal control for unknown continuous-time linear periodic systems
    Li, Xiang
    Jiang, Huaiyuan
    Zhou, Bin
    [J]. SYSTEMS & CONTROL LETTERS, 2024, 189
  • [6] Indirect adaptive fuzzy-regulated optimal control for unknown continuous-time nonlinear systems
    Zhang, Haiyun
    Meng, Deyuan
    Wang, Jin
    Lu, Guodong
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (02) : 155 - 169
  • [7] Indirect adaptive fuzzy-regulated optimal control for unknown continuous-time nonlinear systems
    Zhang, Haiyun
    Meng, Deyuan
    Wang, Jin
    Lu, Guodong
    [J]. Frontiers of Information Technology and Electronic Engineering, 2021, 22 (02): : 155 - 169
  • [8] Generalized Policy Iteration-based Reinforcement Learning Algorithm for Optimal Control of Unknown Discrete-time Systems
    Lin, Mingduo
    Zhao, Bo
    Liu, Derong
    Liu, Xi
    Luo, Fangchao
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3650 - 3655
  • [9] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [10] Homotopic policy iteration-based learning design for unknown linear continuous-time systemsx2729;
    Chen, Ci
    Lewis, Frank L.
    Li, Bo
    [J]. AUTOMATICA, 2022, 138