Adaptive optimal control of continuous-time nonlinear affine systems via hybrid iteration

被引:6
|
作者
Qasem, Omar [2 ,3 ]
Gao, Weinan [1 ]
Vamvoudakis, Kyriakos G. [4 ]
机构
[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Liaoning, Peoples R China
[2] Amer Int Univ, Sch Engn & Comp, Elect Engn Dept, Al Jahra, Kuwait
[3] Florida Inst Technol, Coll Engn & Sci, Dept Mech & Civil Engn, Melbourne, FL 32901 USA
[4] Georgia Inst Technol, Daniel Guggenheim Sch Aerosp Engn, Atlanta, GA 30332 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Hybrid iteration; Adaptive dynamic programming; Adaptive optimal control; Reinforcement learning; Nonlinear systems; POLICY ITERATION; HJB SOLUTION;
D O I
10.1016/j.automatica.2023.111261
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel successive approximation framework, named hybrid iteration (HI), is proposed to fill up the performance gap between two well-known dynamic programming algorithms, namely policy iteration (PI) and value iteration (VI). Using HI, an approximated optimal control policy can be learned without prior knowledge of an initial admissible control policy required by PI. Additionally, the HI algorithm converges to the optimal solution much faster than VI, and thus requires tremendously less number of learning iterations and CPU-time, compared to VI. Initially, we develop a model-based HI algorithm, and then extend it to a data-driven HI algorithm which learns the optimal control policy without any information of the physics of the system. Simulation results demonstrate the efficacy of the proposed HI algorithm. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Adaptive Optimal Control of Continuous-Time Linear Systems via Hybrid Iteration
    Qasem, Omar
    Gao, Weinan
    Bian, Tao
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [2] Optimal Control of Affine Nonlinear Continuous-time Systems
    Dierks, T.
    Jagannathan, S.
    [J]. 2010 AMERICAN CONTROL CONFERENCE, 2010, : 1568 - 1573
  • [3] Adaptive Optimal Control via Continuous-Time Q-Learning for Unknown Nonlinear Affine Systems
    Chen, Anthony Siming
    Herrmann, Guido
    [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 1007 - 1012
  • [4] Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
    Vrabie, D.
    Lewis, F. L.
    [J]. 47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 73 - 79
  • [5] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [6] Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2781 - 2790
  • [7] Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics
    Derong Liu
    Xiong Yang
    Hongliang Li
    [J]. Neural Computing and Applications, 2013, 23 : 1843 - 1850
  • [8] Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics
    Liu, Derong
    Yang, Xiong
    Li, Hongliang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8): : 1843 - 1850
  • [9] Adaptive optimal tracking control for nonlinear continuous-time systems with time delay using value iteration algorithm
    Shi, Jing
    Yue, Dong
    Xie, Xiangpeng
    [J]. NEUROCOMPUTING, 2020, 396 : 172 - 178
  • [10] Hybrid adaptive fuzzy control of nonlinear continuous-time systems
    Xie, Xuejun
    Zhang, Dalei
    [J]. Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2000, 20 (01): : 25 - 29