Integrated adaptive dynamic programming for data -driven optimal controller design

被引:7
|
作者
Li, Guoqiang [1 ]
Goerges, Daniel [1 ]
Mu, Chaoxu [2 ]
机构
[1] Univ Kaiserslautern, Dept Elect & Comp Engn, Erwin Schrodinger Str 12, D-67663 Kaiserslautern, Germany
[2] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
关键词
TIME NONLINEAR-SYSTEMS; TRACKING CONTROL; SCHEME;
D O I
10.1016/j.neucom.2020.04.095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper a novel integrated adaptive dynamic programming method with an advantage function is developed to solve model-free optimal control problems and improve the control performance. The advantage function is utilized to evaluate the cost resulting from the action (control variables) which does not follow the optimal control policy. The Q function in Q-learning can thus be built from a value function and the advantage function. The control policy is then improved through minimizing the Q function. To employ the proposed algorithm, an integrated multi-layer neural network (INN) is designed for the value function and the control variables. Only one single neural network requires adaption. This avoids the iterative learning of two separate networks in the heuristic dynamic programming-based methods. Simulation for linear and non-linear optimal control problems is studied. Comparing to the optimal solutions resulting from the linear quadratic regulator and dynamic programming (DP), the proposed INN design can lead to closer control performance than the ones with action dependent heuristic dynamic programming (ADHDP). Furthermore INN is applied to optimize the energy management strategy of hybrid electric vehicles for fuel economy. The fuel consumption based on INN is lower than the one from ADHDP and much closer to the optimal results by DP. The result indicates the near fuel-optimality and an effective practical application. © 2020 Elsevier B.V.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [1] Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2016, 71 : 348 - 360
  • [2] Distributed adaptive dynamic programming for data-driven optimal control
    Tang, Wentao
    Daoutidis, Prodromos
    [J]. SYSTEMS & CONTROL LETTERS, 2018, 120 : 36 - 43
  • [3] Optimal preview controller design for multirate systems based on adaptive dynamic programming
    Di, Wang
    Ye, Jiayu
    Gao Suixiang
    Yang, Siliang
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1608 - 1613
  • [4] Adaptive Dynamic Programming and Data-Driven Cooperative Optimal Output Regulation with Adaptive Observers
    Qasem, Omar
    Jebari, Khalid
    Gao, Weinan
    [J]. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2538 - 2543
  • [5] Advanced controller design for AUV based on adaptive dynamic programming
    Chen, Tim
    Khurram, Safiullahand
    Zoungrana, Joelli
    Pandey, Lallit
    Chen, J. C. Y.
    [J]. ADVANCES IN COMPUTATIONAL DESIGN, 2020, 5 (03): : 233 - 260
  • [6] Design and implementation of an optimal switching controller for uninterruptible power supply inverters using adaptive dynamic programming
    Gogani Khiabani, Ataollah
    Heydari, Ali
    [J]. IET POWER ELECTRONICS, 2019, 12 (12) : 3068 - 3076
  • [7] Dynamic programming strategy in optimal controller design for a wind turbine system
    Mitaw, Abibual Abate
    Kassie, Abrham Tadesse
    Negash, Dereje Shiferaw
    [J]. COGENT ENGINEERING, 2024, 11 (01):
  • [8] Robust Adaptive Dynamic Programming for Optimal Nonlinear Control Design
    Jiang, Yu
    Jiang, Zhong-Ping
    [J]. 2013 9TH ASIAN CONTROL CONFERENCE (ASCC), 2013,
  • [9] Adaptive Optimal Observer Design via Approximate Dynamic Programming
    Na, Jing
    Herrmann, Guido
    Vamvoudakis, Kyriakos G.
    [J]. 2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 3288 - 3293
  • [10] An integrated data-driven Markov parameters sequence identification and adaptive dynamic programming method to design fault-tolerant optimal tracking control for completely unknown model systems
    Han, Kezhen
    Feng, Jian
    Yao, Yu
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2017, 354 (13): : 5280 - 5301