Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem

被引:0
|
作者
Lu, W. [1 ]
Ferrari, S. [1 ]
Fierro, R. [2 ]
Wettergren, T. A. [3 ]
机构
[1] Duke Univ, Dept Mech Engn & Mat Sci, LISC, Durham, NC 27706 USA
[2] Univ New Mexico, Dept Elect & Comp Engn, Multi Agent Robot Hybrid & Embedded Syst Lab, Albuquerque, NM 87131 USA
[3] Naval Undersea Warfare Ctr, Newport, RI USA
来源
关键词
Approximate dynamic programming (ADP); hybrid systems; optimal control; FRAMEWORK;
D O I
10.1117/12.919286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discrete event controller layer, a discrete-continuous interface layer, and a continuous state layer. The hybrid optimal control problem (HOCP) is to find the optimal discrete event decisions and the optimal continuous controls subject to a deterministic minimization of a scalar function regarding the system state and control over time. Due to the uncertainty of environment and complexity of the HOCP, the cost-to-go cannot be evaluated before the HDS explores the entire system state space; as a result, the optimal control, neither continuous nor discrete, is not available ahead of time. Therefore, ADP is adopted to learn the optimal control while the HDS is exploring the environment, because of the online advantage of ADP method. Furthermore, ADP can break the curses of dimensionality which other optimizing methods, such as dynamic programming (DP) and Markov decision process (MDP), are facing due to the high dimensions of HOCP.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Approximate dynamic programming for an inventory problem: Empirical comparison
    Katanyukul, Tatpong
    Duff, William S.
    Chong, Edwin K. P.
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2011, 60 (04) : 719 - 743
  • [42] DYNAMIC PROGRAMMING PRINCIPLE FOR STOCHASTIC RECURSIVE OPTIMAL CONTROL PROBLEM WITH DELAYED SYSTEMS
    Chen, Li
    Wu, Zhen
    [J]. ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2012, 18 (04) : 1005 - 1026
  • [43] Optimal bounded control of random vibration and hybrid solutions to dynamic programming equations
    Dimentberg, M
    Iourtchenko, D
    Bratus, A
    [J]. CONTROL OF OSCILLATIONS AND CHAOS, VOLS 1-3, PROCEEDINGS, 2000, : 236 - 240
  • [44] Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
    Duan, Jingliang
    Liu, Zhengyu
    Li, Shengbo Eben
    Sun, Qi
    Jia, Zhenzhong
    Cheng, Bo
    [J]. NEUROCOMPUTING, 2022, 484 : 128 - 141
  • [45] Dynamic programming based optimal control strategy of the hybrid vehicular power system
    Lian, Huijuan
    Zeng, Chunnian
    Cai, Zhenhua
    [J]. IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 7123 - 7127
  • [46] Scalable FPGA Implementation of Dynamic Programming for Optimal Control of Hybrid Electrical Vehicles
    Skarman, Frans
    Gustafsson, Oscar
    [J]. DESIGN AND ARCHITECTURES FOR SIGNAL AND IMAGE PROCESSING, DASIP 2024, 2024, 14622 : 27 - 39
  • [47] Online optimal control of nonlinear discrete-time systems using approximate dynamic programming
    Travis DIERKS
    Sarangapani JAGANNATHAN
    [J]. Control Theory and Technology, 2011, 9 (03) : 361 - 369
  • [48] Online optimal control of nonlinear discrete-time systems using approximate dynamic programming
    Dierks T.
    Jagannathan S.
    [J]. Journal of Control Theory and Applications, 2011, 9 (3): : 361 - 369
  • [49] Optimal train control by approximate dynamic programming: Comparison of three value function approximation methods
    Liu, Tong
    Xun, Jing
    Yin, Jiateng
    Xiao, Xiao
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2741 - 2746
  • [50] Novel iterative neural dynamic programming for data-based approximate optimal control design
    Mu, Chaoxu
    Wang, Ding
    He, Haibo
    [J]. AUTOMATICA, 2017, 81 : 240 - 252