Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem

被引:0
|
作者
Lu, W. [1 ]
Ferrari, S. [1 ]
Fierro, R. [2 ]
Wettergren, T. A. [3 ]
机构
[1] Duke Univ, Dept Mech Engn & Mat Sci, LISC, Durham, NC 27706 USA
[2] Univ New Mexico, Dept Elect & Comp Engn, Multi Agent Robot Hybrid & Embedded Syst Lab, Albuquerque, NM 87131 USA
[3] Naval Undersea Warfare Ctr, Newport, RI USA
来源
关键词
Approximate dynamic programming (ADP); hybrid systems; optimal control; FRAMEWORK;
D O I
10.1117/12.919286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discrete event controller layer, a discrete-continuous interface layer, and a continuous state layer. The hybrid optimal control problem (HOCP) is to find the optimal discrete event decisions and the optimal continuous controls subject to a deterministic minimization of a scalar function regarding the system state and control over time. Due to the uncertainty of environment and complexity of the HOCP, the cost-to-go cannot be evaluated before the HDS explores the entire system state space; as a result, the optimal control, neither continuous nor discrete, is not available ahead of time. Therefore, ADP is adopted to learn the optimal control while the HDS is exploring the environment, because of the online advantage of ADP method. Furthermore, ADP can break the curses of dimensionality which other optimizing methods, such as dynamic programming (DP) and Markov decision process (MDP), are facing due to the high dimensions of HOCP.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING
    Diaz, Jorge A.
    Xu, Lei
    Sardarmehni, Tohid
    [J]. PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
  • [32] Hybrid control and dynamic programming
    Bensoussan, A
    Menaldi, JL
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS, 1997, 3 (04): : 395 - 442
  • [33] Approximate Optimal tracking Control for Nonlinear Discrete-time Switched Systems via Approximate Dynamic Programming
    Qin, Chunbin
    Huang, Yizhe
    Yang, Yabin
    Zhang, Jishi
    Liu, Xianxing
    [J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1456 - 1461
  • [34] Approximate dynamic programming approach for process control
    Lee, Jay H.
    Wong, Weechin
    [J]. JOURNAL OF PROCESS CONTROL, 2010, 20 (09) : 1038 - 1048
  • [35] Approximate dynamic programming for ship course control
    Bai, Xuerui
    Yi, Jianqiang
    Zhao, Dongbin
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 349 - +
  • [36] Approximate Dynamic Programming for Output Feedback Control
    Jiang Yu
    Jiang Zhong-Ping
    [J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820
  • [37] Approximate dynamic programming approach for process control
    Lee, Jay H.
    [J]. INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 459 - 464
  • [38] Approximate dynamic programming for the military inventory routing problem
    Rebekah S. McKenna
    Matthew J. Robbins
    Brian J. Lunday
    Ian M. McCormack
    [J]. Annals of Operations Research, 2020, 288 : 391 - 416
  • [39] Approximate dynamic programming for the military inventory routing problem
    McKenna, Rebekah S.
    Robbins, Matthew J.
    Lunday, Brian J.
    McCormack, Ian M.
    [J]. ANNALS OF OPERATIONS RESEARCH, 2020, 288 (01) : 391 - 416
  • [40] An approximate dynamic programming approach for a product distribution problem
    Topaloglu, H
    [J]. IIE TRANSACTIONS, 2005, 37 (08) : 697 - 710