Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem

被引:0
|
作者
Lu, W. [1 ]
Ferrari, S. [1 ]
Fierro, R. [2 ]
Wettergren, T. A. [3 ]
机构
[1] Duke Univ, Dept Mech Engn & Mat Sci, LISC, Durham, NC 27706 USA
[2] Univ New Mexico, Dept Elect & Comp Engn, Multi Agent Robot Hybrid & Embedded Syst Lab, Albuquerque, NM 87131 USA
[3] Naval Undersea Warfare Ctr, Newport, RI USA
来源
关键词
Approximate dynamic programming (ADP); hybrid systems; optimal control; FRAMEWORK;
D O I
10.1117/12.919286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discrete event controller layer, a discrete-continuous interface layer, and a continuous state layer. The hybrid optimal control problem (HOCP) is to find the optimal discrete event decisions and the optimal continuous controls subject to a deterministic minimization of a scalar function regarding the system state and control over time. Due to the uncertainty of environment and complexity of the HOCP, the cost-to-go cannot be evaluated before the HDS explores the entire system state space; as a result, the optimal control, neither continuous nor discrete, is not available ahead of time. Therefore, ADP is adopted to learn the optimal control while the HDS is exploring the environment, because of the online advantage of ADP method. Furthermore, ADP can break the curses of dimensionality which other optimizing methods, such as dynamic programming (DP) and Markov decision process (MDP), are facing due to the high dimensions of HOCP.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Hybrid Approximate Dynamic Programming Approach for Dynamic Optimal Energy Flow in the Integrated Gas and Power Systems
    Shuai, Hang
    Ai, Xiaomeng
    Wen, Jinyu
    Fang, Jiakun
    Chen, Zhe
    He, Haibo
    [J]. 2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017,
  • [22] Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
    Lock, Jonathan
    McKelvey, Tomas
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (10) : 2854 - 2864
  • [23] An Approximate Dynamic Programming Approach to the Dynamic Traveling Repairperson Problem
    Shin, Hyung Sik
    Lall, Sanjay
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 2286 - 2291
  • [24] Approximate linear programming for a queueing control problem
    Samiedaluie, Saied
    Zhang, Dan
    Zhang, Rui
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2024, 169
  • [25] Approximate Dynamic Programming, Local or Global Optimal Solution?
    Heydari, Ali
    Balakrishnan, S. N.
    [J]. 2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 1237 - 1242
  • [26] On-policy Approximate Dynamic Programming for Optimal Control of non-linear systems
    Shalini, K.
    Vrushabh, D.
    Sonam, K.
    [J]. 2020 7TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'20), VOL 1, 2020, : 1058 - 1062
  • [27] ε-Optimal Value and Approximate Multidimensional Dual Dynamic Programming
    Nowakowski, A.
    [J]. ASIAN JOURNAL OF CONTROL, 2013, 15 (02) : 444 - 452
  • [28] Automata Theory Meets Approximate Dynamic Programming: Optimal Control with Temporal Logic Constraints
    Papusha, Ivan
    Fu, Jie
    Topcu, Ufuk
    Murray, Richard M.
    [J]. 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 434 - 440
  • [29] An approximate dynamic programming method for the optimal control of Alkai-Surfactant-Polymer flooding
    Ge, Yulei
    Li, Shurong
    Chan, Peng
    [J]. JOURNAL OF PROCESS CONTROL, 2018, 64 : 15 - 26
  • [30] Nonlinear Noncausal Optimal Control of Wave Energy Converters Via Approximate Dynamic Programming
    Zhan, Siyuan
    Na, Jing
    Li, Guang
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (11) : 6070 - 6079