An Actor Critic Method for Free Terminal Time Optimal Control

Cited by: 0
Authors
Burton, Evan [1]
Nakamura-Zimmerer, Tenavi [1,2]
Gong, Qi [1]
Kang, Wei [1,3]
Affiliations
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95060 USA
[2] NASA Langley Res Ctr, Flight Dynam Branch, Hampton, VA 23666 USA
[3] Naval Postgrad Sch, Monterey, CA 93943 USA
Source
IFAC PAPERSONLINE | 2023 / Vol. 56 / Issue 01
Funding
National Science Foundation (USA);
Keywords
Iterative learning control; Non-smooth and discontinuous optimal control problems;
DOI
10.1016/j.ifacol.2023.02.009
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Optimal control problems with free terminal time present many challenges including nonsmooth and discontinuous control laws, irregular value functions, many local optima, and the curse of dimensionality. To overcome these issues, we propose an adaptation of the model-based actor-critic paradigm from the field of Reinforcement Learning via an exponential transformation to learn an approximate feedback control and value function pair. We demonstrate the algorithm's effectiveness on prototypical examples featuring each of the main pathological issues present in problems of this type. Copyright (c) 2023 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Pages: 49 - 54
Page count: 6
Related papers
50 total
  • [1] A NEW COMPUTATIONAL METHOD FOR A CLASS OF FREE TERMINAL TIME OPTIMAL CONTROL PROBLEMS
    Lin, Qun
    Loxton, Ryan
    Teo, Kok Lay
    Wu, Yong Hong
    PACIFIC JOURNAL OF OPTIMIZATION, 2011, 7 (01) : 63 - 81
  • [2] Actor-Critic Optimal Control for Semi-Markovian Jump Systems With Time Delay
    Zhang, Lulu
    Zhang, Huaguang
    Yue, Xiaohui
    Wang, Tianbiao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2164 - 2168
  • [3] A computational method for solving time-delay optimal control problems with free terminal time
    Liu, Chongyang
    Loxton, Ryan
    Teo, Kok Lay
    SYSTEMS & CONTROL LETTERS, 2014, 72 : 53 - 60
  • [4] Relaxed Actor-Critic With Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems
    Duan, Jingliang
    Li, Jie
    Ge, Qiang
    Li, Shengbo Eben
    Bujarbaruah, Monimoy
    Ma, Fei
    Zhang, Dezhao
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (05) : 3299 - 3311
  • [5] A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems
    Chai, Qinqin
    Wang, Wu
    APPLIED MATHEMATICAL MODELLING, 2018, 53 : 242 - 250
  • [6] Online Actor Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem
    Vamvoudakis, Kyriakos G.
    Lewis, Frank L.
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2009 : 58 - 65
  • [7] Actor-Critic or Critic-Actor? A Tale of Two Time Scales
    Bhatnagar, Shalabh
    Borkar, Vivek S.
    Guin, Soumyajit
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 2671 - 2676
  • [8] FRACTIONAL ORDER OPTIMAL CONTROL PROBLEMS WITH FREE TERMINAL TIME
    Pooseh, Shakoor
    Almeida, Ricardo
    Torres, Delfim F. M.
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2014, 10 (02) : 363 - 381
  • [9] An Exact Penalty Method for Free Terminal Time Optimal Control Problem with Continuous Inequality Constraints
    Jiang, Canghua
    Lin, Qun
    Yu, Changjun
    Teo, Kok Lay
    Duan, Guang-Ren
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (01) : 30 - 53