An Actor Critic Method for Free Terminal Time Optimal Control

被引:0
|
作者
Burton, Evan [1 ]
Nakamura-Zimmerer, Tenavi [1 ,2 ]
Gong, Qi [1 ]
Kang, Wei [1 ,3 ]
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95060 USA
[2] NASA Langley Res Ctr, Flight Dynam Branch, Hampton, VA 23666 USA
[3] Naval Postgrad Sch, Monterey, CA 93943 USA
来源
IFAC PAPERSONLINE | 2023年 / 56卷 / 01期
基金
美国国家科学基金会;
关键词
Iterative learning control; Non-smooth and discontinuous optimal control problems;
D O I
10.1016/j.ifacol.2023.02.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimal control problems with free terminal time present many challenges including nonsmooth and discontinuous control laws, irregular value functions, many local optima, and the curse of dimensionality. To overcome these issues, we propose an adaptation of the model-based actor-critic paradigm from the field of Reinforcement Learning via an exponential transformation to learn an approximate feedback control and value function pair. We demonstrate the algorithm's effectiveness on prototypical examples featuring each of the main pathological issues present in problems of this type. Copyright (c) 2023 The Authors. This is an open access article under the CC BY-NC-ND license (<THESTERM>https://creativecommons.org/licenses/by-ne-nd/4.0/</THESTERM>)
引用
收藏
页码:49 / 54
页数:6
相关论文
共 50 条
  • [11] Free Terminal Time Optimal Control Problem of an HIV Model Based on a Conjugate Gradient Method
    Taesoo Jang
    Hee-Dae Kwon
    Jeehyun Lee
    Bulletin of Mathematical Biology, 2011, 73 : 2408 - 2429
  • [12] Free Terminal Time Optimal Control Problem of an HIV Model Based on a Conjugate Gradient Method
    Jang, Taesoo
    Kwon, Hee-Dae
    Lee, Jeehyun
    BULLETIN OF MATHEMATICAL BIOLOGY, 2011, 73 (10) : 2408 - 2429
  • [13] Numerical Method for Solving Fractional Order Optimal Control Problems with Free and Non-Free Terminal Time
    Al-Shaher, Oday I.
    Mahmoudi, M.
    Mechee, Mohammed S.
    SYMMETRY-BASEL, 2023, 15 (03):
  • [14] Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm
    Chen Xue-song
    Yang Ming-sheng
    Liu Fu-chun
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2891 - 2894
  • [15] An Online Actor/Critic Algorithm for Event-Triggered Optimal Control of Continuous-Time Nonlinear Systems
    Vamvoudakis, Kyriakos G.
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 1 - 6
  • [16] Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    Vamvoudakis, Kyriakos G.
    Lewis, Frank L.
    AUTOMATICA, 2010, 46 (05) : 878 - 888
  • [17] Optimal Tracking Control for Robotic Manipulator using Actor-Critic Network
    Hu, Yong
    Cui, Lingguo
    Chai, Senchun
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1556 - 1561
  • [18] Online Critic-Identifier-Actor Algorithm for Optimal Control of Nonlinear Systems
    Lin, Hanquan
    Wei, Qinglai
    Liu, Derong
    2015 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2015, : 399 - 405
  • [19] Actor-Critic-Based Optimal Adaptive Control Design for Morphing Aircraft
    Lee, Hanna
    Kim, Seong-hun
    Kim, Youdan
    IFAC PAPERSONLINE, 2020, 53 (02): : 14863 - 14868
  • [20] Online identifier-actor-critic algorithm for optimal control of nonlinear systems
    Lin, Hanquan
    Wei, Qinglai
    Liu, Derong
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 38 (03): : 317 - 335