An Actor Critic Method for Free Terminal Time Optimal Control

被引：0

作者：

Burton, Evan ^{[1
]}

Nakamura-Zimmerer, Tenavi ^{[1
,2
]}

Gong, Qi ^{[1
]}

Kang, Wei ^{[1
,3
]}

机构：

[1] Univ Calif Santa Cruz, Santa Cruz, CA 95060 USA

[2] NASA Langley Res Ctr, Flight Dynam Branch, Hampton, VA 23666 USA

[3] Naval Postgrad Sch, Monterey, CA 93943 USA

来源：

IFAC PAPERSONLINE | 2023年 / 56卷 / 01期

基金：

美国国家科学基金会;

关键词：

Iterative learning control; Non-smooth and discontinuous optimal control problems;

D O I：

10.1016/j.ifacol.2023.02.009

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Optimal control problems with free terminal time present many challenges including nonsmooth and discontinuous control laws, irregular value functions, many local optima, and the curse of dimensionality. To overcome these issues, we propose an adaptation of the model-based actor-critic paradigm from the field of Reinforcement Learning via an exponential transformation to learn an approximate feedback control and value function pair. We demonstrate the algorithm's effectiveness on prototypical examples featuring each of the main pathological issues present in problems of this type. Copyright (c) 2023 The Authors. This is an open access article under the CC BY-NC-ND license (<THESTERM>https://creativecommons.org/licenses/by-ne-nd/4.0/</THESTERM>)

引用

页码：49 / 54

页数：6

共 50 条

[11] Free Terminal Time Optimal Control Problem of an HIV Model Based on a Conjugate Gradient Method
Taesoo Jang
Hee-Dae Kwon
Jeehyun Lee
Bulletin of Mathematical Biology, 2011, 73 : 2408 - 2429
[12] Free Terminal Time Optimal Control Problem of an HIV Model Based on a Conjugate Gradient Method
Jang, Taesoo
Kwon, Hee-Dae
Lee, Jeehyun
BULLETIN OF MATHEMATICAL BIOLOGY, 2011, 73 (10) : 2408 - 2429
[13] Numerical Method for Solving Fractional Order Optimal Control Problems with Free and Non-Free Terminal Time
Al-Shaher, Oday I.
Mahmoudi, M.
Mechee, Mohammed S.
SYMMETRY-BASEL, 2023, 15 (03):
[14] Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm
Chen Xue-song
Yang Ming-sheng
Liu Fu-chun
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2891 - 2894
[15] An Online Actor/Critic Algorithm for Event-Triggered Optimal Control of Continuous-Time Nonlinear Systems
Vamvoudakis, Kyriakos G.
2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 1 - 6
[16] Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
Vamvoudakis, Kyriakos G.
Lewis, Frank L.
AUTOMATICA, 2010, 46 (05) : 878 - 888
[17] Optimal Tracking Control for Robotic Manipulator using Actor-Critic Network
Hu, Yong
Cui, Lingguo
Chai, Senchun
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1556 - 1561
[18] Online Critic-Identifier-Actor Algorithm for Optimal Control of Nonlinear Systems
Lin, Hanquan
Wei, Qinglai
Liu, Derong
2015 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2015, : 399 - 405
[19] Actor-Critic-Based Optimal Adaptive Control Design for Morphing Aircraft
Lee, Hanna
Kim, Seong-hun
Kim, Youdan
IFAC PAPERSONLINE, 2020, 53 (02): : 14863 - 14868
[20] Online identifier-actor-critic algorithm for optimal control of nonlinear systems
Lin, Hanquan
Wei, Qinglai
Liu, Derong
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 38 (03): : 317 - 335

← 1 2 3 4 5 →