Discounted near-optimal regulation of constrained nonlinear systems via generalized value iteration

被引：9

作者：

Wang, Ding ^{[1
,2
,3
]}

Zhao, Mingming ^{[1
,2
,3
]}

Ha, Mingming ^{[4
]}

Qiao, Junfei ^{[1
,2
,3
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

[2] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China

[3] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing, Peoples R China

[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2021年 / 31卷 / 17期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

adaptive critic; control constraints; convergence analysis; discounted optimal control; generalized value iteration; OPTIMAL TRACKING CONTROL; CONTROL SCHEME;

D O I：

10.1002/rnc.5729

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, a generalized value iteration algorithm is developed to address the discounted near-optimal control problem for discrete-time systems with control constraints. The initial cost function is permitted to be an arbitrary positive semi-definite function without being zero. First, a nonquadratic performance functional is utilized to overcome the challenge caused by saturating actuators. Then, the monotonicity and convergence of the iterative cost function sequence with the discount factor are analyzed. For facilitating the implementation of the iterative algorithm, two neural networks with Levenberg-Marquardt training algorithm are constructed to approximate the cost function and the control law. Furthermore, the initial control law is obtained by employing the fixed point iteration approach. Finally, two simulation examples are provided to validate the feasibility of the present strategy. It is emphasized that the established control laws are successfully constrained for randomly given initial state vectors.

引用

页码：8481 / 8503

页数：23

共 50 条

[1] Intelligent Optimal Tracking With Application Verifications via Discounted Generalized Value Iteration
Wang, Ding
Zhao, Ming-Ming
Ha, Ming-Ming
Qiao, Jun-Fei
[J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (01): : 182 - 193
[2] Generalized value iteration for discounted optimal control with stability analysis
Ha, Mingming
Wang, Ding
Liu, Derong
[J]. SYSTEMS & CONTROL LETTERS, 2021, 147 (147)
[3] Discounted Near-Optimal Control of Affine Systems via a Progressive Cost Evolution Formulation
Wang, Ding
Wu, Junlong
Hu, Lingzhi
Qiao, Junfei
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (04) : 1535 - 1539
[4] Online Value Iteration for Intelligent Discounted Tracking Design of Constrained Systems
Wang, Ding
Wu, Junlong
Ren, Jin
Qiao, Junfei
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (09) : 3829 - 3833
[5] AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity
Zeng, Yibo
Feng, Fei
Yin, Wotao
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 713 - 722
[6] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
Qiu, Yu-Qing
Li, Yan
Wang, Zhong
[J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (04) : 1319 - 1330
[7] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
Yu-Qing Qiu
Yan Li
Zhong Wang
[J]. International Journal of Control, Automation and Systems, 2023, 21 : 1319 - 1330
[8] Discounted near-optimal control of general continuous-action nonlinear systems using optimistic planning
Busoniu, Lucian
Pall, Elod
Munos, Remi
[J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 203 - 208
[9] Near-optimal PAC bounds for discounted MDPs
Lattimore, Tor
Hutter, Marcus
[J]. THEORETICAL COMPUTER SCIENCE, 2014, 558 : 125 - 143
[10] Near-optimal cheap control of nonlinear systems
Braslavsky, JH
Seron, MM
Kokotovic, PV
[J]. NONLINEAR CONTROL SYSTEMS DESIGN 1998, VOLS 1& 2, 1998, : 107 - 112

← 1 2 3 4 5 →