Discounted near-optimal regulation of constrained nonlinear systems via generalized value iteration

Cited by: 9
Authors
Wang, Ding [1 ,2 ,3 ]
Zhao, Mingming [1 ,2 ,3 ]
Ha, Mingming [4 ]
Qiao, Junfei [1 ,2 ,3 ]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[3] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
adaptive critic; control constraints; convergence analysis; discounted optimal control; generalized value iteration; OPTIMAL TRACKING CONTROL; CONTROL SCHEME;
DOI
10.1002/rnc.5729
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812 ;
Abstract
In this article, a generalized value iteration algorithm is developed to address the discounted near-optimal control problem for discrete-time systems with control constraints. The initial cost function may be an arbitrary positive semi-definite function and need not be zero. First, a nonquadratic performance functional is used to handle saturating actuators. Then, the monotonicity and convergence of the iterative cost function sequence under the discount factor are analyzed. To facilitate implementation of the iterative algorithm, two neural networks trained with the Levenberg-Marquardt algorithm are constructed to approximate the cost function and the control law. Furthermore, the initial control law is obtained by a fixed-point iteration approach. Finally, two simulation examples are provided to validate the feasibility of the proposed strategy. The resulting control laws remain within the constraints for randomly chosen initial state vectors.
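The iteration described in the abstract can be illustrated with a small tabular sketch. This is not the authors' implementation: the scalar dynamics in `step`, the weights `Q` and `R`, the bound `U_BAR`, the grids, and the inverse-hyperbolic-tangent form of the nonquadratic control cost are all assumptions chosen for illustration, and a lookup table with linear interpolation stands in for the two neural networks mentioned in the abstract. The sketch runs the discounted update V_{i+1}(x) = min_u [U(x, u) + gamma * V_i(x')] from two different positive semi-definite initial cost functions and compares the results.

```python
import numpy as np

# All constants below are illustrative assumptions, not values from the article.
U_BAR = 1.0    # control bound: |u| < U_BAR
Q, R = 1.0, 1.0
GAMMA = 0.95   # discount factor


def step(x, u):
    """Hypothetical scalar nonlinear system x_{k+1} = 0.8 sin(x_k) + 0.5 u_k."""
    return 0.8 * np.sin(x) + 0.5 * u


def control_cost(u):
    """Nonquadratic cost 2*R*U_BAR * integral_0^u arctanh(s/U_BAR) ds (closed form)."""
    s = u / U_BAR
    return 2.0 * R * U_BAR * u * np.arctanh(s) + R * U_BAR**2 * np.log1p(-s**2)


def generalized_value_iteration(v0, xs, us, tol=1e-8, max_iter=2000):
    """Iterate V_{i+1}(x) = min_u [Q x^2 + W(u) + GAMMA * V_i(step(x, u))] on a grid."""
    X, U = np.meshgrid(xs, us, indexing="ij")
    stage = Q * X**2 + control_cost(U)   # utility for every (state, control) pair
    x_next = step(X, U)                  # successor states, all inside the grid range
    v = v0(xs)                           # arbitrary positive semi-definite start
    for i in range(1, max_iter + 1):
        v_next = np.interp(x_next.ravel(), xs, v).reshape(x_next.shape)
        q = stage + GAMMA * v_next
        v_new = q.min(axis=1)
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break
    # Greedy control from the last evaluated table; it lies on the bounded grid `us`.
    policy = us[q.argmin(axis=1)]
    return v, policy, i


xs = np.linspace(-2.0, 2.0, 201)               # state grid
us = U_BAR * np.linspace(-0.99, 0.99, 199)     # control grid strictly inside the bound

# Two different initial cost functions: identically zero and a quadratic.
v_zero, policy_zero, iters_zero = generalized_value_iteration(np.zeros_like, xs, us)
v_quad, policy_quad, iters_quad = generalized_value_iteration(lambda x: x**2, xs, us)

print("iterations:", iters_zero, iters_quad)
print("max gap between the two limits:", np.max(np.abs(v_zero - v_quad)))
```

Because the minimization only searches controls inside the bound, the returned policy satisfies the constraint by construction. In this toy setting both initializations should settle on the same cost table up to the stopping tolerance, which mirrors the convergence property the abstract claims for arbitrary positive semi-definite initial cost functions.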
Pages: 8481-8503
Number of pages: 23
Related papers
50 in total
  • [1] Intelligent Optimal Tracking With Application Verifications via Discounted Generalized Value Iteration
    Wang, Ding
    Zhao, Ming-Ming
    Ha, Ming-Ming
    Qiao, Jun-Fei
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (01): : 182 - 193
  • [2] Generalized value iteration for discounted optimal control with stability analysis
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. SYSTEMS & CONTROL LETTERS, 2021, 147 (147)
  • [3] Discounted Near-Optimal Control of Affine Systems via a Progressive Cost Evolution Formulation
    Wang, Ding
    Wu, Junlong
    Hu, Lingzhi
    Qiao, Junfei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (04) : 1535 - 1539
  • [4] Online Value Iteration for Intelligent Discounted Tracking Design of Constrained Systems
    Wang, Ding
    Wu, Junlong
    Ren, Jin
    Qiao, Junfei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (09) : 3829 - 3833
  • [5] AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity
    Zeng, Yibo
    Feng, Fei
    Yin, Wotao
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 713 - 722
  • [6] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
    Qiu, Yu-Qing
    Li, Yan
    Wang, Zhong
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (04) : 1319 - 1330
  • [7] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
    Yu-Qing Qiu
    Yan Li
    Zhong Wang
    [J]. International Journal of Control, Automation and Systems, 2023, 21 : 1319 - 1330
  • [8] Discounted near-optimal control of general continuous-action nonlinear systems using optimistic planning
    Busoniu, Lucian
    Pall, Elod
    Munos, Remi
    [J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 203 - 208
  • [9] Near-optimal PAC bounds for discounted MDPs
    Lattimore, Tor
    Hutter, Marcus
    [J]. THEORETICAL COMPUTER SCIENCE, 2014, 558 : 125 - 143
  • [10] Near-optimal cheap control of nonlinear systems
    Braslavsky, JH
    Seron, MM
    Kokotovic, PV
    [J]. NONLINEAR CONTROL SYSTEMS DESIGN 1998, VOLS 1& 2, 1998, : 107 - 112