Advanced Policy Learning Near-Optimal Regulation

被引:0
|
作者
Ding Wang [1 ,2 ]
Xiangnan Zhong [1 ,3 ]
机构
[1] IEEE
[2] the Faculty of Information Technology, Beijing University of Technology, and also with the Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology
[3] the Department of Electrical Engineering, University of North Texas
基金
中国国家自然科学基金;
关键词
Adaptive critic algorithm; learning control; neural approximation; nonaffine dynamics; optimal regulation;
D O I
暂无
中图分类号
O232 [最优控制];
学科分类号
070105 ; 0711 ; 071101 ; 0811 ; 081101 ;
摘要
Designing advanced design techniques for feedback stabilization and optimization of complex systems is important to the modern control field. In this paper, a near-optimal regulation method for general nonaffine dynamics is developed with the help of policy learning. For addressing the nonaffine nonlinearity, a pre-compensator is constructed, so that the augmented system can be formulated as affine-like form. Different cost functions are defined for original and transformed controlled plants and then their relationship is analyzed in detail. Additionally, an adaptive critic algorithm involving stability guarantee is employed to solve the augmented optimal control problem. At last, several case studies are conducted for verifying the stability, robustness, and optimality of a torsional pendulum plant with suitable cost.
引用
收藏
页码:743 / 749
页数:7
相关论文
共 50 条
  • [21] Near-Optimal Reinforcement Learning with Self-Play
    Bai, Yu
    Jin, Chi
    Yu, Tiancheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [22] Near-Optimal No-Regret Learning in General Games
    Daskalakis, Constantinos
    Fishelson, Maxwell
    Golowich, Noah
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] Learning to search efficiently for causally near-optimal treatments
    Hakansson, Samuel
    Lindblom, Viktor
    Gottesman, Omer
    Johansson, Fredrik D.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [24] Evaluating near-optimal scenarios with EnergyPLAN to support policy makers
    Prina, Matteo Giacomo
    Johannsen, Rasmus Magni
    Sparber, Wolfram
    Ostergaard, Poul Alberg
    [J]. SMART ENERGY, 2023, 10
  • [25] The dynamics and near-optimal controls of a dengue model with threshold policy
    You, Wei
    Meyer-Baese, Anke
    Xu, Xinzhong
    Zhang, Qimin
    [J]. MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2024,
  • [26] Near-optimal blacklisting
    Dimitrakakis, Christos
    Mitrokotsa, Aikaterini
    [J]. COMPUTERS & SECURITY, 2017, 64 : 110 - 121
  • [27] Near-Optimal Rapid MPC Using Neural Networks: A Primal-Dual Policy Learning Framework
    Zhang, Xiaojing
    Bujarbaruah, Monimoy
    Borrelli, Francesco
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (05) : 2102 - 2114
  • [28] Comparative Synthesis: Learning Near-Optimal Network Designs by Query
    Wang, Yanjun
    Li, Zixuan
    Jiang, Chuan
    Qiu, Xiaokang
    Rao, Sanjay
    [J]. PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2023, 7 (POPL): : 91 - 120
  • [29] LEARNING A NEAR-OPTIMAL ESTIMATOR FOR SURFACE SHAPE FROM SHADING
    KNILL, DC
    KERSTEN, D
    [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1990, 50 (01): : 75 - 100
  • [30] Selecting Near-Optimal Approximate State Representations in Reinforcement Learning
    Ortner, Ronald
    Maillard, Odalric-Ambrym
    Ryabko, Daniil
    [J]. Algorithmic Learning Theory (ALT 2014), 2014, 8776 : 140 - 154