Advanced Policy Learning Near-Optimal Regulation

被引:0
|
作者
Ding Wang [1 ,2 ]
Xiangnan Zhong [1 ,3 ]
机构
[1] IEEE
[2] the Faculty of Information Technology, Beijing University of Technology, and also with the Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology
[3] the Department of Electrical Engineering, University of North Texas
基金
中国国家自然科学基金;
关键词
Adaptive critic algorithm; learning control; neural approximation; nonaffine dynamics; optimal regulation;
D O I
暂无
中图分类号
O232 [最优控制];
学科分类号
070105 ; 0711 ; 071101 ; 0811 ; 081101 ;
摘要
Designing advanced design techniques for feedback stabilization and optimization of complex systems is important to the modern control field. In this paper, a near-optimal regulation method for general nonaffine dynamics is developed with the help of policy learning. For addressing the nonaffine nonlinearity, a pre-compensator is constructed, so that the augmented system can be formulated as affine-like form. Different cost functions are defined for original and transformed controlled plants and then their relationship is analyzed in detail. Additionally, an adaptive critic algorithm involving stability guarantee is employed to solve the augmented optimal control problem. At last, several case studies are conducted for verifying the stability, robustness, and optimality of a torsional pendulum plant with suitable cost.
引用
收藏
页码:743 / 749
页数:7
相关论文
共 50 条
  • [1] Advanced Policy Learning Near-Optimal Regulation
    Wang, Ding
    Zhong, Xiangnan
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (03) : 743 - 749
  • [2] Safe Learning for Near-Optimal Scheduling
    Busatto-Gaston, Damien
    Chakraborty, Debraj
    Guha, Shibashis
    Perez, Guillermo A.
    Raskin, Jean-Francois
    [J]. QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2021), 2021, 12846 : 235 - 254
  • [3] Near-Optimal Collaborative Learning in Bandits
    Reda, Clemence
    Vakili, Sattar
    Kaufmann, Emilie
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [4] OPTIMAL AND NEAR-OPTIMAL REGULATION OF SPACECRAFT SPIN AXES
    YIN, M
    GRIMMELL, WC
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1968, AC13 (01) : 57 - &
  • [5] Near-optimal control policy for loss networks
    Ku, CY
    Yen, DC
    Chang, IC
    Huang, SM
    Jordan, S
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2006, 34 (04): : 406 - 416
  • [6] Efficient Robot Skills Learning with Weighted Near-Optimal Experiences Policy Optimization
    Hou, Liwei
    Wang, Hengsheng
    Zou, Haoran
    Wang, Qun
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (03): : 1 - 20
  • [7] Learning Near-Optimal Cost-Sensitive Decision Policy for Object Detection
    Wu, Tianfu
    Zhu, Song-Chun
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 753 - 760
  • [8] Near-Optimal Provable Uniform Convergence in Offine Policy Evaluation for Reinforcement Learning
    Yin, Ming
    Bai, Yu
    Wang, Yu-Xiang
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [9] Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
    He, Jiafan
    Zhou, Dongruo
    Gu, Quanquan
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [10] Learning Near-Optimal Cost-Sensitive Decision Policy for Object Detection
    Wu, Tianfu
    Zhu, Song-Chun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (05) : 1013 - 1027