Approximate Optimal Stabilization Control of Servo Mechanisms based on Reinforcement Learning Scheme

被引:16
|
作者
Lv, Yongfeng [1 ]
Ren, Xuemei [1 ]
Hu, Shuangyi [1 ]
Xu, Hao [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming; neural networks; optimal control; reinforcement learning; servomechanisms; ZERO-SUM GAMES; TRACKING CONTROL; ROBUST-CONTROL; MOTION CONTROL; TIME-SYSTEMS; PERFORMANCE; DYNAMICS;
D O I
10.1007/s12555-018-0551-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A reinforcement learning (RL) based adaptive dynamic programming (ADP) is developed to learn the approximate optimal stabilization input of the servo mechanisms, where the unknown system dynamics are approximated with a three-layer neural network (NN) identifier. First, the servo mechanism model is constructed and a three-layer NN identifier is used to approximate the unknown servo system. The NN weights of both the hidden layer and output layer are synchronously tuned with an adaptive gradient law. An RL-based critic three-layer NN is then used to learn the optimal cost function, where NN weights of the first layer are set as constants, NN weights of the second layer are updated by minimizing the squared Hamilton-Jacobi-Bellman (HJB) error. The optimal stabilization input of the servomechanism is obtained based on the three-layer NN identifier and RL-based critic NN scheme, which can stabilize the motor speed from its initial value to the given value. Moreover, the convergence analysis of the identifier and RL-based critic NN is proved, the stability of the cost function with the proposed optimal input is analyzed. Finally, a servo mechanism model and a complex system are provided to verify the correctness of the proposed methods.
引用
收藏
页码:2655 / 2665
页数:11
相关论文
共 50 条
  • [1] Approximate Optimal Stabilization Control of Servo Mechanisms based on Reinforcement Learning Scheme
    Yongfeng Lv
    Xuemei Ren
    Shuangyi Hu
    Hao Xu
    [J]. International Journal of Control, Automation and Systems, 2019, 17 : 2655 - 2665
  • [2] Data Driven Optimal Stabilization Control and Simulation Based on Reinforcement Learning
    Lu, Chaolun
    Li, Yongqiang
    Feng, Yuanjing
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (04): : 345 - 352
  • [3] Reinforcement Learning and Optimal Control of PMSM Speed Servo System
    Zhao, Jianguo
    Yang, Chunyu
    Gao, Weinan
    Zhou, Linna
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (08) : 8305 - 8313
  • [4] Efficient model-based reinforcement learning for approximate online optimal control
    Kamalapurkar, Rushikesh
    Rosenfeld, Joel A.
    Dixon, Warren E.
    [J]. AUTOMATICA, 2016, 74 : 247 - 258
  • [5] Reinforcement Learning based Approximate Optimal Control of Nonlinear Systems using Carleman Linearization
    Kar, Jishnudeep
    Bai, He
    Chakrabortty, Aranya
    [J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3362 - 3367
  • [6] Model-Based Reinforcement Learning for Approximate Optimal Control with Temporal Logic Specifications
    Cohen, Max H.
    Belta, Cahn
    [J]. HSCC2021: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS-IOT WEEK), 2021,
  • [7] Model-based reinforcement learning for approximate optimal regulation
    Kamalapurkar, Rushikesh
    Walters, Patrick
    Dixon, Warren E.
    [J]. AUTOMATICA, 2016, 64 : 94 - 104
  • [8] An Intelligent Control Method for Servo Motor Based on Reinforcement Learning
    Gao, Depeng
    Wang, Shuai
    Yang, Yuwei
    Zhang, Haifei
    Chen, Hao
    Mei, Xiangxiang
    Chen, Shuxi
    Qiu, Jianlin
    [J]. ALGORITHMS, 2024, 17 (01)
  • [9] Reinforcement Learning for Image-Based Visual Servo Control
    Dani, Ashwin P.
    Bhasin, Shubhendu
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4358 - 4363
  • [10] Optimal tracking control of mechatronic servo system using integral reinforcement learning
    Chen, Wei
    Hu, Jian
    Xu, Chenchen
    Zhou, Haibo
    Yao, Jianyong
    Nie, Weirong
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (12) : 3072 - 3082