Approximate Optimal Stabilization Control of Servo Mechanisms based on Reinforcement Learning Scheme

被引:16
|
作者
Lv, Yongfeng [1 ]
Ren, Xuemei [1 ]
Hu, Shuangyi [1 ]
Xu, Hao [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming; neural networks; optimal control; reinforcement learning; servomechanisms; ZERO-SUM GAMES; TRACKING CONTROL; ROBUST-CONTROL; MOTION CONTROL; TIME-SYSTEMS; PERFORMANCE; DYNAMICS;
D O I
10.1007/s12555-018-0551-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A reinforcement learning (RL) based adaptive dynamic programming (ADP) is developed to learn the approximate optimal stabilization input of the servo mechanisms, where the unknown system dynamics are approximated with a three-layer neural network (NN) identifier. First, the servo mechanism model is constructed and a three-layer NN identifier is used to approximate the unknown servo system. The NN weights of both the hidden layer and output layer are synchronously tuned with an adaptive gradient law. An RL-based critic three-layer NN is then used to learn the optimal cost function, where NN weights of the first layer are set as constants, NN weights of the second layer are updated by minimizing the squared Hamilton-Jacobi-Bellman (HJB) error. The optimal stabilization input of the servomechanism is obtained based on the three-layer NN identifier and RL-based critic NN scheme, which can stabilize the motor speed from its initial value to the given value. Moreover, the convergence analysis of the identifier and RL-based critic NN is proved, the stability of the cost function with the proposed optimal input is analyzed. Finally, a servo mechanism model and a complex system are provided to verify the correctness of the proposed methods.
引用
收藏
页码:2655 / 2665
页数:11
相关论文
共 50 条
  • [21] Adaptive Optimal Tracking Control of Servo Mechanisms via Generalized Policy Learning
    Zhao, Jun
    Lv, Yongfeng
    Zhao, Ziliang
    Wang, Zhangu
    [J]. IEEE Transactions on Instrumentation and Measurement, 2024, 73
  • [22] Data-Based Reinforcement Learning Approximate Optimal Control for an Uncertain Nonlinear System with Partial Loss of Control Effectiveness
    Deptula, Patryk
    Bell, Zachary, I
    Doucette, Emily A.
    Curtis, J. Willard
    Dixon, Warren E.
    [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 2521 - 2526
  • [23] Model-Based Reinforcement Learning Control of Electrohydraulic Position Servo Systems
    Yao, Zhikai
    Liang, Xianglong
    Jiang, Guo-Ping
    Yao, Jianyong
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 28 (03) : 1446 - 1455
  • [24] Safe Deep Reinforcement Learning-Based Constrained Optimal Control Scheme for HEV Energy Management
    Liu, Zemin Eitan
    Zhou, Quan
    Li, Yanfei
    Shuai, Shijin
    Xu, Hongming
    [J]. IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2023, 9 (03): : 4278 - 4293
  • [25] Safe deep reinforcement learning-based constrained optimal control scheme for active distribution networks
    Kou, Peng
    Liang, Deliang
    Wang, Chen
    Wu, Zihao
    Gao, Lin
    [J]. APPLIED ENERGY, 2020, 264
  • [26] Dual Control for Approximate Bayesian Reinforcement Learning
    Klenske, Edgar D.
    Hennig, Philipp
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [27] Reinforcement Learning Based Interference Control Scheme in Heterogeneous Networks
    Lee, Yunseong
    Park, Laihyuk
    Noh, Wonjong
    Cho, Sungrae
    [J]. 2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 83 - 85
  • [28] A Learning Optimal Control Scheme for Robust Stabilization of a Class of Uncertain Nonlinear Systems
    Wang Ding
    Liu Derong
    Li Hongliang
    Yang Xiong
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7834 - 7839
  • [29] Inverse Reinforcement Learning in Tracking Control Based on Inverse Optimal Control
    Xue, Wenqian
    Kolaric, Patrik
    Fan, Jialu
    Lian, Bosen
    Chai, Tianyou
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) : 10570 - 10581
  • [30] Stabilization and Analytic Approximate Solutions of an Optimal Control Problem
    Pop, Camelia
    Petrisor, Camelia
    Ene, Remus-Daniel
    [J]. OPEN PHYSICS, 2018, 16 (01): : 476 - 487