Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

被引:35
|
作者
Radac, Mircea-Bogdan [1 ]
Precup, Radu-Emil [1 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300006, Romania
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 09期
关键词
adaptive actor-critic; model-free control; data-driven control; reinforcement learning; approximate dynamic programming; output reference model tracking; multi-input multi-output systems; vertical tank systems; Virtual Reference Feedback Tuning; CONJUGATE-GRADIENT ALGORITHM; TRAJECTORY TRACKING; NEURAL-NETWORKS; CONTROL DESIGN; SYSTEMS; FEEDBACK; ITERATION;
D O I
10.3390/app9091807
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Reference Feedback Tuning (VRFT) is proposed for obtaining an initially stabilizing NN nonlinear state-feedback controller, designed from input-state-output data collected from the process in open-loop setting. The solution offers systematic design guidelines for initial controller design. The resulting suboptimal state-feedback controller is next improved under the AAC learning framework by online adaptation of a critic NN and a controller NN. The mixed VRFT-AAC approach is validated on a multi-input multi-output nonlinear constrained coupled vertical two-tank system. Discussions on the control system behavior are offered together with comparisons with similar approaches.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning
    Ruofan Wu
    Zhikai Yao
    Jennie Si
    He(Helen) Huang
    IEEE/CAA Journal of Automatica Sinica, 2022, 9 (01) : 19 - 30
  • [32] Swarm Reinforcement Learning Method Based on an Actor-Critic Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 279 - 288
  • [33] Manipulator Motion Planning based on Actor-Critic Reinforcement Learning
    Li, Qiang
    Nie, Jun
    Wang, Haixia
    Lu, Xiao
    Song, Shibin
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 4248 - 4254
  • [34] Evaluating Correctness of Reinforcement Learning based on Actor-Critic Algorithm
    Kim, Youngjae
    Hussain, Manzoor
    Suh, Jae-Won
    Hong, Jang-Eui
    2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 320 - 325
  • [35] Data-driven model-free adaptive attitude control for morphing vehicles
    Che, Haohui
    Chen, Jun
    Wang, Yonghai
    Wang, Jianying
    Luo, Yunhao
    IET CONTROL THEORY AND APPLICATIONS, 2022, 16 (16): : 1696 - 1707
  • [36] Actor-Critic Traction Control Based on Reinforcement Learning with Open-Loop Training
    Drechsler, M. Funk
    Fiorentin, T. A.
    Goellinger, H.
    MODELLING AND SIMULATION IN ENGINEERING, 2021, 2021
  • [37] Actor-Critic Reinforcement Learning and Application in Developing Computer-Vision-Based Interface Tracking
    Dogru, Oguzhan
    Velswamy, Kirubakaran
    Huang, Biao
    ENGINEERING, 2021, 7 (09) : 1248 - 1261
  • [38] Model-Free Adaptive Iterative Learning Control Based on Data-Driven for Noncircular Turning Tool Feed System
    Zhao Yunjie
    Cao Rongmin
    Zhou Huixing
    THEORY, METHODOLOGY, TOOLS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, PT II, 2016, 644 : 3 - 10
  • [39] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
    Torres, Edgar
    Xu, Lei
    Sardarmehni, Tohid
    PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
  • [40] Simultaneous Vibration Control and Energy Harvesting Using Actor-Critic Based Reinforcement Learning
    Loong, Cheng Ning
    Chang, C. C.
    Dimitrakoloulos, Elias G.
    ACTIVE AND PASSIVE SMART STRUCTURES AND INTEGRATED SYSTEMS XII, 2018, 10595