Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

Cited by: 35

Authors:

Radac, Mircea-Bogdan [1]

Precup, Radu-Emil [1]

Affiliations:

[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300006, Romania

Source:

APPLIED SCIENCES-BASEL | 2019 / Vol. 9 / Issue 09

Keywords:

adaptive actor-critic; model-free control; data-driven control; reinforcement learning; approximate dynamic programming; output reference model tracking; multi-input multi-output systems; vertical tank systems; Virtual Reference Feedback Tuning; CONJUGATE-GRADIENT ALGORITHM; TRAJECTORY TRACKING; NEURAL-NETWORKS; CONTROL DESIGN; SYSTEMS; FEEDBACK; ITERATION;

DOI:

10.3390/app9091807

Chinese Library Classification (CLC) number:

O6 [Chemistry];

Discipline classification code:

0703;

Abstract:

This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Reference Feedback Tuning (VRFT) is proposed for obtaining an initially stabilizing NN nonlinear state-feedback controller, designed from input-state-output data collected from the process in an open-loop setting. The solution offers systematic design guidelines for initial controller design. The resulting suboptimal state-feedback controller is next improved under the AAC learning framework by online adaptation of a critic NN and a controller NN. The mixed VRFT-AAC approach is validated on a multi-input multi-output nonlinear constrained coupled vertical two-tank system. Discussions on the control system behavior are offered together with comparisons with similar approaches.
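The VRFT step named in the abstract can be illustrated with a minimal sketch. It is not the paper's design: the abstract describes an NN nonlinear state-feedback controller for a two-tank process, whereas this toy example assumes a scalar linear plant, a first-order reference model, and a linear controller u[k] = theta[0]*r[k] + theta[1]*y[k]. The function name `vrft_state_feedback` and all numeric values are hypothetical choices made for illustration.

```python
import numpy as np


def vrft_state_feedback(u, y, a_m):
    """Fit u[k] = theta[0]*r[k] + theta[1]*y[k] from open-loop data.

    Assumed reference model (illustrative, not from the paper):
        y[k+1] = a_m*y[k] + (1 - a_m)*r[k]
    u has length N and y has length N + 1.
    """
    u = np.asarray(u, dtype=float)
    y = np.asarray(y, dtype=float)
    # Virtual reference: the r[k] that would make the reference model
    # reproduce the measured output sequence exactly.
    r_virtual = (y[1:] - a_m * y[:-1]) / (1.0 - a_m)
    # Least-squares fit of the controller so that it maps
    # (virtual reference, measured state) to the recorded input.
    regressors = np.column_stack([r_virtual, y[:-1]])
    theta, *_ = np.linalg.lstsq(regressors, u[: len(r_virtual)], rcond=None)
    return theta


# Open-loop experiment on a plant that the tuning step never sees directly.
a, b = 0.9, 0.5        # hypothetical plant: y[k+1] = a*y[k] + b*u[k]
a_m = 0.6              # hypothetical reference-model pole
rng = np.random.default_rng(0)
n_samples = 200
u = rng.uniform(-1.0, 1.0, n_samples)
y = np.zeros(n_samples + 1)
for k in range(n_samples):
    y[k + 1] = a * y[k] + b * u[k]

theta = vrft_state_feedback(u, y, a_m)
print(theta)
```

In this noise-free linear case the least-squares fit coincides with the model-matching gains (1 - a_m)/b and (a_m - a)/b, even though only input-output data were used. With measurement noise or a nonlinear plant the fit is only approximate, which is where a further learning stage such as the AAC adaptation described above would come in.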


Pages: 24

50 items in total

[31] Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning
Ruofan Wu
Zhikai Yao
Jennie Si
He (Helen) Huang
IEEE/CAA Journal of Automatica Sinica, 2022, 9 (01) : 19 - 30
[32] Swarm Reinforcement Learning Method Based on an Actor-Critic Method
Iima, Hitoshi
Kuroe, Yasuaki
SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 279 - 288
[33] Manipulator Motion Planning based on Actor-Critic Reinforcement Learning
Li, Qiang
Nie, Jun
Wang, Haixia
Lu, Xiao
Song, Shibin
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021 : 4248 - 4254
[34] Evaluating Correctness of Reinforcement Learning based on Actor-Critic Algorithm
Kim, Youngjae
Hussain, Manzoor
Suh, Jae-Won
Hong, Jang-Eui
2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022 : 320 - 325
[35] Data-driven model-free adaptive attitude control for morphing vehicles
Che, Haohui
Chen, Jun
Wang, Yonghai
Wang, Jianying
Luo, Yunhao
IET CONTROL THEORY AND APPLICATIONS, 2022, 16 (16) : 1696 - 1707
[36] Actor-Critic Traction Control Based on Reinforcement Learning with Open-Loop Training
Drechsler, M. Funk
Fiorentin, T. A.
Goellinger, H.
MODELLING AND SIMULATION IN ENGINEERING, 2021, 2021
[37] Actor-Critic Reinforcement Learning and Application in Developing Computer-Vision-Based Interface Tracking
Dogru, Oguzhan
Velswamy, Kirubakaran
Huang, Biao
ENGINEERING, 2021, 7 (09) : 1248 - 1261
[38] Model-Free Adaptive Iterative Learning Control Based on Data-Driven for Noncircular Turning Tool Feed System
Zhao Yunjie
Cao Rongmin
Zhou Huixing
THEORY, METHODOLOGY, TOOLS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, PT II, 2016, 644 : 3 - 10
[39] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
Torres, Edgar
Xu, Lei
Sardarmehni, Tohid
PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022
[40] Simultaneous Vibration Control and Energy Harvesting Using Actor-Critic Based Reinforcement Learning
Loong, Cheng Ning
Chang, C. C.
Dimitrakopoulos, Elias G.
ACTIVE AND PASSIVE SMART STRUCTURES AND INTEGRATED SYSTEMS XII, 2018, 10595
