Data-Driven Performance-Prescribed Reinforcement Learning Control of an Unmanned Surface Vehicle

被引：135

作者：

Wang, Ning ^{[1
,2
]}

Gao, Ying ^{[1
]}

Zhang, Xuefeng ^{[1
]}

机构：

[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian 116026, Peoples R China

[2] Harbin Engn Univ, Coll Shipbldg Engn, Harbin 150001, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2021年 / 32卷 / 12期

关键词：

Optimal control; Vehicle dynamics; System dynamics; Field-flow fractionation; Transient analysis; Reinforcement learning; Steady-state; Data-driven control; optimal control; performance-prescribed control; reinforcement learning control; unmanned surface vehicle (USV); ADAPTIVE-CONTROL; NONLINEAR-SYSTEMS; DESIGN; ITERATION; TRACKING;

D O I：

10.1109/TNNLS.2021.3056444

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An unmanned surface vehicle (USV) under complicated marine environments can hardly be modeled well such that model-based optimal control approaches become infeasible. In this article, a self-learning-based model-free solution only using input-output signals of the USV is innovatively provided. To this end, a data-driven performance-prescribed reinforcement learning control (DPRLC) scheme is created to pursue control optimality and prescribed tracking accuracy simultaneously. By devising state transformation with prescribed performance, constrained tracking errors are substantially converted into constraint-free stabilization of tracking errors with unknown dynamics. Reinforcement learning paradigm using neural network-based actor-critic learning framework is further deployed to directly optimize controller synthesis deduced from the Bellman error formulation such that transformed tracking errors evolve a data-driven optimal controller. Theoretical analysis eventually ensures that the entire DPRLC scheme can guarantee prescribed tracking accuracy, subject to optimal cost. Both simulations and virtual-reality experiments demonstrate the remarkable effectiveness and superiority of the proposed DPRLC scheme.

引用

页码：5456 / 5467

页数：12

共 50 条

[41] Data-driven sliding mode control of shape memory alloy actuators with prescribed performance
Liu, Mingfang
Zhao, Zhirui
Hao, Lina
[J]. SMART MATERIALS AND STRUCTURES, 2021, 30 (06)
[42] Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning
Wang, Shuwu
Ma, Feng
Yan, Xinping
Wu, Peng
Liu, Yuanchang
[J]. APPLIED OCEAN RESEARCH, 2021, 110
[43] A Data-Driven Model for Evaluating the Survivability of Unmanned Aerial Vehicle Routes
Jun Guo
Wei Xia
Huawei Ma
Xiaoxuan Hu
[J]. Journal of Intelligent & Robotic Systems, 2020, 100 : 629 - 646
[44] Research on Path Tracking Control Method of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
Guo, Rui
Yuan, Wei
[J]. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
[45] Adaptive dynamic programming and deep reinforcement learning for the control of an unmanned surface vehicle: Experimental results
Gonzalez-Garcia, Alejandro
Barragan-Alcantar, David
Collado-Gonzalez, Ivana
Garrido, Leonardo
[J]. CONTROL ENGINEERING PRACTICE, 2021, 111
[46] A Data-Driven Model for Evaluating the Survivability of Unmanned Aerial Vehicle Routes
Guo, Jun
Xia, Wei
Ma, Huawei
Hu, Xiaoxuan
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 100 (02) : 629 - 646
[47] Data-Driven Energy Management of an Electric Vehicle Charging Station Using Deep Reinforcement Learning
Rani, G. S. Asha
Priya, P. S. Lal
Jayan, Jino
Satheesh, Rahul
Kolhe, Mohan Lal
[J]. IEEE ACCESS, 2024, 12 : 65956 - 65966
[48] Data-Driven Economic NMPC Using Reinforcement Learning
Gros, Sebastien
Zanon, Mario
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) : 636 - 648
[49] Data-driven crowd evacuation: A reinforcement learning method
Yao, Zhenzhen
Zhang, Guijuan
Lu, Dianjie
Liu, Hong
[J]. NEUROCOMPUTING, 2019, 366 : 314 - 327
[50] Dynamic event-triggered data-driven iterative learning bipartite tracking control for nonlinear MASs with prescribed performance
Tao SHI
WeiWei CHE
[J]. Science China(Information Sciences), 2025, 68 (01) : 292 - 305

← 1 2 3 4 5 →