Data-Driven Performance-Prescribed Reinforcement Learning Control of an Unmanned Surface Vehicle

被引:135
|
作者
Wang, Ning [1 ,2 ]
Gao, Ying [1 ]
Zhang, Xuefeng [1 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian 116026, Peoples R China
[2] Harbin Engn Univ, Coll Shipbldg Engn, Harbin 150001, Peoples R China
关键词
Optimal control; Vehicle dynamics; System dynamics; Field-flow fractionation; Transient analysis; Reinforcement learning; Steady-state; Data-driven control; optimal control; performance-prescribed control; reinforcement learning control; unmanned surface vehicle (USV); ADAPTIVE-CONTROL; NONLINEAR-SYSTEMS; DESIGN; ITERATION; TRACKING;
D O I
10.1109/TNNLS.2021.3056444
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An unmanned surface vehicle (USV) under complicated marine environments can hardly be modeled well such that model-based optimal control approaches become infeasible. In this article, a self-learning-based model-free solution only using input-output signals of the USV is innovatively provided. To this end, a data-driven performance-prescribed reinforcement learning control (DPRLC) scheme is created to pursue control optimality and prescribed tracking accuracy simultaneously. By devising state transformation with prescribed performance, constrained tracking errors are substantially converted into constraint-free stabilization of tracking errors with unknown dynamics. Reinforcement learning paradigm using neural network-based actor-critic learning framework is further deployed to directly optimize controller synthesis deduced from the Bellman error formulation such that transformed tracking errors evolve a data-driven optimal controller. Theoretical analysis eventually ensures that the entire DPRLC scheme can guarantee prescribed tracking accuracy, subject to optimal cost. Both simulations and virtual-reality experiments demonstrate the remarkable effectiveness and superiority of the proposed DPRLC scheme.
引用
收藏
页码:5456 / 5467
页数:12
相关论文
共 50 条
  • [41] Data-driven sliding mode control of shape memory alloy actuators with prescribed performance
    Liu, Mingfang
    Zhao, Zhirui
    Hao, Lina
    [J]. SMART MATERIALS AND STRUCTURES, 2021, 30 (06)
  • [42] Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning
    Wang, Shuwu
    Ma, Feng
    Yan, Xinping
    Wu, Peng
    Liu, Yuanchang
    [J]. APPLIED OCEAN RESEARCH, 2021, 110
  • [43] A Data-Driven Model for Evaluating the Survivability of Unmanned Aerial Vehicle Routes
    Jun Guo
    Wei Xia
    Huawei Ma
    Xiaoxuan Hu
    [J]. Journal of Intelligent & Robotic Systems, 2020, 100 : 629 - 646
  • [44] Research on Path Tracking Control Method of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Guo, Rui
    Yuan, Wei
    [J]. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [45] Adaptive dynamic programming and deep reinforcement learning for the control of an unmanned surface vehicle: Experimental results
    Gonzalez-Garcia, Alejandro
    Barragan-Alcantar, David
    Collado-Gonzalez, Ivana
    Garrido, Leonardo
    [J]. CONTROL ENGINEERING PRACTICE, 2021, 111
  • [46] A Data-Driven Model for Evaluating the Survivability of Unmanned Aerial Vehicle Routes
    Guo, Jun
    Xia, Wei
    Ma, Huawei
    Hu, Xiaoxuan
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 100 (02) : 629 - 646
  • [47] Data-Driven Energy Management of an Electric Vehicle Charging Station Using Deep Reinforcement Learning
    Rani, G. S. Asha
    Priya, P. S. Lal
    Jayan, Jino
    Satheesh, Rahul
    Kolhe, Mohan Lal
    [J]. IEEE ACCESS, 2024, 12 : 65956 - 65966
  • [48] Data-Driven Economic NMPC Using Reinforcement Learning
    Gros, Sebastien
    Zanon, Mario
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) : 636 - 648
  • [49] Data-driven crowd evacuation: A reinforcement learning method
    Yao, Zhenzhen
    Zhang, Guijuan
    Lu, Dianjie
    Liu, Hong
    [J]. NEUROCOMPUTING, 2019, 366 : 314 - 327
  • [50] Dynamic event-triggered data-driven iterative learning bipartite tracking control for nonlinear MASs with prescribed performance
    Tao SHI
    WeiWei CHE
    [J]. Science China(Information Sciences), 2025, 68 (01) : 292 - 305