Data-Driven Performance-Prescribed Reinforcement Learning Control of an Unmanned Surface Vehicle

被引:135
|
作者
Wang, Ning [1 ,2 ]
Gao, Ying [1 ]
Zhang, Xuefeng [1 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian 116026, Peoples R China
[2] Harbin Engn Univ, Coll Shipbldg Engn, Harbin 150001, Peoples R China
关键词
Optimal control; Vehicle dynamics; System dynamics; Field-flow fractionation; Transient analysis; Reinforcement learning; Steady-state; Data-driven control; optimal control; performance-prescribed control; reinforcement learning control; unmanned surface vehicle (USV); ADAPTIVE-CONTROL; NONLINEAR-SYSTEMS; DESIGN; ITERATION; TRACKING;
D O I
10.1109/TNNLS.2021.3056444
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An unmanned surface vehicle (USV) under complicated marine environments can hardly be modeled well such that model-based optimal control approaches become infeasible. In this article, a self-learning-based model-free solution only using input-output signals of the USV is innovatively provided. To this end, a data-driven performance-prescribed reinforcement learning control (DPRLC) scheme is created to pursue control optimality and prescribed tracking accuracy simultaneously. By devising state transformation with prescribed performance, constrained tracking errors are substantially converted into constraint-free stabilization of tracking errors with unknown dynamics. Reinforcement learning paradigm using neural network-based actor-critic learning framework is further deployed to directly optimize controller synthesis deduced from the Bellman error formulation such that transformed tracking errors evolve a data-driven optimal controller. Theoretical analysis eventually ensures that the entire DPRLC scheme can guarantee prescribed tracking accuracy, subject to optimal cost. Both simulations and virtual-reality experiments demonstrate the remarkable effectiveness and superiority of the proposed DPRLC scheme.
引用
收藏
页码:5456 / 5467
页数:12
相关论文
共 50 条
  • [31] Model-free Data-driven Predictive Control Using Reinforcement Learning
    Sawant, Shambhuraj
    Reinhardt, Dirk
    Kordabad, Arash Bahari
    Gros, Sebastien
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4046 - 4052
  • [32] Data-driven constrained reinforcement learning for optimal control of a multistage evaporation process
    Yao, Yao
    Ding, Jinliang
    Zhao, Chunhui
    Wang, Yonggang
    Chai, Tianyou
    [J]. CONTROL ENGINEERING PRACTICE, 2022, 129
  • [33] Reinforcement Learning based Data-driven Optimal Control Strategy for Systems with Disturbance
    Fan, Zhong-Xin
    Li, Shihua
    Liu, Rongjie
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 567 - 572
  • [34] Data-Driven Wind Farm Control via Multiplayer Deep Reinforcement Learning
    Dong, Hongyang
    Zhao, Xiaowei
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2023, 31 (03) : 1468 - 1475
  • [35] Adaptive State Feedback Shared Control for Unmanned Surface Vehicle With Fixed-Time Prescribed Performance Control
    He, Mengyue
    Li, Chaobo
    Huang, Hua
    Zhou, Fangfang
    He, Yuhang
    Shang, Wei
    [J]. IEEE ACCESS, 2024, 12 : 93781 - 93790
  • [36] Data-driven distributed formation control of under-actuated unmanned surface vehicles with collision avoidance via model-based deep reinforcement learning
    Pan, Chao
    Peng, Zhouhua
    Liu, Lu
    Wang, Dan
    [J]. OCEAN ENGINEERING, 2023, 267
  • [37] Unmanned-Surface-Vehicle-Aided Maritime Data Collection Using Deep Reinforcement Learning
    Su, Na
    Wang, Jun-Bo
    Zeng, Cheng
    Zhang, Hua
    Lin, Min
    Li, Geoffrey Ye
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20) : 19773 - 19786
  • [38] Data-driven discrete terminal sliding mode decoupling control method with prescribed performance
    Hou, Mingdong
    Wang, Yinsong
    Han, Yaozhen
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2021, 358 (13): : 6612 - 6633
  • [39] Data-driven prescribed performance platooning sliding mode control under DoS attacks
    Zhang, Peng
    Che, Wei-Wei
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024,
  • [40] Data-driven discrete terminal sliding mode decoupling control method with prescribed performance
    Hou, Mingdong
    Wang, Yinsong
    Han, Yaozhen
    [J]. Journal of the Franklin Institute, 2021, 358 (13) : 6612 - 6633