Adaptive Control of a Marine Vessel Based on Reinforcement Learning

被引:0
|
作者
Yin, Zhao [1 ]
He, Wei [1 ]
Sun, Changyin [2 ]
Li, Guang [3 ]
Yang, Chenguang [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Queen Mary Univ London, Mile End Rd, London E1 4NS, England
[4] South China Univ Technol, Coll Automat Sci & Engn, Key Lab Autonomous Syst & Networked Control, Guangzhou 510640, Guangdong, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Reinforcement Learning; Critic Neural Networks; Actor Neural Networks; Lyapunov Method; Marine Vessel; NEURAL-NETWORK CONTROL; TRAJECTORY TRACKING; SURFACE VESSELS; CONTINUOUS-TIME; DELAY SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, our main goal is to solve optimal control problem by using reinforcement learning (RL) algorithm for marine surface vessel system with known dynamic. And this algorithm is an optimal control algorithm based on policy iteration (PI), and it can obtain the suitable approximations of cost function and the optimized control policy. There are two neural networks (NNs), where critic NN aims to estimate the cost-to-go and actor NN is utilized to design suitable input controller and minimize the tracking error. A novel tuning method is given for critic NN and actor NN. The stability and convergence are proven by Lyapunov's direct method. Finally, the numerical simulations are conducted to demonstrate the feasibility and superiority of presented algorithm.
引用
收藏
页码:2735 / 2740
页数:6
相关论文
共 50 条
  • [31] ADAPTIVE CONTROL BY REINFORCEMENT LEARNING FOR SPACECRAFT ATTITUDE CONTROL
    Ramadan, Mohammad
    Younes, Ahmad Bani
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1805 - 1815
  • [32] Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning
    Chen, Lin
    Dai, Shi-Lu
    Dong, Chao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7520 - 7533
  • [33] A LEARNING ARCHITECTURE BASED ON REINFORCEMENT LEARNING FOR ADAPTIVE-CONTROL OF THE WALKING MACHINE LAURON
    ILG, W
    BERNS, K
    ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 15 (04) : 321 - 334
  • [34] Special Issue on Aerospace and Mechanical Applications of Reinforcement Learning and Adaptive Learning Based Control
    How, Jonathan P.
    Chowdhary, Girish
    Walsh, Thomas
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2014, 11 (09): : 541 - 541
  • [35] Reinforcement learning based adaptive metaheuristics
    Tessari, Michele
    Iacca, Giovanni
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 1854 - 1861
  • [36] An Adaptive Authentication Based on Reinforcement Learning
    Cui, Ziqi
    Zhao, Yongxiang
    Li, Chunxi
    Zuo, Qi
    Zhang, Haipeng
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [37] Adaptive immunity based reinforcement learning
    Ito, Jungo
    Nakano, Kazushi
    Sakurama, Kazunori
    Hosokawa, Shu
    ARTIFICIAL LIFE AND ROBOTICS, 2008, 13 (01) : 188 - 193
  • [38] USV Path-Following Control Based On Deep Reinforcement Learning and Adaptive Control
    Gonzalez-Garcia, Alejandro
    Castaneda, Herman
    Garrido, Leonardo
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [39] Reinforcement learning to adaptive control of nonlinear systems
    Hwang, KS
    Tan, SW
    Tsai, MC
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2003, 33 (03): : 514 - 521
  • [40] QTCP: Adaptive Congestion Control with Reinforcement Learning
    Li, Wei
    Zhou, Fan
    Chowdhury, Kaushik Roy
    Meleis, Waleed
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2019, 6 (03): : 445 - 458