Adaptive Control of a Marine Vessel Based on Reinforcement Learning

被引:0
|
作者
Yin, Zhao [1 ]
He, Wei [1 ]
Sun, Changyin [2 ]
Li, Guang [3 ]
Yang, Chenguang [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Queen Mary Univ London, Mile End Rd, London E1 4NS, England
[4] South China Univ Technol, Coll Automat Sci & Engn, Key Lab Autonomous Syst & Networked Control, Guangzhou 510640, Guangdong, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Reinforcement Learning; Critic Neural Networks; Actor Neural Networks; Lyapunov Method; Marine Vessel; NEURAL-NETWORK CONTROL; TRAJECTORY TRACKING; SURFACE VESSELS; CONTINUOUS-TIME; DELAY SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, our main goal is to solve optimal control problem by using reinforcement learning (RL) algorithm for marine surface vessel system with known dynamic. And this algorithm is an optimal control algorithm based on policy iteration (PI), and it can obtain the suitable approximations of cost function and the optimized control policy. There are two neural networks (NNs), where critic NN aims to estimate the cost-to-go and actor NN is utilized to design suitable input controller and minimize the tracking error. A novel tuning method is given for critic NN and actor NN. The stability and convergence are proven by Lyapunov's direct method. Finally, the numerical simulations are conducted to demonstrate the feasibility and superiority of presented algorithm.
引用
收藏
页码:2735 / 2740
页数:6
相关论文
共 50 条
  • [21] Adaptive sliding mode control of robotic manipulator based on reinforcement learning
    Ren, Ziwu
    Chen, Jie
    Miao, Yunxi
    Miao, Yujie
    Guo, Zibo
    Hu, Biao
    Lin, Rui
    ASIAN JOURNAL OF CONTROL, 2024, 26 (05) : 2703 - 2718
  • [22] Safety Filtering for Reinforcement Learning-based Adaptive Cruise Control
    Hailemichael, Habtamu
    Ayalew, Beshah
    Kerbel, Lindsey
    Ivanco, Andrej
    Loiselle, Keith
    IFAC PAPERSONLINE, 2022, 55 (24): : 149 - 154
  • [23] Adaptive Coordinated Traffic Control for Arterial Intersections based on Reinforcement Learning
    Ma, Zian
    Xu, Chengcheng
    Kan, Yuheng
    Wang, Maonan
    Wu, Wei
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2562 - 2567
  • [24] Adaptive optimal trajectory tracking control of AUVs based on reinforcement learning
    Li, Zhifu
    Wang, Ming
    Ma, Ge
    ISA TRANSACTIONS, 2023, 137 : 122 - 132
  • [25] Adaptive PID Controller based on Reinforcement Learning for Wind Turbine Control
    Sedighizadeh, M.
    Rezazadeh, A.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 27, 2008, 27 : 257 - 262
  • [26] Adaptive optics control using model-based reinforcement learning
    Nousiainen, Jalo
    Rajani, Chang
    Kasper, Markus
    Helin, Tapio
    OPTICS EXPRESS, 2021, 29 (10) : 15327 - 15344
  • [27] A Reinforcement Learning Based Adaptive Traffic Signal Control for Vehicular Networks
    Krishnendhu, S. P.
    Reddy, Mainampati Vigneshwari
    Basumatary, Thulunga
    Mohandas, Prabu
    PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2022, 2023, 475 : 547 - 561
  • [28] Reward-Based Exploration: Adaptive Control for Deep Reinforcement Learning
    Xu, Zhi-xiong
    Cao, Lei
    Chen, Xi-liang
    Li, Chen-xi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (09): : 2409 - 2412
  • [29] Event-triggered Suboptimal Control Based Adaptive Reinforcement Learning
    Ma, Tengcong
    Song, Ruizhuo
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 64 - 69
  • [30] Adaptive Traffic Signal Control for Urban Corridor Based on Reinforcement Learning
    Liu, Lishan
    Zhuang, Xiya
    Li, Qiang
    COMPUTATIONAL AND EXPERIMENTAL SIMULATIONS IN ENGINEERING, ICCES 2024-VOL 2, 2025, 173 : 25 - 35