DEMRL: Dynamic estimation meta reinforcement learning for path following on unseen unmanned surface vehicle

被引:3
|
作者
Jin, Kefan [1 ]
Zhu, Hao [2 ]
Gao, Rui [1 ]
Wang, Jian [1 ]
Wang, Hongdong [1 ]
Yi, Hong [1 ]
Shi, C. -J. Richard [3 ]
机构
[1] Shanghai Jiao Tong Univ, MOE Key Lab Marine Intelligent Equipment & Syst, State Key Lab Ocean Engn, Shanghai, Peoples R China
[2] Fudan Univ, Inst Brain Inspired Circuits & Syst, State Key Lab ASIC & Syst, Shanghai 201203, Peoples R China
[3] Univ Washington, Dept Elect & Comp Engn, Seattle, WA USA
基金
中国国家自然科学基金;
关键词
Meta reinforcement learning; Policy learning; Path following control; Unmanned surface vehicle; IDENTIFICATION; SHIP;
D O I
10.1016/j.oceaneng.2023.115958
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Reinforcement learning has been widely used for unmanned surface vehicle (USV) control tasks. However, the requirement of numerous training samples limits its transferability to new USVs. In this article, we propose a dynamic estimation meta reinforcement learning (DEMRL) approach that enables few-shot learning for the path following control policy. We first present a dynamic estimation method to learn a latent dynamic context feature. The learned context contains the hidden information of USV dynamics with only a few estimation samples. We then propose a meta reinforcement learning based training framework to learn the generalizable path following control policy. After that, given the prior knowledge from dynamic context, the well-trained policy can easily adapt to the target USV during the rapid adaptation process. This proposed method represents the initial effort in tackling the few-shot learning challenge associated with training reinforcement learning based USV path following policies. Extensive experiments demonstrate that the proposed method can achieve promising path following performance for unseen USV with very few training data and training volume.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Deep reinforcement learning-based controller for path following of an unmanned surface vehicle
    Woo, Joohyun
    Yu, Chanwoo
    Kim, Nakwan
    [J]. OCEAN ENGINEERING, 2019, 183 : 155 - 166
  • [2] LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle
    XIA Jiawei
    ZHU Xufang
    LIU Zhong
    XIA Qingtao
    [J]. Journal of Systems Engineering and Electronics, 2023, 34 (05) : 1343 - 1358
  • [3] LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle
    Xia, Jiawei
    Zhu, Xufang
    Liu, Zhong
    Xia, Qingtao
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (05) : 1343 - 1358
  • [4] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
    Xing, Bowen
    Wang, Xiao
    Yang, Liu
    Liu, Zhenchong
    Wu, Qingyun
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
  • [5] Efficient Path Following Algorithm for Unmanned Surface Vehicle
    Niu, Hanlin
    Lu, Yu
    Savvaris, Al
    Tsourdos, Antonios
    [J]. OCEANS 2016 - SHANGHAI, 2016,
  • [6] The Path Following Controller Design of Unmanned Surface Vehicle
    Zhao, Yongsheng
    Bai, Yiming
    [J]. PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 824 - 827
  • [7] Research on Path Tracking Control Method of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Guo, Rui
    Yuan, Wei
    [J]. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [8] Deep reinforcement learning-based controller for dynamic positioning of an unmanned surface vehicle
    Yuan, Wei
    Rui, Xingwen
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [9] Unmanned Surface Vehicle Path Following based on Path Parameter Description
    Xu, Machangxiao
    Wang, Yintao
    Han, Zhengqing
    Jia, Xiaobao
    [J]. GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [10] Path-Following Algorithms and Experiments for an Unmanned Surface Vehicle
    Bibuli, Marco
    Bruzzone, Gabriele
    Caccia, Massimo
    Lapierre, Lionel
    [J]. JOURNAL OF FIELD ROBOTICS, 2009, 26 (08) : 669 - 688