Model-Free Recurrent Reinforcement Learning for AUV Horizontal Control

被引:7
|
作者
Huo, Yujia [1 ,2 ]
Li, Yiping [1 ]
Feng, Xisheng [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
D O I
10.1088/1757-899X/428/1/012063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, aiming at the problems of 2-DOF horizontal motion control with high precision for autonomous underwater vehicle(AUV) trajectory tracking tasks, deep reinforcement learning controllers are applied to these conditions. These control problems are considered as a POMDP (Partially Observable Markov Decision Process). Model-free reinforcement learning(RL) algorithms for continuous control mission based on Deterministic Policy Gradient(DPG) allows robots learn from received delayed rewards when interacting with environments. Recurrent neural networks LSTM (Long Short-Term Memory) are involved into the reinforcement learning algorithm. Through this deep reinforcement learning algorithm, AUVs learn from sequences of dynamic information. The horizontal trajectory tracking tasks are described by LOS method and the motion control are idealized as a SISO model. Tanh-estimators are presented as data normalization. Moreover, AUV horizontal trajectory tracking and motion control simulation results demonstrate this algorithm gets better accuracy compared with the PID method and other non-recurrent methods. Efforts show the efficiency and effectiveness of the improved deep reinforcement learning algorithm.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] DATA-DRIVEN MODEL-FREE ITERATIVE LEARNING CONTROL USING REINFORCEMENT LEARNING
    Song, Bing
    Phan, Minh Q.
    Longman, Richard W.
    [J]. ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 2579 - 2597
  • [32] Model-free MIMO control tuning of a chiller process using reinforcement learning
    Rosdahl, Christian
    Bernhardsson, B. O.
    Eisenhower, Bryan
    [J]. SCIENCE AND TECHNOLOGY FOR THE BUILT ENVIRONMENT, 2023, 29 (08) : 782 - 794
  • [33] Online model-free reinforcement learning for the automatic control of a flexible wing aircraft
    Abouheaf, Mohammed
    Gueaieb, Wail
    Lewis, Frank
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2020, 14 (01): : 73 - 84
  • [34] Model-free Based Reinforcement Learning Control Strategy of Aircraft Attitude Systems
    Huang, Dingcui
    Hu, Jiangping
    Peng, Zhinan
    Chen, Bo
    Hao, Mingrui
    Ghosh, Bijoy Kumar
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 743 - 748
  • [35] Adaptive optics control with multi-agent model-free reinforcement learning
    Pou, B.
    Ferreira, F.
    Quinones, E.
    Gratadour, D.
    Martin, M.
    [J]. OPTICS EXPRESS, 2022, 30 (02) : 2991 - 3015
  • [36] A model-free deep reinforcement learning approach for control of exoskeleton gait patterns
    Rose, Lowell
    Bazzocchi, Michael C. F.
    Nejat, Goldie
    [J]. ROBOTICA, 2022, 40 (07) : 2189 - 2214
  • [37] Control of a Wave Energy Converter Using Model-free Deep Reinforcement Learning
    Chen, Kemeng
    Huang, Xuanrui
    Lin, Zechuan
    Xiao, Xi
    [J]. 2024 UKACC 14TH INTERNATIONAL CONFERENCE ON CONTROL, CONTROL, 2024, : 1 - 6
  • [38] Model-free Data-driven Predictive Control Using Reinforcement Learning
    Sawant, Shambhuraj
    Reinhardt, Dirk
    Kordabad, Arash Bahari
    Gros, Sebastien
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4046 - 4052
  • [39] Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control
    Biemann, Marco
    Scheller, Fabian
    Liu, Xiufeng
    Huang, Lizhen
    [J]. APPLIED ENERGY, 2021, 298
  • [40] Policy Learning with Constraints in Model-free Reinforcement Learning: A Survey
    Liu, Yongshuai
    Halev, Avishai
    Liu, Xin
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4508 - 4515