An Improved DDPG Reinforcement Learning Control of Underwater Gliders for Energy Optimization

被引:0
|
作者
Jing, Anyan [1 ]
Tang, Zuocheng [1 ]
Gao, Jian [1 ]
Pan, Guang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian, Peoples R China
关键词
underwater glider; reinforcement learning; deep deterministic policy gradient; prioritized experience replay; glide parameters optimization;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a novel underwater vehicle, underwater gliders are widely used in marine environment exploration. Underwater gliders are designed for long-term and long-distance operation, adaptivity and energy optimization is a critical requirement for controller design. In this paper, the reinforcement learning control is studied for underwater gliders, and the problem of slow learning convergence and unstable learning process of the DDPG reinforcement learning algorithm. The proposed solution is based on the priority experience replay method, which effectively increase the convergence speed and stability of the algorithm is addressed. The gliding control parameters are optimized to reduce the energy consumption is proposed, by using the improved DDPG algorithm and the energy consumption model. In the simulation experiments with an underwater glider, a set of glide parameters is obtained at a given gliding depth.
引用
收藏
页码:621 / 626
页数:6
相关论文
共 50 条
  • [21] Modelling and Control Optimization for Underwater Gliders of Variable Buoyancy Processes in Vertical Plane
    Yan S.
    Zhang R.
    Yang S.
    Niu W.
    Zhang Y.
    Li B.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (01): : 109 - 117
  • [22] Energy Consumption Minimization of Quadruped Robot Based on Reinforcement Learning of DDPG Algorithm
    Yan, Zhenzhuo
    Ji, Hongwei
    Chang, Qing
    ACTUATORS, 2024, 13 (01)
  • [23] UNDERWATER SHAKING TABLE TIME DELAY COMPENSATION AND CONTROL OPTIMIZATION BASED ON REINFORCEMENT LEARNING
    Tang J.-C.
    Li N.
    Li Z.-X.
    Gongcheng Lixue/Engineering Mechanics, 2023, 40 (12): : 64 - 75
  • [24] Elastic step DDPG: Multi-step reinforcement learning for improved sample efficiency
    Ly, Adrian
    Dazeley, Richard
    Vamplew, Peter
    Cruz, Francisco
    Aryal, Sunil
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [25] Deep Reinforcement Learning Enabled Network Routing Optimization Approach with an Enhanced DDPG Algorithm
    Meng, Lingyu
    Yang, Wen
    Guo, Bingli
    Huang, Shanguo
    2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
  • [26] Trajectory Tracking Control of Intelligent Vehicle Based on DDPG Method of Reinforcement Learning
    He, Yi-Lin
    Song, Ruo-Yang
    Ma, Jian
    Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2021, 34 (11): : 335 - 348
  • [27] Distributed formation control of a fleet of underwater gliders
    Ma, Xiaojuan
    Wang, Yanhui
    Niu, Wendong
    Ma, Wei
    Li, Shuai
    Xue, Changli
    OCEANS 2022, 2022,
  • [28] Path planning for underwater gliders in time-varying ocean current using deep reinforcement learning
    Lan, Wei
    Jin, Xiang
    Chang, Xin
    Wang, Tianlin
    Zhou, Han
    Tian, Wei
    Zhou, Lilei
    OCEAN ENGINEERING, 2022, 262
  • [29] Optimization design of neutrally buoyant hull for underwater gliders
    Yang, Ming
    Yang, Shaoqiong
    Wang, Yanhui
    Liang, Yan
    Wang, Shuxin
    Zhang, Lianhong
    OCEAN ENGINEERING, 2020, 209 (209)
  • [30] Control parameter optimization of underwater gliders with uncertainty using hierarchical optimization method and interval analysis theory
    Tan, Lijie
    Wu, Hongyu
    Jiang, Zhihong
    Wu, Qingjian
    Hao, Yuxing
    Yang, Yunqiang
    Yan, Shaoze
    ENGINEERING OPTIMIZATION, 2024, 56 (12) : 2387 - 2407