State Super Sampling Soft Actor-Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment

被引:2
|
作者
Wang, Zhuo [1 ]
Sui, Yancheng [1 ]
Qin, Hongde [1 ]
Lu, Hao [1 ]
机构
[1] Harbin Engn Univ, Sch Naval Engn, Harbin 150001, Peoples R China
关键词
multi-agent reinforcement learning; Soft Actor-Critic; generative adversarial networks; multiple autonomous underwater vehicle hunting;
D O I
10.3390/jmse11071257
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Reinforcement learning (RL) is known for its efficiency and practicality in single-agent planning, but it faces numerous challenges when applied to multi-agent scenarios. In this paper, a Super Sampling Info-GAN (SSIG) algorithm based on Generative Adversarial Networks (GANs) is proposed to address the problem of state instability in Multi-Agent Reinforcement Learning (MARL). The SSIG model allows a pair of GAN networks to analyze the previous state of dynamic system and predict the future state of consecutive state pairs. A multi-agent system (MAS) can deduce the complete state of all collaborating agents through SSIG. The proposed model has the potential to be employed in multi-autonomous underwater vehicle (multi-AUV) planning scenarios by combining it with the Soft Actor-Critic (SAC) algorithm. Hence, this paper presents State Super Sampling Soft Actor-Critic (S4AC), which is a new algorithm that combines the advantages of SSIG and SAC and can be applied to Multi-AUV hunting tasks. The simulation results demonstrate that the proposed algorithm has strong learning ability and adaptability and has a considerable success rate in hunting the evading target in multiple testing scenarios.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Manoeuvring of underwater snake robot with tail thrust using the actor-critic neural network super-twisting sliding mode control in the uncertain environment and disturbances
    Patel, Bhavik M.
    Dwivedy, Santosha K.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023,
  • [32] Potential field hierarchical reinforcement learning approach for target search by multi-AUV in 3-D underwater environments
    Cao, Xiang
    Sun, Hongbing
    Guo, Liqiang
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2020, 93 (07) : 1677 - 1683
  • [33] Improved RRT Algorithm for AUV Target Search in Unknown 3D Environment
    Li, Juan
    Li, Chengyue
    Chen, Tao
    Zhang, Yun
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (06)
  • [34] Soft Actor-Critic Based 3-D Deployment and Power Allocation in Cell-Free Unmanned Aerial Vehicle Networks
    Xu, Fanfei
    Ruan, Yuhan
    Li, Yongzhao
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (10) : 1692 - 1696
  • [35] Optimal dynamic thermal management for data center via soft actor-critic algorithm with dynamic control interval and combined-value state space
    Guo, Yuxiang
    Qu, Shengli
    Wang, Chuang
    Xing, Ziwen
    Duan, Kaiwen
    [J]. APPLIED ENERGY, 2024, 373
  • [36] Coverage Optimization Algorithm Based on Sampling for 3D Underwater Sensor Networks
    Du Xiaoyu
    Sun Lijuan
    Liu Linfeng
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2013,
  • [37] Obstacle Avoidance Control of Multi-AUV Formation with Third-Order Dynamics Based on IAPF in 3D Environments
    Wang, Linling
    Zhu, Daqi
    Pang, Wen
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3711 - 3716
  • [38] Multi-AUV dynamic trajectory optimization and collaborative search combined with task urgency and energy consumption scheduling in 3-D underwater environment with random ocean currents and uncertain obstacles
    Bai, Guiqiang
    Chen, Yanli
    Hu, Xinyu
    Shi, Yu
    Jiang, Wenwen
    Zhang, Xueqing
    [J]. OCEAN ENGINEERING, 2023, 275
  • [39] Automatic Delineation of the 3D Left Atrium From LGE-MRI: Actor-Critic Based Detection and Semi-Supervised Segmentation
    Xiang, Shun
    Li, Nana
    Wang, Yuanquan
    Zhou, Shoujun
    Wei, Jin
    Li, Shuo
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (06) : 3545 - 3556
  • [40] Multi-Objective Prioritized Task Scheduler Using Improved Asynchronous Advantage Actor Critic (a3c) Algorithm in Multi Cloud Environment
    Mangalampalli, S. Sudheer
    Karri, Ganesh Reddy
    Mohanty, Sachi Nandan
    Ali, Shahid
    Ijaz Khan, Muhammad
    Abdullaev, Sherzod
    Alqahtani, Salman A.
    [J]. IEEE ACCESS, 2024, 12 : 11354 - 11377