State Super Sampling Soft Actor-Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment

被引：2

作者：

Wang, Zhuo ^{[1
]}

Sui, Yancheng ^{[1
]}

Qin, Hongde ^{[1
]}

Lu, Hao ^{[1
]}

机构：

[1] Harbin Engn Univ, Sch Naval Engn, Harbin 150001, Peoples R China

来源：

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2023年 / 11卷 / 07期

关键词：

multi-agent reinforcement learning; Soft Actor-Critic; generative adversarial networks; multiple autonomous underwater vehicle hunting;

D O I：

10.3390/jmse11071257

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Reinforcement learning (RL) is known for its efficiency and practicality in single-agent planning, but it faces numerous challenges when applied to multi-agent scenarios. In this paper, a Super Sampling Info-GAN (SSIG) algorithm based on Generative Adversarial Networks (GANs) is proposed to address the problem of state instability in Multi-Agent Reinforcement Learning (MARL). The SSIG model allows a pair of GAN networks to analyze the previous state of dynamic system and predict the future state of consecutive state pairs. A multi-agent system (MAS) can deduce the complete state of all collaborating agents through SSIG. The proposed model has the potential to be employed in multi-autonomous underwater vehicle (multi-AUV) planning scenarios by combining it with the Soft Actor-Critic (SAC) algorithm. Hence, this paper presents State Super Sampling Soft Actor-Critic (S4AC), which is a new algorithm that combines the advantages of SSIG and SAC and can be applied to Multi-AUV hunting tasks. The simulation results demonstrate that the proposed algorithm has strong learning ability and adaptability and has a considerable success rate in hunting the evading target in multiple testing scenarios.

引用

页数：23

共 50 条

[31] Manoeuvring of underwater snake robot with tail thrust using the actor-critic neural network super-twisting sliding mode control in the uncertain environment and disturbances
Patel, Bhavik M.
Dwivedy, Santosha K.
[J]. NEURAL COMPUTING & APPLICATIONS, 2023,
[32] Potential field hierarchical reinforcement learning approach for target search by multi-AUV in 3-D underwater environments
Cao, Xiang
Sun, Hongbing
Guo, Liqiang
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2020, 93 (07) : 1677 - 1683
[33] Improved RRT Algorithm for AUV Target Search in Unknown 3D Environment
Li, Juan
Li, Chengyue
Chen, Tao
Zhang, Yun
[J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (06)
[34] Soft Actor-Critic Based 3-D Deployment and Power Allocation in Cell-Free Unmanned Aerial Vehicle Networks
Xu, Fanfei
Ruan, Yuhan
Li, Yongzhao
[J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (10) : 1692 - 1696
[35] Optimal dynamic thermal management for data center via soft actor-critic algorithm with dynamic control interval and combined-value state space
Guo, Yuxiang
Qu, Shengli
Wang, Chuang
Xing, Ziwen
Duan, Kaiwen
[J]. APPLIED ENERGY, 2024, 373
[36] Coverage Optimization Algorithm Based on Sampling for 3D Underwater Sensor Networks
Du Xiaoyu
Sun Lijuan
Liu Linfeng
[J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2013,
[37] Obstacle Avoidance Control of Multi-AUV Formation with Third-Order Dynamics Based on IAPF in 3D Environments
Wang, Linling
Zhu, Daqi
Pang, Wen
[J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3711 - 3716
[38] Multi-AUV dynamic trajectory optimization and collaborative search combined with task urgency and energy consumption scheduling in 3-D underwater environment with random ocean currents and uncertain obstacles
Bai, Guiqiang
Chen, Yanli
Hu, Xinyu
Shi, Yu
Jiang, Wenwen
Zhang, Xueqing
[J]. OCEAN ENGINEERING, 2023, 275
[39] Automatic Delineation of the 3D Left Atrium From LGE-MRI: Actor-Critic Based Detection and Semi-Supervised Segmentation
Xiang, Shun
Li, Nana
Wang, Yuanquan
Zhou, Shoujun
Wei, Jin
Li, Shuo
[J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (06) : 3545 - 3556
[40] Multi-Objective Prioritized Task Scheduler Using Improved Asynchronous Advantage Actor Critic (a3c) Algorithm in Multi Cloud Environment
Mangalampalli, S. Sudheer
Karri, Ganesh Reddy
Mohanty, Sachi Nandan
Ali, Shahid
Ijaz Khan, Muhammad
Abdullaev, Sherzod
Alqahtani, Salman A.
[J]. IEEE ACCESS, 2024, 12 : 11354 - 11377

← 1 2 3 4 5 →