State Super Sampling Soft Actor-Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment

被引:2
|
作者
Wang, Zhuo [1 ]
Sui, Yancheng [1 ]
Qin, Hongde [1 ]
Lu, Hao [1 ]
机构
[1] Harbin Engn Univ, Sch Naval Engn, Harbin 150001, Peoples R China
关键词
multi-agent reinforcement learning; Soft Actor-Critic; generative adversarial networks; multiple autonomous underwater vehicle hunting;
D O I
10.3390/jmse11071257
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Reinforcement learning (RL) is known for its efficiency and practicality in single-agent planning, but it faces numerous challenges when applied to multi-agent scenarios. In this paper, a Super Sampling Info-GAN (SSIG) algorithm based on Generative Adversarial Networks (GANs) is proposed to address the problem of state instability in Multi-Agent Reinforcement Learning (MARL). The SSIG model allows a pair of GAN networks to analyze the previous state of dynamic system and predict the future state of consecutive state pairs. A multi-agent system (MAS) can deduce the complete state of all collaborating agents through SSIG. The proposed model has the potential to be employed in multi-autonomous underwater vehicle (multi-AUV) planning scenarios by combining it with the Soft Actor-Critic (SAC) algorithm. Hence, this paper presents State Super Sampling Soft Actor-Critic (S4AC), which is a new algorithm that combines the advantages of SSIG and SAC and can be applied to Multi-AUV hunting tasks. The simulation results demonstrate that the proposed algorithm has strong learning ability and adaptability and has a considerable success rate in hunting the evading target in multiple testing scenarios.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Hunting Algorithm for Multi-AUV Based on Dynamic Prediction of Target Trajectory in 3D Underwater Environment
    Cao, Xiang
    Xu, Xinyuan
    [J]. IEEE ACCESS, 2020, 8 : 138529 - 138538
  • [2] A multi-AUV cooperative hunting method in 3-D underwater environment with obstacle
    Huang, Zongrui
    Zhu, Daqi
    Sun, Bing
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 50 : 192 - 200
  • [3] A cooperative hunting algorithm of multi-AUV in 3-D dynamic environment
    Huang, Zongrui
    Zhu, Daqi
    [J]. 2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 2571 - 2575
  • [4] A NOVEL COOPERATIVE HUNTING ALGORITHM FOR MULTI-AUV IN UNDERWATER ENVIRONMENTS
    Cao, Xiang
    Sun, Hongbing
    Xu, Xinyuan
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2020, 35 (06): : 425 - 435
  • [5] A Cooperative Hunting Method for Multi-AUV Swarm in Underwater Weak Information Environment with Obstacles
    Zhao, Zhenyi
    Hu, Qiao
    Feng, Haobo
    Feng, Xinglong
    Su, Wenbin
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (09)
  • [6] Multi-AUV Cooperative Target Hunting based on Improved Potential Field in Underwater Environment
    Cao, Xiang
    Sun, Changyin
    [J]. PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 118 - 122
  • [7] Multi-AUV Collaborative Hunting Method for the Non-cooperative Target in Underwater Environment
    Cai, Lei
    Zhou, Guangfu
    Zhang, Shujing
    [J]. 2018 3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (IEEE ICARM), 2018, : 72 - 76
  • [8] Distributional Soft Actor-Critic-Based Multi-AUV Cooperative Pursuit for Maritime Security Protection
    Hou, Yun
    Han, Guangjie
    Zhang, Fan
    Lin, Chuan
    Peng, Jinlin
    Liu, Li
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 6049 - 6060
  • [9] Formation control and obstacle avoidance of Multi-AUV for 3-D underwater environment
    Ding Guohua
    Zhu Daqi
    Sun Bing
    [J]. 2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8347 - 8352
  • [10] Multi-AUV Cooperative Target Search Algorithm in 3-D Underwater Workspace
    Cao, Xiang
    Yu, A-long
    [J]. JOURNAL OF NAVIGATION, 2017, 70 (06): : 1293 - 1311