共 50 条
Target Search Control of AUV in Underwater Environment With Deep Reinforcement Learning
被引:40
|作者:
Cao, Xiang
[1
,2
]
Sun, Changyin
[2
]
Yan, Mingzhong
[3
]
机构:
[1] Huaiyin Normal Univ, Sch Phys & Elect Elect Engn, Huaian 223300, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Shanghai Maritime Univ, Lab Underwater Vehicles & Intelligent Syst, Shanghai 201306, Peoples R China
来源:
基金:
中国博士后科学基金;
中国国家自然科学基金;
关键词:
Target search;
frontier exploration;
deep learning;
reinforcement learning;
SIMULTANEOUS LOCALIZATION;
ROBOT;
D O I:
10.1109/ACCESS.2019.2929120
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
The autonomous underwater vehicle (AUV) is widely used to search for unknown targets in the complex underwater environment. Due to the unpredictability of the underwater environment, this paper combines the traditional frontier exploration method with deep reinforcement learning (DRL) to enable the AUV to explore the unknown underwater environment autonomously. In this paper, a grid map of the search environment is built by the grid method. The designed asynchronous advantage actor-critic (A3C) network structure is used in the traditional frontier exploration method for target search tasks. This network structure enables the AUV to learn from its own experience and generate search strategies for the various unknown environment. At the same time, DRL and dual-stream Q-learning algorithms are used for AUV navigation to further optimize the search path. The simulations and experiments in an unknown underwater environment with different layouts show that the proposed algorithm can accomplish target search tasks with a high success rate, and it can adapt to different environments. In addition, compared to other search methods, the frontier exploration algorithm based on DRL can search a wider environment faster, which results in a higher search efficiency and reduced search time.
引用
收藏
页码:96549 / 96559
页数:11
相关论文