NAVS: A Neural Attention-Based Visual SLAM for Autonomous Navigation in Unknown 3D Environments

被引：2

作者：

Wu, Yu ^{[1
]}

Chen, Niansheng ^{[1
]}

Fan, Guangyu ^{[1
]}

Yang, Dingyu ^{[2
]}

Rao, Lei ^{[1
]}

Cheng, Songlin ^{[1
]}

Song, Xiaoyong ^{[1
]}

Ma, Yiping ^{[3
]}

机构：

[1] Shanghai DianJi Univ, Sch Elect Informat, Shanghai 200000, Peoples R China

[2] Alibaba Grp, Shanghai 200000, Peoples R China

[3] AVIC Huadong Photoelect Shanghai Co Ltd, Shanghai 200000, Peoples R China

来源：

NEURAL PROCESSING LETTERS | 2024年 / 56卷 / 02期

基金：

中国国家自然科学基金;

关键词：

SLAM; Navigation; Attention mechanism; Deep reinforcement learning; ACTIVE SLAM; EXPLORATION;

D O I：

10.1007/s11063-024-11502-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Navigation in unknown 3D environments aims to progressively find an efficient path to a given target goal in unseen scenarios. A challenge is how to explore the navigation quickly and effectively. An end-to-end learning approach has been proposed to extract geometric shapes from RGB images, but it is not suitable for large environments due to its exhaustive exploration with exponential search space. Active Neural SLAM (ANS) presents a Neural SLAM module to maximize the exploration coverage to tackle the active SLAM task. However, ANS still frequently visits the explored areas due to the inappropriate local target selection. In this paper, we propose a Neural Attention-based Visual SLAM (NAVS) model to explore unknown 3D environments. Spatial attention is provided to quickly identify obstacles (such as similarly colored tea table or floor). We also leverage the priority of unknown regions in the short-term goal decision to avoid frequent exploration with a channel attention. The experimental results show that our model can build a more accurate map than ANS and other baseline methods with less running time. In terms of relative coverage, NAVS achieves a 0.5%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} improvement over ANS in overall and a 1.1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} improvement over ANS in large environments.

引用

页数：21

共 50 条

[41] Autonomous Navigation Based on Sequential Images for Planetary Landing in Unknown Environments
Xu, Chao
Wang, Dayi
Huang, Xiangyu
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (10) : 2587 - 2602
[42] Semantic SLAM-based Autonomous Tributary Navigation System Using 3D LiDAR Point Cloud for UAV
Pak, Jeonghyeon
Son, Hyoung Il
2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1380 - 1382
[43] VISUAL NAVIGATION AND 3D RECONSTRUCTION OF UNDERWATER OBJECTS WITH AUTONOMOUS UNDERWATER VEHICLE
Bobkov, V. A.
Kudryashov, A. P.
Melman, S. V.
Scherbatyuk, A. F.
2017 24TH SAINT PETERSBURG INTERNATIONAL CONFERENCE ON INTEGRATED NAVIGATION SYSTEMS (ICINS), 2017,
[44] Survey of 3D Map in SLAM: Localization and Navigation
Yang, Aolei
Luo, Yu
Chen, Ling
Xu, Yulin
ADVANCED COMPUTATIONAL METHODS IN LIFE SYSTEM MODELING AND SIMULATION, LSMS 2017, PT I, 2017, 761 : 410 - 420
[45] A 3D Reactive Navigation Method for UAVs in Unknown Tunnel-like Environments
Elmokadem, Taha
2020 AUSTRALIAN AND NEW ZEALAND CONTROL CONFERENCE (ANZCC 2020), 2020, : 119 - 124
[46] Neural network based FastSLAM for autonomous robots in unknown environments
Li, Qing-Ling
Song, Yu
Hou, Zeng-Guang
NEUROCOMPUTING, 2015, 165 : 99 - 110
[47] MEAN: An attention-based approach for 3D mesh shape classification
Jicheng Dai
Rubin Fan
Yupeng Song
Qing Guo
Fazhi He
The Visual Computer, 2024, 40 : 2987 - 3000
[48] Attention-based 3D Object Reconstruction from a Single Image
Salvi, Andrey
Gavenski, Nathan
Pooch, Eduardo
Tasoniero, Felipe
Barros, Rodrigo
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[49] Endoscope Navigation and 3D Reconstruction of Oral Cavity by Visual SLAM with Mitigated Data Scarcity
Qiu, Liang
Ren, Hongliang
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2278 - 2285
[50] MEAN: An attention-based approach for 3D mesh shape classification
Dai, Jicheng
Fan, Rubin
Song, Yupeng
Guo, Qing
He, Fazhi
VISUAL COMPUTER, 2024, 40 (04): : 2987 - 3000

← 1 2 3 4 5 →