NAVS: A Neural Attention-Based Visual SLAM for Autonomous Navigation in Unknown 3D Environments

被引:2
|
作者
Wu, Yu [1 ]
Chen, Niansheng [1 ]
Fan, Guangyu [1 ]
Yang, Dingyu [2 ]
Rao, Lei [1 ]
Cheng, Songlin [1 ]
Song, Xiaoyong [1 ]
Ma, Yiping [3 ]
机构
[1] Shanghai DianJi Univ, Sch Elect Informat, Shanghai 200000, Peoples R China
[2] Alibaba Grp, Shanghai 200000, Peoples R China
[3] AVIC Huadong Photoelect Shanghai Co Ltd, Shanghai 200000, Peoples R China
基金
中国国家自然科学基金;
关键词
SLAM; Navigation; Attention mechanism; Deep reinforcement learning; ACTIVE SLAM; EXPLORATION;
D O I
10.1007/s11063-024-11502-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Navigation in unknown 3D environments aims to progressively find an efficient path to a given target goal in unseen scenarios. A challenge is how to explore the navigation quickly and effectively. An end-to-end learning approach has been proposed to extract geometric shapes from RGB images, but it is not suitable for large environments due to its exhaustive exploration with exponential search space. Active Neural SLAM (ANS) presents a Neural SLAM module to maximize the exploration coverage to tackle the active SLAM task. However, ANS still frequently visits the explored areas due to the inappropriate local target selection. In this paper, we propose a Neural Attention-based Visual SLAM (NAVS) model to explore unknown 3D environments. Spatial attention is provided to quickly identify obstacles (such as similarly colored tea table or floor). We also leverage the priority of unknown regions in the short-term goal decision to avoid frequent exploration with a channel attention. The experimental results show that our model can build a more accurate map than ANS and other baseline methods with less running time. In terms of relative coverage, NAVS achieves a 0.5%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} improvement over ANS in overall and a 1.1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} improvement over ANS in large environments.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Autonomous Navigation Based on Sequential Images for Planetary Landing in Unknown Environments
    Xu, Chao
    Wang, Dayi
    Huang, Xiangyu
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (10) : 2587 - 2602
  • [42] Semantic SLAM-based Autonomous Tributary Navigation System Using 3D LiDAR Point Cloud for UAV
    Pak, Jeonghyeon
    Son, Hyoung Il
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1380 - 1382
  • [43] VISUAL NAVIGATION AND 3D RECONSTRUCTION OF UNDERWATER OBJECTS WITH AUTONOMOUS UNDERWATER VEHICLE
    Bobkov, V. A.
    Kudryashov, A. P.
    Melman, S. V.
    Scherbatyuk, A. F.
    2017 24TH SAINT PETERSBURG INTERNATIONAL CONFERENCE ON INTEGRATED NAVIGATION SYSTEMS (ICINS), 2017,
  • [44] Survey of 3D Map in SLAM: Localization and Navigation
    Yang, Aolei
    Luo, Yu
    Chen, Ling
    Xu, Yulin
    ADVANCED COMPUTATIONAL METHODS IN LIFE SYSTEM MODELING AND SIMULATION, LSMS 2017, PT I, 2017, 761 : 410 - 420
  • [45] A 3D Reactive Navigation Method for UAVs in Unknown Tunnel-like Environments
    Elmokadem, Taha
    2020 AUSTRALIAN AND NEW ZEALAND CONTROL CONFERENCE (ANZCC 2020), 2020, : 119 - 124
  • [46] Neural network based FastSLAM for autonomous robots in unknown environments
    Li, Qing-Ling
    Song, Yu
    Hou, Zeng-Guang
    NEUROCOMPUTING, 2015, 165 : 99 - 110
  • [47] MEAN: An attention-based approach for 3D mesh shape classification
    Jicheng Dai
    Rubin Fan
    Yupeng Song
    Qing Guo
    Fazhi He
    The Visual Computer, 2024, 40 : 2987 - 3000
  • [48] Attention-based 3D Object Reconstruction from a Single Image
    Salvi, Andrey
    Gavenski, Nathan
    Pooch, Eduardo
    Tasoniero, Felipe
    Barros, Rodrigo
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [49] Endoscope Navigation and 3D Reconstruction of Oral Cavity by Visual SLAM with Mitigated Data Scarcity
    Qiu, Liang
    Ren, Hongliang
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2278 - 2285
  • [50] MEAN: An attention-based approach for 3D mesh shape classification
    Dai, Jicheng
    Fan, Rubin
    Song, Yupeng
    Guo, Qing
    He, Fazhi
    VISUAL COMPUTER, 2024, 40 (04): : 2987 - 3000