Attention-SLAM: A Visual Monocular SLAM Learning From Human Gaze

被引:34
|
作者
Li, Jinquan [1 ]
Pei, Ling [1 ]
Zou, Danping [1 ]
Xia, Songpengcheng [1 ]
Wu, Qi [1 ]
Li, Tao [1 ]
Sun, Zhen [1 ]
Yu, Wenxian [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Nav & Locat Based Serv, Shanghai 200240, Peoples R China
关键词
Simultaneous localization and mapping; Visualization; Semantics; Feature extraction; Data mining; Predictive models; Adaptation models; Visual sailency; monocular visual semantic SLAM; weighted bundle adjustment; SIMULTANEOUS LOCALIZATION; ODOMETRY;
D O I
10.1109/JSEN.2020.3038432
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a novel simultaneous localization and mapping (SLAM) approach, namely Attention-SLAM, which simulates human navigation mode by combining a visual saliency model (SalNavNet) with traditional monocular visual SLAM. Firstly a visual saliency model namely SalNavNet is proposed. In SalNavNet, we introduce a correlation module and propose an adaptive Exponential Moving Average (EMA) module. These modules mitigate the center bias, which most current saliency models have. This novel idea enables the saliency maps generated by SalNavNet to pay more attention to the same salient object. An open-source saliency SLAM dataset namely Salient-Euroc is published, it consists of Euroc dataset and corresponding saliency maps. Moreover, we propose a new optimization method called Weighted Bundle Adjustment (Weighted BA) in Attention-SLAM. Most SLAM methods treat all the features extracted from the images as equal importance during the optimization process. In weighted BA, the feature points extracted from the salient regions have greater importance. Comprehensive test results prove that Attention-SLAM outperforms benchmarks such as Direct Sparse Odometry (DSO), ORB-SLAM, and Salient DSO in the 7 of 11 test cases. The test cases are all indoor scenes, with varying brightness, speed, and image distortion. Compared with ORB-SLAM, our method improves the accuracy by 4% and efficiency by 6.5% on average.
引用
收藏
页码:6408 / 6420
页数:13
相关论文
共 50 条
  • [21] AN EFFICIENT AND ROBUST FRAMEWORK FOR COLLABORATIVE MONOCULAR VISUAL SLAM
    Das, Dipanjan
    Maity, Soumyadip
    Dhara, Bibhas Chandra
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [22] Stationary Detector for Monocular Visual-Inertial SLAM
    Guillemard, Richard
    Helenon, Francois
    Petit, Bruno
    Gay-Bellile, Vincent
    Carrier, Mathieu
    2019 INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION (IPIN), 2019,
  • [23] Visual Appearance Analysis of Forest Scenes for Monocular SLAM
    Garforth, James
    Webb, Barbara
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 1794 - 1800
  • [24] Monocular Visual SLAM with Robust and Efficient Line Features
    Long, Limin
    Yang, Jinfu
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2325 - 2330
  • [25] INVESTIGATION OF THE CHALLENGES OF UNDERWATER-VISUAL-MONOCULAR-SLAM
    Grimaldi, Michele
    Nakath, David
    She, Mengkun
    Koeser, Kevin
    GEOSPATIAL WEEK 2023, VOL. 10-1, 2023, : 1113 - 1121
  • [26] A Survey of UAV Visual Navigation Based on Monocular SLAM
    Wei, Wenle
    Tan, Linin
    Jin, Guodong
    Lu, Libin
    Sun, Changjiang
    PROCEEDINGS OF 2018 IEEE 4TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2018), 2018, : 1849 - 1853
  • [27] DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras
    Teed, Zachary
    Deng, Jia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [28] Attentional Landmarks and Active Gaze Control for Visual SLAM
    Frintrop, Simone
    Jensfelt, Patric
    IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (05) : 1054 - 1065
  • [29] Differentiable SLAM-net: Learning Particle SLAM for Visual Navigation
    Karkus, Peter
    Cai, Shaojun
    Hsu, David
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2814 - 2824
  • [30] Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM
    Sheng, Lu
    Xu, Dan
    Ouyang, Wanli
    Wang, Xiaogang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4301 - 4310