Attention-SLAM: A Visual Monocular SLAM Learning From Human Gaze

被引:34
|
作者
Li, Jinquan [1 ]
Pei, Ling [1 ]
Zou, Danping [1 ]
Xia, Songpengcheng [1 ]
Wu, Qi [1 ]
Li, Tao [1 ]
Sun, Zhen [1 ]
Yu, Wenxian [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Nav & Locat Based Serv, Shanghai 200240, Peoples R China
关键词
Simultaneous localization and mapping; Visualization; Semantics; Feature extraction; Data mining; Predictive models; Adaptation models; Visual sailency; monocular visual semantic SLAM; weighted bundle adjustment; SIMULTANEOUS LOCALIZATION; ODOMETRY;
D O I
10.1109/JSEN.2020.3038432
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a novel simultaneous localization and mapping (SLAM) approach, namely Attention-SLAM, which simulates human navigation mode by combining a visual saliency model (SalNavNet) with traditional monocular visual SLAM. Firstly a visual saliency model namely SalNavNet is proposed. In SalNavNet, we introduce a correlation module and propose an adaptive Exponential Moving Average (EMA) module. These modules mitigate the center bias, which most current saliency models have. This novel idea enables the saliency maps generated by SalNavNet to pay more attention to the same salient object. An open-source saliency SLAM dataset namely Salient-Euroc is published, it consists of Euroc dataset and corresponding saliency maps. Moreover, we propose a new optimization method called Weighted Bundle Adjustment (Weighted BA) in Attention-SLAM. Most SLAM methods treat all the features extracted from the images as equal importance during the optimization process. In weighted BA, the feature points extracted from the salient regions have greater importance. Comprehensive test results prove that Attention-SLAM outperforms benchmarks such as Direct Sparse Odometry (DSO), ORB-SLAM, and Salient DSO in the 7 of 11 test cases. The test cases are all indoor scenes, with varying brightness, speed, and image distortion. Compared with ORB-SLAM, our method improves the accuracy by 4% and efficiency by 6.5% on average.
引用
收藏
页码:6408 / 6420
页数:13
相关论文
共 50 条
  • [1] Monocular SLAM for visual odometry
    Munguia, Rodrigo
    Grau, Antoni
    2007 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING, CONFERENCE PROCEEDINGS BOOK, 2007, : 443 - 448
  • [2] Edge SLAM: Edge Points Based Monocular Visual SLAM
    Maity, Soumyadip
    Saha, Arindam
    Bhowmick, Brojeshwar
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2408 - 2417
  • [3] Visual SLAM for Handheld Monocular Endoscope
    Grasa, Oscar G.
    Bernal, Ernesto
    Casado, Santiago
    Gil, Ismael
    Montiel, J. M. M.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2014, 33 (01) : 135 - 146
  • [4] LIFT-SLAM: A deep-learning feature-based monocular visual SLAM method
    Silva Bruno, Hudson Martins
    Colombini, Esther Luna
    NEUROCOMPUTING, 2021, 455 : 97 - 110
  • [5] UVS: underwater visual SLAM—a robust monocular visual SLAM system for lifelong underwater operations
    Marco Leonardi
    Annette Stahl
    Edmund Førland Brekke
    Martin Ludvigsen
    Autonomous Robots, 2023, 47 : 1367 - 1385
  • [6] Observer design for monocular visual inertial SLAM
    Fink, Geoff
    Franke, Mirko
    Lynch, Alan F.
    Roebenack, Klaus
    AT-AUTOMATISIERUNGSTECHNIK, 2018, 66 (03) : 246 - 257
  • [7] Robust Large Scale Monocular Visual SLAM
    Bourmaud, Guillaume
    Megret, Remi
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1638 - 1647
  • [8] Monocular Visual SLAM for Tactical Situational Awareness
    Ruotsalainen, Laura
    Grohn, Simo
    Kirkko-Jaakkola, Martti
    Chen, Liang
    Guinness, Robert
    Kuusniemi, Heidi
    2015 INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION (IPIN), 2015,
  • [9] A novel visual-inertial Monocular SLAM
    Yue, Xiaofeng
    Zhang, Wenjuan
    Xu, Li
    Liu, JiangGuo
    MIPPR 2017: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2018, 10608
  • [10] Implementation of an update scheme for monocular visual SLAM
    Chen, Zhenhe
    Rodrigo, Ranga
    Samarabandu, Jagath
    2006 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2007, : 212 - 217