Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes

被引:1
|
作者
Zhang, Xiao Ya [1 ]
Abd Rahman, Abdul Hadi [1 ]
Qamar, Faizan [2 ]
机构
[1] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence Technol, Bangi, Malaysia
[2] Univ Kebangsaan Malaysia, Ctr Cyber Secur, Bangi, Malaysia
关键词
Simultaneous localization and mapping (SLAM); Pose estimation; Deep learning; Semantic segmentation; Dynamic scene; Moving consistency check;
D O I
10.7717/peerj-cs.1628
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Simultaneous localization and mapping (SLAM) is a fundamental problem in robotics and computer vision. It involves the task of a robot or an autonomous system navigating an unknown environment, simultaneously creating a map of the surroundings, and accurately estimating its position within that map. While significant progress has been made in SLAM over the years, challenges still need to be addressed. One prominent issue is robustness and accuracy in dynamic environments, which can cause uncertainties and errors in the estimation process. Traditional methods using temporal information to differentiate static and dynamic objects have limitations in accuracy and applicability. Nowadays, many research trends have leaned towards utilizing deep learning-based methods which leverage the capabilities to handle dynamic objects, semantic segmentation, and motion estimation, aiming to improve accuracy and adaptability in complex scenes. This article proposed an approach to enhance monocular visual odometry's robustness and precision in dynamic environments. An enhanced algorithm using the semantic segmentation algorithm DeeplabV3+ is used to identify dynamic objects in the image and then apply the motion consistency check to remove feature points belonging to dynamic objects. The remaining static feature points are then used for feature matching and pose estimation based on ORB-SLAM2 using the Technical University of Munich (TUM) dataset. Experimental results show that our method outperforms traditional visual odometry methods in accuracy and robustness, especially in dynamic environments. By eliminating the influence of moving objects, our method improves the accuracy and robustness of visual odometry in dynamic environments. Compared to the traditional ORB-SLAM2, the results show that the system significantly reduces the absolute trajectory error and the relative pose error in dynamic scenes. Our approach has significantly improved the accuracy and robustness of the SLAM system's pose estimation.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes
    Zhang, Xiao Ya
    Rahman, Abdul Hadi Abd
    Qamar, Faizan
    [J]. PeerJ Computer Science, 2023, 9
  • [2] Dynamic-SLAM: Semantic monocular visual localization and mapping based on deep learning in dynamic environment
    Xiao, Linhui
    Wang, Jinge
    Qiu, Xiaosong
    Rong, Zheng
    Zou, Xudong
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 117 : 1 - 16
  • [3] Visual Simultaneous Localization and Mapping Method of Semantic Octree Map Toward Indoor Dynamic Scenes
    Zhang Rongfen
    Yuan Wenhao
    Lu Jin
    Liu Yuhong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [4] Visual simultaneous localization and mapping (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes
    Chen, Mengyuan
    Guo, Hangrong
    Qian, Runbang
    Gong, Guangqiang
    Cheng, Hao
    [J]. MECHANICAL SCIENCES, 2024, 15 (01) : 1 - 16
  • [5] Increasing the localization accuracy of visual SLAM with semantic segmentation and motion consistency detection in dynamic scenes
    Shen, Dong
    Fang, Haoyu
    Li, Qiang
    Liu, Jiale
    Guo, Sheng
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7501 - 7512
  • [6] A 3D Semantic Visual SLAM in Dynamic Scenes
    Hu, Shanshan
    Li, Dan
    Tang, Gujie
    Xu, Xiangrong
    [J]. 2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 522 - 528
  • [7] Dynamic visual simultaneous localization and mapping based on semantic segmentation module
    Jin, Jing
    Jiang, Xufeng
    Yu, Chenhui
    Zhao, Lingna
    Tang, Zhen
    [J]. APPLIED INTELLIGENCE, 2023, 53 (16) : 19418 - 19432
  • [8] Dynamic visual simultaneous localization and mapping based on semantic segmentation module
    Jing Jin
    Xufeng Jiang
    Chenhui Yu
    Lingna Zhao
    Zhen Tang
    [J]. Applied Intelligence, 2023, 53 : 19418 - 19432
  • [9] A Lightweight Visual Simultaneous Localization and Mapping Method with a High Precision in Dynamic Scenes
    Zhang, Qi
    Yu, Wentao
    Liu, Weirong
    Xu, Hao
    He, Yuan
    [J]. SENSORS, 2023, 23 (22)
  • [10] RU-SLAM: A Robust Deep-Learning Visual Simultaneous Localization and Mapping (SLAM) System for Weakly Textured Underwater Environments
    Wang, Zhuo
    Cheng, Qin
    Mu, Xiaokai
    [J]. SENSORS, 2024, 24 (06)