Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes

被引：1

作者：

Zhang, Xiao Ya ^{[1
]}

Abd Rahman, Abdul Hadi ^{[1
]}

Qamar, Faizan ^{[2
]}

机构：

[1] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence Technol, Bangi, Malaysia

[2] Univ Kebangsaan Malaysia, Ctr Cyber Secur, Bangi, Malaysia

来源：

PEERJ COMPUTER SCIENCE | 2023年 / 9卷

关键词：

Simultaneous localization and mapping (SLAM); Pose estimation; Deep learning; Semantic segmentation; Dynamic scene; Moving consistency check;

D O I：

10.7717/peerj-cs.1628

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Simultaneous localization and mapping (SLAM) is a fundamental problem in robotics and computer vision. It involves the task of a robot or an autonomous system navigating an unknown environment, simultaneously creating a map of the surroundings, and accurately estimating its position within that map. While significant progress has been made in SLAM over the years, challenges still need to be addressed. One prominent issue is robustness and accuracy in dynamic environments, which can cause uncertainties and errors in the estimation process. Traditional methods using temporal information to differentiate static and dynamic objects have limitations in accuracy and applicability. Nowadays, many research trends have leaned towards utilizing deep learning-based methods which leverage the capabilities to handle dynamic objects, semantic segmentation, and motion estimation, aiming to improve accuracy and adaptability in complex scenes. This article proposed an approach to enhance monocular visual odometry's robustness and precision in dynamic environments. An enhanced algorithm using the semantic segmentation algorithm DeeplabV3+ is used to identify dynamic objects in the image and then apply the motion consistency check to remove feature points belonging to dynamic objects. The remaining static feature points are then used for feature matching and pose estimation based on ORB-SLAM2 using the Technical University of Munich (TUM) dataset. Experimental results show that our method outperforms traditional visual odometry methods in accuracy and robustness, especially in dynamic environments. By eliminating the influence of moving objects, our method improves the accuracy and robustness of visual odometry in dynamic environments. Compared to the traditional ORB-SLAM2, the results show that the system significantly reduces the absolute trajectory error and the relative pose error in dynamic scenes. Our approach has significantly improved the accuracy and robustness of the SLAM system's pose estimation.

引用

页数：21

共 50 条

[1] Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes
Zhang, Xiao Ya
Rahman, Abdul Hadi Abd
Qamar, Faizan
[J]. PeerJ Computer Science, 2023, 9
[2] Dynamic-SLAM: Semantic monocular visual localization and mapping based on deep learning in dynamic environment
Xiao, Linhui
Wang, Jinge
Qiu, Xiaosong
Rong, Zheng
Zou, Xudong
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 117 : 1 - 16
[3] Visual Simultaneous Localization and Mapping Method of Semantic Octree Map Toward Indoor Dynamic Scenes
Zhang Rongfen
Yuan Wenhao
Lu Jin
Liu Yuhong
[J]. LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
[4] Visual simultaneous localization and mapping (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes
Chen, Mengyuan
Guo, Hangrong
Qian, Runbang
Gong, Guangqiang
Cheng, Hao
[J]. MECHANICAL SCIENCES, 2024, 15 (01) : 1 - 16
[5] Increasing the localization accuracy of visual SLAM with semantic segmentation and motion consistency detection in dynamic scenes
Shen, Dong
Fang, Haoyu
Li, Qiang
Liu, Jiale
Guo, Sheng
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7501 - 7512
[6] A 3D Semantic Visual SLAM in Dynamic Scenes
Hu, Shanshan
Li, Dan
Tang, Gujie
Xu, Xiangrong
[J]. 2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 522 - 528
[7] Dynamic visual simultaneous localization and mapping based on semantic segmentation module
Jin, Jing
Jiang, Xufeng
Yu, Chenhui
Zhao, Lingna
Tang, Zhen
[J]. APPLIED INTELLIGENCE, 2023, 53 (16) : 19418 - 19432
[8] Dynamic visual simultaneous localization and mapping based on semantic segmentation module
Jing Jin
Xufeng Jiang
Chenhui Yu
Lingna Zhao
Zhen Tang
[J]. Applied Intelligence, 2023, 53 : 19418 - 19432
[9] A Lightweight Visual Simultaneous Localization and Mapping Method with a High Precision in Dynamic Scenes
Zhang, Qi
Yu, Wentao
Liu, Weirong
Xu, Hao
He, Yuan
[J]. SENSORS, 2023, 23 (22)
[10] RU-SLAM: A Robust Deep-Learning Visual Simultaneous Localization and Mapping (SLAM) System for Weakly Textured Underwater Environments
Wang, Zhuo
Cheng, Qin
Mu, Xiaokai
[J]. SENSORS, 2024, 24 (06)

← 1 2 3 4 5 →