Visual simultaneous localization and mapping (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes

被引:0
|
作者
Chen, Mengyuan [1 ,2 ]
Guo, Hangrong [1 ]
Qian, Runbang [1 ]
Gong, Guangqiang [1 ]
Cheng, Hao [1 ]
机构
[1] Anhui Polytech Univ, Sch Elect Engn, Wuhu, Peoples R China
[2] Minist Educ, Key Lab Adv Percept & Intelligent Control High End, Hefei, Anhui, Peoples R China
关键词
D O I
10.5194/ms-15-1-2024
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Identifying dynamic objects in dynamic scenes remains a challenge for traditional simultaneous localization and mapping (SLAM) algorithms. Additionally, these algorithms are not able to adequately inpaint the culling regions that result from excluding dynamic objects. In light of these challenges, this study proposes a novel visual SLAM (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes (VTD-SLAM), which leverages an improved Vision Transformer semantic segmentation technique to address these limitations. Specifically, VTD-SLAM utilizes a residual dual-pyramid backbone network to extract dynamic object region features and a multiclass feature transformer segmentation module to enhance the pixel weight of potential dynamic objects and to improve global semantic information for precise identification of potential dynamic objects. The method of multi-view geometry is applied to judge and remove the dynamic objects. Meanwhile, according to static information in the adjacent frames, the optimal nearest-neighbor pixel-matching method is applied to restore the static background, where the feature points are extracted for pose estimation. With validation in the public dataset TUM (The Entrepreneurial University Dataset) and real scenarios, the experimental results show that the root-mean-square error of the algorithm is reduced by 17.1 % compared with dynamic SLAM (DynaSLAM), which shows better map composition capability.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] Dynamic visual simultaneous localization and mapping based on semantic segmentation module
    Jin, Jing
    Jiang, Xufeng
    Yu, Chenhui
    Zhao, Lingna
    Tang, Zhen
    [J]. APPLIED INTELLIGENCE, 2023, 53 (16) : 19418 - 19432
  • [2] Dynamic visual simultaneous localization and mapping based on semantic segmentation module
    Jing Jin
    Xufeng Jiang
    Chenhui Yu
    Lingna Zhao
    Zhen Tang
    [J]. Applied Intelligence, 2023, 53 : 19418 - 19432
  • [3] Improved Transformer Instance Segmentation Under Dynamic Occlusion Based VSLAM Algorithm
    Chen, Meng-Yuan
    Han, Peng-Peng
    Liu, Jin-Hui
    Zhang, Yu-Kun
    Jiang, Hao-Wei
    Ding, Ling-Mei
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (07): : 1812 - 1825
  • [4] Improved simultaneous localization and mapping algorithm combined with semantic segmentation model
    Cui, Xuerong
    Xue, Shengjie
    Li, Juan
    Li, Shibao
    Liu, Jianhang
    Chen, Haihua
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2021, 17 (04)
  • [5] A Review on Vision Simultaneous Localization and Mapping (VSLAM)
    Makhubela, Jabulani K.
    Zuva, Tranos
    Agunbiade, Olusanya Yinka
    [J]. 2018 INTERNATIONAL CONFERENCE ON INTELLIGENT AND INNOVATIVE COMPUTING APPLICATIONS (ICONIC), 2018, : 572 - 576
  • [6] Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes
    Zhang, Xiao Ya
    Rahman, Abdul Hadi Abd
    Qamar, Faizan
    [J]. PeerJ Computer Science, 2023, 9
  • [7] Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes
    Zhang, Xiao Ya
    Abd Rahman, Abdul Hadi
    Qamar, Faizan
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [8] Robust Vision-based Simultaneous Localization and Mapping for Highly Dynamic Scenes
    Zhang, Zijian
    Lei, Qiaoyu
    Li, Chao
    Zhuang, Zhipeng
    Yan, Bo
    [J]. 2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 221 - 228
  • [9] Visual Simultaneous Localization and Mapping Method of Semantic Octree Map Toward Indoor Dynamic Scenes
    Zhang Rongfen
    Yuan Wenhao
    Lu Jin
    Liu Yuhong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [10] VSLAM algorithm based on instance segmentation and motion consistency constraints in dynamic scenes
    Chen, Mengyuan
    Qian, Runbang
    Guo, Hangrong
    Gong, Guangqiang
    [J]. Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2023, 31 (10): : 986 - 995