Visual simultaneous localization and mapping (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes

被引:0
|
作者
Chen, Mengyuan [1 ,2 ]
Guo, Hangrong [1 ]
Qian, Runbang [1 ]
Gong, Guangqiang [1 ]
Cheng, Hao [1 ]
机构
[1] Anhui Polytech Univ, Sch Elect Engn, Wuhu, Peoples R China
[2] Minist Educ, Key Lab Adv Percept & Intelligent Control High End, Hefei, Anhui, Peoples R China
关键词
D O I
10.5194/ms-15-1-2024
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Identifying dynamic objects in dynamic scenes remains a challenge for traditional simultaneous localization and mapping (SLAM) algorithms. Additionally, these algorithms are not able to adequately inpaint the culling regions that result from excluding dynamic objects. In light of these challenges, this study proposes a novel visual SLAM (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes (VTD-SLAM), which leverages an improved Vision Transformer semantic segmentation technique to address these limitations. Specifically, VTD-SLAM utilizes a residual dual-pyramid backbone network to extract dynamic object region features and a multiclass feature transformer segmentation module to enhance the pixel weight of potential dynamic objects and to improve global semantic information for precise identification of potential dynamic objects. The method of multi-view geometry is applied to judge and remove the dynamic objects. Meanwhile, according to static information in the adjacent frames, the optimal nearest-neighbor pixel-matching method is applied to restore the static background, where the feature points are extracted for pose estimation. With validation in the public dataset TUM (The Entrepreneurial University Dataset) and real scenarios, the experimental results show that the root-mean-square error of the algorithm is reduced by 17.1 % compared with dynamic SLAM (DynaSLAM), which shows better map composition capability.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [31] Fusing Semantic Segmentation and Object Detection for Visual SLAM in Dynamic Scenes
    Yu, Peilin
    Guo, Chi
    Liu, Yang
    Zhang, Huyin
    [J]. PROCEEDINGS OF 27TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2021, 2021,
  • [32] A visual simultaneous localization and mapping approach based on scene segmentation and incremental optimization
    Zhang, Xiaoguo
    Liu, Qihan
    Zheng, Bingqing
    Wang, Huiqing
    Wang, Qing
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (06)
  • [33] A Simultaneous Localization and Mapping Algorithm of Mobile Robot Based on Improved FastSLAM
    Xia, Yi-min
    Yang, Yi-min
    [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 88 - 92
  • [34] Simultaneous Localization and Mapping Algorithm Based on the Asynchronous Fusion of Laser and Vision Sensors
    Xing, Kexin
    Zhang, Xingsheng
    Lin, Yegui
    Ci, Wenqi
    Dong, Wei
    [J]. FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [35] Visual SLAM based on instance segmentation in dynamic scenes
    Yan, Zhe
    Chu, Shuchun
    Deng, Liwei
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2021, 32 (09)
  • [36] Monocular SLAM System in Dynamic Scenes Based on Semantic Segmentation
    Sheng, Chao
    Pan, Shuguo
    Zeng, Pan
    Huang, Lixiao
    Zhao, Tao
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 593 - 603
  • [37] Multimodal Semantic Segmentation Based On Improved Vision Transformers
    Qi, Weimin
    Chen, Hangong
    Wang, Zhiming
    Wang, Meng
    [J]. PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 565 - 569
  • [38] On Improved Simultaneous Localization and Mapping Algorithm for Underwater Navigation
    Cao Menglong
    Li Zhen
    Li Feifei
    [J]. PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 5284 - 5287
  • [39] Simultaneous Localization and Mapping for Mobile Robot Based on an Improved Particle Filter Algorithm
    Wang, Zhong Min
    Miao, De Hua
    Du, Zhi Jiang
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 1106 - +
  • [40] RTS-vSLAM: Real-Time Visual Semantic Tracking and Mapping Under Dynamic Environments
    Liu, Yubao
    Miura, Jun
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 16, IAS-16, 2022, 412 : 29 - 40