Improved Transformer Instance Segmentation Under Dynamic Occlusion Based VSLAM Algorithm

被引:0
|
作者
Chen M.-Y. [1 ,2 ,3 ]
Han P.-P. [1 ]
Liu J.-H. [1 ]
Zhang Y.-K. [1 ]
Jiang H.-W. [1 ]
Ding L.-M. [1 ]
机构
[1] School of Electrical Engineering, Anhui Polytechnic University, Anhui, Wuhu
[2] Key Laboratory of Advanced Perception and Intelligent Control of High-End Equipment, Ministry of Education, Anhui, Wuhu
[3] Industry Innovation Technology Co.,Ltd., Anhui Polytechnic University, Anhui, Wuhu
来源
基金
中国国家自然科学基金;
关键词
background repair; dynamic environment; instance segmentation; motion judgment; object occlusion; simultaneous localization and mapping;
D O I
10.12263/DZXB.20220310
中图分类号
学科分类号
摘要
For traditional SLAM (Simultaneous Localization And Mapping) algorithms, it is difficult to mark occluded objects in dynamic scenes with occlusion, and is impossible to accurately judge the motion state of potential objects as well as the number of feature points after culling dynamic objects is small. This paper proposes a VSLAM algorithm based on improved transformer instance segmentation under dynamic occlusion (ITD-SLAM) in dynamic occlusion scenarios. By designing a multi-attention module, this algorithm guides the model to pay attention to the occluded area, and at the same time improves the relative position encoding to optimize the boundary semantics of occluded objects, and accurately mark potential dynamic objects. In order to reduce the influence of dynamic objects on the positioning accuracy of the SLAM system, the motion state of potential dynamic objects is estimated through three steps of camera pose estimation, object motion estimation and object motion judgment, and dynamic objects are eliminated. According to the grid flow motion model, the static background of the culled area is completed, and the feature points of the repair area are screened and repaired by information entropy, and the high-quality feature points are supplemented for camera pose estimation. Experimental results on the public datasets show that this algorithm has better composition ability with its root mean square error reduced by 22.94% when compared with DynaSLAM. © 2023 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:1812 / 1825
页数:13
相关论文
共 26 条
  • [1] CHEN M Y, DING L M, ZHANG Y K., Fast PL-SLAM algorithm based on improved keyframe extraction strategy, Acta Electronica Sinica, 50, 3, pp. 608-618, (2022)
  • [2] LI B Y, LIU S J, CUI M Y, Et al., Multi-vehicle collaborative SLAM framework for minimum loop detection, Acta Electronica Sinica, 49, 11, pp. 2241-2250, (2021)
  • [3] GAO X B, SHI X H, GE Q F, Et al., A survey of visual SLAM for scenes with dynamic objects, Robot, 43, 6, pp. 733-750, (2021)
  • [4] CAO J F, YU J C, PAN S J, Et al., A SLAM pose graph optimization method using dual visual odometry, Journal of Computer-Aided Design & Computer Graphics, 33, 8, pp. 1264-1272, (2021)
  • [5] CHEN B H, DENG L, CHEN Z X, Et al., Instant dense 3D reconstruction based UAV vision localization, Acta Electronica Sinica, 45, 6, pp. 1294-1300, (2017)
  • [6] LUO H L, CHEN H K., Survey of object detection based on deep learning, Acta Electronica Sinica, 48, 6, pp. 1230-1239, (2020)
  • [7] MUR-ARTAL R, MONTIEL J M M, TARDOS J D., ORB-SLAM: A versatile and accurate monocular SLAM system, IEEE Transactions on Robotics, 31, 5, pp. 1147-1163, (2015)
  • [8] ENGEL J, KOLTUN V, CREMERS D., Direct sparse odometry, IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 3, pp. 611-625, (2017)
  • [9] LIU J F, SUN L F, PU J X, Et al., Cooperative localization in a team of two mobile robots based on rigid constraints, Acta Electronica Sinica, 48, 9, pp. 1777-1785, (2020)
  • [10] ZHANG H J, FANG Z J, YANG G L., RGB-D visual odometry in dynamic environments using line features, Robot, 41, 1, pp. 75-82, (2019)