Visual Simultaneous Localization and Mapping Method of Semantic Octree Map Toward Indoor Dynamic Scenes

Cited by: 0

Authors:

Zhang Rongfen [1]

Yuan Wenhao [1]

Lu Jin [1]

Liu Yuhong [1]

Affiliation:

[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China

Source:

LASER & OPTOELECTRONICS PROGRESS | 2022 / Vol. 59 / No. 18

Keywords:

simultaneous localization and mapping; moving point elimination; semantic segmentation; stepping random sampling consistent algorithm; voxel filtering; semantic octree map; SLAM

DOI:

10.3788/LOP202259.1811003

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification code:

0808; 0809

Abstract:

Traditional visual simultaneous localization and mapping (vSLAM) systems cannot effectively remove moving objects in dynamic scenes and lack semantic maps for high-level interactive applications. To address these problems, a vSLAM system scheme is proposed that removes moving objects effectively and builds semantic octree maps representing static indoor environments. First, Fast-SCNN is used as the semantic segmentation network to extract semantic information from images, while a pyramid optical flow method tracks and matches feature points. Then, the feature points are sampled in steps: a stepping random sample consensus algorithm (Multi-stage RANSAC) performs the RANSAC process several times at different scales. Next, the epipolar geometry constraint and the semantic information extracted by Fast-SCNN are combined to remove dynamic feature points from the visual odometry. Finally, the point cloud is voxel filtered to reduce redundancy and used to build a semantic octree map of the static indoor environment. Experimental results show that, on the eight highly dynamic RGB-D sequences of the public TUM RGB-D dataset, the camera performance indicators (relative displacement, relative rotation, and global trajectory errors) are improved by more than 94% compared with the ORB-SLAM2 system, and the global trajectory error is only 0.1 m. Compared with the similar DS-SLAM system, the total time for moving point elimination is reduced by 21%. In terms of map construction performance, the voxel-filtered semantic point cloud and the semantic octree map occupy 9.6 MB and 685 kB of storage, respectively. Compared with the original 17 MB point cloud, the semantic octree map occupies only 4% of the storage space, and its semantics make it usable for high-level intelligent interactive applications.
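The epipolar geometry constraint mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the fundamental matrix `F`, and the 1-pixel threshold are assumptions chosen for illustration, and the way the paper combines this test with the Fast-SCNN labels is not specified in the abstract, so it is left out. The idea is that a static point matched between two frames should lie on the epipolar line `F @ x1` in the second image, so a large distance from that line suggests the point is moving.

```python
import math


def epipolar_distance(F, p1, p2):
    """Pixel distance from p2 to the epipolar line of p1 in the second image.

    F is a 3x3 fundamental matrix (nested lists); p1 and p2 are (u, v) pixels.
    """
    x1 = (p1[0], p1[1], 1.0)
    # Epipolar line l = F x1 = (a, b, c), i.e. a*u + b*v + c = 0.
    a, b, c = (sum(F[r][k] * x1[k] for k in range(3)) for r in range(3))
    norm = math.hypot(a, b)
    if norm == 0.0:
        # Degenerate line: no usable constraint, so report an infinite distance.
        return float("inf")
    return abs(a * p2[0] + b * p2[1] + c) / norm


def flag_dynamic(F, matches, threshold_px=1.0):
    """Return True for each match whose epipolar distance exceeds the threshold.

    threshold_px is a hypothetical value, not one taken from the paper.
    """
    return [epipolar_distance(F, p1, p2) > threshold_px for p1, p2 in matches]


# Toy case: pure horizontal camera translation with identity intrinsics,
# for which the epipolar lines are the image rows (v2 == v1).
F = [[0.0, 0.0, 0.0],
     [0.0, 0.0, -1.0],
     [0.0, 1.0, 0.0]]
matches = [((10.0, 20.0), (15.0, 20.0)),   # stays on its row
           ((10.0, 20.0), (15.0, 26.0))]   # drifts 6 px off its row
flags = flag_dynamic(F, matches)
```

In the toy case the first match stays on its epipolar line and is kept, while the second is 6 pixels away and is flagged. A real pipeline would estimate `F` from the RANSAC inliers rather than fix it by hand.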

Pages: 15