Real-Time Globally Consistent 3D Reconstruction With Semantic Priors

Cited by: 10
Authors
Huang, Shi-Sheng [1]
Chen, Haoxiang [2]
Huang, Jiahui [2]
Fu, Hongbo [3]
Hu, Shi-Min [2]
Affiliations
[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, BNRist, Beijing 100190, Peoples R China
[3] City Univ Hong Kong, Sch Creat Media, Hong Kong, Peoples R China
Keywords
Three-dimensional displays; Semantics; Cameras; Pose estimation; Real-time systems; Geometry; Simultaneous localization and mapping; 3D reconstruction; semantic fusion; semantic tracker; semantic pose graph
DOI
10.1109/TVCG.2021.3137912
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Maintaining global consistency continues to be critical for online 3D indoor scene reconstruction. However, it is still challenging to generate satisfactory 3D reconstruction in terms of global consistency for previous approaches using purely geometric analysis, even with bundle adjustment or loop closure techniques. In this article, we propose a novel real-time 3D reconstruction approach which effectively integrates both semantic and geometric cues. The key challenge is how to map this indicative information, i.e., semantic priors, into a metric space as measurable information, thus enabling more accurate semantic fusion leveraging both the geometric and semantic cues. To this end, we introduce a semantic space with a continuous metric function measuring the distance between discrete semantic observations. Within the semantic space, we present an accurate frame-to-model semantic tracker for camera pose estimation, and semantic pose graph equipped with semantic links between submaps for globally consistent 3D scene reconstruction. With extensive evaluation on public synthetic and real-world 3D indoor scene RGB-D datasets, we show that our approach outperforms the previous approaches for 3D scene reconstruction both quantitatively and qualitatively, especially in terms of global consistency.
Pages: 1977-1991
Number of pages: 15