Real-Time Globally Consistent 3D Reconstruction With Semantic Priors

被引:10
|
作者
Huang, Shi-Sheng [1 ]
Chen, Haoxiang [2 ]
Huang, Jiahui [2 ]
Fu, Hongbo [3 ]
Hu, Shi-Min [2 ]
机构
[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, BNRist, Beijing 100190, Peoples R China
[3] City Univ Hong Kong, Sch Creat Media, Hong Kong, Peoples R China
关键词
Three-dimensional displays; Semantics; Cameras; Pose estimation; Real-time systems; Geometry; Simultaneous localization and mapping; 3D reconstruction; semantic fusion; semantic tracker; semantic pose graph;
D O I
10.1109/TVCG.2021.3137912
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Maintaining global consistency continues to be critical for online 3D indoor scene reconstruction. However, it is still challenging to generate satisfactory 3D reconstruction in terms of global consistency for previous approaches using purely geometric analysis, even with bundle adjustment or loop closure techniques. In this article, we propose a novel real-time 3D reconstruction approach which effectively integrates both semantic and geometric cues. The key challenge is how to map this indicative information, i.e., semantic priors, into a metric space as measurable information, thus enabling more accurate semantic fusion leveraging both the geometric and semantic cues. To this end, we introduce a semantic space with a continuous metric function measuring the distance between discrete semantic observations. Within the semantic space, we present an accurate frame-to-model semantic tracker for camera pose estimation, and semantic pose graph equipped with semantic links between submaps for globally consistent 3D scene reconstruction. With extensive evaluation on public synthetic and real-world 3D indoor scene RGB-D datasets, we show that our approach outperforms the previous approaches for 3D scene reconstruction both quantitatively and qualitatively, especially in terms of global consistency.
引用
收藏
页码:1977 / 1991
页数:15
相关论文
共 50 条
  • [41] Real-time 3D features reconstruction through monocular vision
    Liverani, Alfredo
    Leali, Francesco
    Pellicciari, Marcello
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2010, 4 (02): : 103 - 112
  • [42] 3D Points Splatting for real-time dynamic Hand Reconstruction
    Jiang, Zheheng
    Rahmani, Hossein
    Black, Sue
    Williams, Bryan
    PATTERN RECOGNITION, 2025, 162
  • [43] Real-Time Visualize the 3D Reconstruction Procedure Using CUDA
    Bi, Wenyuan
    Chen, Zhiqiang
    Zhang, Li
    Xing, Yuxiang
    Wang, Yajie
    2009 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2009, : 883 - +
  • [44] Robust 3D Surface Reconstruction in Real-Time with Localization Sensor
    Li, Wei
    Wu, Yi
    Shen, Chunlin
    Gong, Huajun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (08) : 2168 - 2172
  • [45] Real-Time Simultaneous 3D Reconstruction and Optical Flow Estimation
    Roxas, Menandro
    Oishi, Takeshi
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 885 - 893
  • [46] 3D real-time human reconstruction with a single RGBD camera
    Yang Lu
    Han Yu
    Wei Ni
    Liang Song
    Applied Intelligence, 2023, 53 : 8735 - 8745
  • [47] Real-Time 3D Reconstruction Method Based on Monocular Vision
    Jia, Qingyu
    Chang, Liang
    Qiang, Baohua
    Zhang, Shihao
    Xie, Wu
    Yang, Xianyi
    Sun, Yangchang
    Yang, Minghao
    SENSORS, 2021, 21 (17)
  • [48] Real-Time 3D Reconstruction for Collision Avoidance in Interventional Environments
    Ladikos, Alexander
    Benhimane, Selim
    Navab, Nassir
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2008, PT II, PROCEEDINGS, 2008, 5242 : 526 - 534
  • [49] 3D real-time human reconstruction with a single RGBD camera
    Lu, Yang
    Yu, Han
    Ni, Wei
    Song, Liang
    APPLIED INTELLIGENCE, 2023, 53 (08) : 8735 - 8745
  • [50] A Novel Photometric Method for Real-Time 3D Reconstruction of Fingerprint
    Xie, Wuyuan
    Song, Zhan
    Zhang, Xiaoting
    ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 31 - 40