Real-Time Globally Consistent 3D Reconstruction With Semantic Priors

Cited by: 10

Authors:

Huang, Shi-Sheng ^{[1]}

Chen, Haoxiang ^{[2]}

Huang, Jiahui ^{[2]}

Fu, Hongbo ^{[3]}

Hu, Shi-Min ^{[2]}

Affiliations:

[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China

[2] Tsinghua Univ, Dept Comp Sci & Technol, BNRist, Beijing 100190, Peoples R China

[3] City Univ Hong Kong, Sch Creat Media, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS | 2023 / Vol. 29 / No. 04

Keywords:

Three-dimensional displays; Semantics; Cameras; Pose estimation; Real-time systems; Geometry; Simultaneous localization and mapping; 3D reconstruction; semantic fusion; semantic tracker; semantic pose graph;

DOI:

10.1109/TVCG.2021.3137912

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Subject classification code:

081202 ; 0835 ;

Abstract:

Maintaining global consistency continues to be critical for online 3D indoor scene reconstruction. However, it is still challenging to generate satisfactory 3D reconstruction in terms of global consistency for previous approaches using purely geometric analysis, even with bundle adjustment or loop closure techniques. In this article, we propose a novel real-time 3D reconstruction approach which effectively integrates both semantic and geometric cues. The key challenge is how to map this indicative information, i.e., semantic priors, into a metric space as measurable information, thus enabling more accurate semantic fusion leveraging both the geometric and semantic cues. To this end, we introduce a semantic space with a continuous metric function measuring the distance between discrete semantic observations. Within the semantic space, we present an accurate frame-to-model semantic tracker for camera pose estimation, and a semantic pose graph equipped with semantic links between submaps for globally consistent 3D scene reconstruction. With extensive evaluation on public synthetic and real-world 3D indoor scene RGB-D datasets, we show that our approach outperforms the previous approaches for 3D scene reconstruction both quantitatively and qualitatively, especially in terms of global consistency.


Pages: 1977-1991

Page count: 15

50 items in total

[41] Real-time 3D features reconstruction through monocular vision
Liverani, Alfredo
Leali, Francesco
Pellicciari, Marcello
INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2010, 4 (02) : 103 - 112
[42] 3D Points Splatting for real-time dynamic Hand Reconstruction
Jiang, Zheheng
Rahmani, Hossein
Black, Sue
Williams, Bryan
PATTERN RECOGNITION, 2025, 162
[43] Real-Time Visualize the 3D Reconstruction Procedure Using CUDA
Bi, Wenyuan
Chen, Zhiqiang
Zhang, Li
Xing, Yuxiang
Wang, Yajie
2009 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2009, : 883 - +
[44] Robust 3D Surface Reconstruction in Real-Time with Localization Sensor
Li, Wei
Wu, Yi
Shen, Chunlin
Gong, Huajun
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (08) : 2168 - 2172
[45] Real-Time Simultaneous 3D Reconstruction and Optical Flow Estimation
Roxas, Menandro
Oishi, Takeshi
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 885 - 893
[46] 3D real-time human reconstruction with a single RGBD camera
Yang Lu
Han Yu
Wei Ni
Liang Song
Applied Intelligence, 2023, 53 : 8735 - 8745
[47] Real-Time 3D Reconstruction Method Based on Monocular Vision
Jia, Qingyu
Chang, Liang
Qiang, Baohua
Zhang, Shihao
Xie, Wu
Yang, Xianyi
Sun, Yangchang
Yang, Minghao
SENSORS, 2021, 21 (17)
[48] Real-Time 3D Reconstruction for Collision Avoidance in Interventional Environments
Ladikos, Alexander
Benhimane, Selim
Navab, Nassir
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2008, PT II, PROCEEDINGS, 2008, 5242 : 526 - 534
[49] 3D real-time human reconstruction with a single RGBD camera
Lu, Yang
Yu, Han
Ni, Wei
Song, Liang
APPLIED INTELLIGENCE, 2023, 53 (08) : 8735 - 8745
[50] A Novel Photometric Method for Real-Time 3D Reconstruction of Fingerprint
Xie, Wuyuan
Song, Zhan
Zhang, Xiaoting
ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 31 - 40
