Semantic Scene Completion from a Single Depth Image

Cited by: 672

Authors:

Song, Shuran [1]

Yu, Fisher [1]

Zeng, Andy [1]

Chang, Angel X. [1]

Savva, Manolis [1]

Funkhouser, Thomas [1]

Affiliations:

[1] Princeton Univ, Princeton, NJ 08544 USA

Source:

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017

Funding:

National Science Foundation (USA);

Keywords:

DOI:

10.1109/CVPR.2017.28

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG, a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task. The dataset and code are available at http://sscnet.cs.princeton.edu.


Pages: 190 - 198

Page count: 9

50 items in total

[21] Ego-Semantic Labeling of Scene from Depth Image for Visually Impaired and Blind People
Zatout, Chayma
Larabi, Slimane
Mendili, Ilyes
Barnabe, Soedji Ablam Edoh
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4376 - 4384
[22] Deep Depth Completion of a Single RGB-D Image
Zhang, Yinda
Funkhouser, Thomas
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 175 - 185
[23] Unleashing Network Potentials for Semantic Scene Completion
Wang, Fengyun
Sun, Qianru
Zhang, Dong
Tang, Jinhui
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10314 - 10323
[24] Stereo-augmented Depth Completion from a Single RGB-LiDAR image
Choi, Keunhoon
Jeong, Somi
Kim, Youngjung
Sohn, Kwanghoon
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13641 - 13647
[25] SCPNet: Semantic Scene Completion on Point Cloud
Xia, Zhaoyang
Liu, Youquan
Li, Xin
Zhu, Xinge
Ma, Yuexin
Li, Yikang
Hou, Yuenan
Qiao, Yu
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17642 - 17651
[26] See and Think: Disentangling Semantic Scene Completion
Liu, Shice
Hu, Yu
Zeng, Yiming
Tang, Qiankun
Jin, Beibei
Han, Yinhe
Li, Xiaowei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[27] A comparative review of plausible hole filling strategies in the context of scene depth image completion
Atapour-Abarghouei, Amir
Breckon, Toby P.
COMPUTERS & GRAPHICS-UK, 2018, 72 : 39 - 58
[28] PesRec: A parametric estimation method for indoor semantic scene reconstruction from a single image
Cao, Xingwen
Zheng, Xueting
Zheng, Hongwei
Chen, Xi
Bao, Anming
Liu, Ying
Liu, Tie
Zhang, Haoran
Zhao, Muhua
Zhang, Zichen
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 133
[29] A Semantic Occlusion Model for Human Pose Estimation from a Single Depth Image
Rafi, Umer
Gall, Juergen
Leibe, Bastian
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
[30] Indoor Scene Structure Analysis for Single Image Depth Estimation
Zhuo, Wei
Salzmann, Mathieu
He, Xuming
Liu, Miaomiao
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 614 - 622
