Semantic Scene Completion from a Single Depth Image

Cited by: 672
Authors
Song, Shuran [1 ]
Yu, Fisher [1 ]
Zeng, Andy [1 ]
Chang, Angel X. [1 ]
Savva, Manolis [1 ]
Funkhouser, Thomas [1 ]
Affiliations
[1] Princeton Univ, Princeton, NJ 08544 USA
Funding
National Science Foundation (USA);
Keywords
DOI
10.1109/CVPR.2017.28
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper focuses on semantic scene completion, the task of producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG - a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task. The dataset and code are available at http://sscnet.cs.princeton.edu.
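The abstract's claim that dilation "efficiently" expands the receptive field can be illustrated with a small calculation. The sketch below is not taken from SSCNet: the kernel size and dilation rates are hypothetical values chosen for illustration, and the function name `receptive_field` is an assumption. It relies only on the standard result that a stride-1 convolution with kernel size k and dilation d widens the receptive field by (k - 1) * d along each axis.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field, in voxels per axis, of stacked stride-1 convolutions.

    Each layer with kernel size k and dilation d adds (k - 1) * d voxels.
    The dilation rates passed in are illustrative, not SSCNet's actual settings.
    """
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


def weights_per_filter(kernel_size):
    """Weights in one 3D filter per input channel; independent of dilation."""
    return kernel_size ** 3


# Three 3x3x3 layers without dilation versus with hypothetical rates 1, 2, 4.
plain = receptive_field(3, [1, 1, 1])
dilated = receptive_field(3, [1, 2, 4])

print("plain:", plain, "voxels per axis")
print("dilated:", dilated, "voxels per axis")
print("weights per filter in both cases:", weights_per_filter(3))
```

Under these assumed settings the dilated stack covers roughly twice the extent per axis with the same number of weights, which is the sense in which dilation enlarges 3D context cheaply.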
Pages: 190-198
Page count: 9
Related papers
50 in total
  • [21] Ego-Semantic Labeling of Scene from Depth Image for Visually Impaired and Blind People
    Zatout, Chayma
    Larabi, Slimane
    Mendili, Ilyes
    Barnabe, Soedji Ablam Edoh
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4376 - 4384
  • [22] Deep Depth Completion of a Single RGB-D Image
    Zhang, Yinda
    Funkhouser, Thomas
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 175 - 185
  • [23] Unleashing Network Potentials for Semantic Scene Completion
    Wang, Fengyun
    Sun, Qianru
    Zhang, Dong
    Tang, Jinhui
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10314 - 10323
  • [24] Stereo-augmented Depth Completion from a Single RGB-LiDAR image
    Choi, Keunhoon
    Jeong, Somi
    Kim, Youngjung
    Sohn, Kwanghoon
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13641 - 13647
  • [25] SCPNet: Semantic Scene Completion on Point Cloud
    Xia, Zhaoyang
    Liu, Youquan
    Li, Xin
    Zhu, Xinge
    Ma, Yuexin
    Li, Yikang
    Hou, Yuenan
    Qiao, Yu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17642 - 17651
  • [26] See and Think: Disentangling Semantic Scene Completion
    Liu, Shice
    Hu, Yu
    Zeng, Yiming
    Tang, Qiankun
    Jin, Beibei
    Han, Yinhe
    Li, Xiaowei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [27] A comparative review of plausible hole filling strategies in the context of scene depth image completion
    Atapour-Abarghouei, Amir
    Breckon, Toby P.
    COMPUTERS & GRAPHICS-UK, 2018, 72 : 39 - 58
  • [28] PesRec: A parametric estimation method for indoor semantic scene reconstruction from a single image
    Cao, Xingwen
    Zheng, Xueting
    Zheng, Hongwei
    Chen, Xi
    Bao, Anming
    Liu, Ying
    Liu, Tie
    Zhang, Haoran
    Zhao, Muhua
    Zhang, Zichen
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 133
  • [29] A Semantic Occlusion Model for Human Pose Estimation from a Single Depth Image
    Rafi, Umer
    Gall, Juergen
    Leibe, Bastian
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [30] Indoor Scene Structure Analysis for Single Image Depth Estimation
    Zhuo, Wei
    Salzmann, Mathieu
    He, Xuming
    Liu, Miaomiao
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 614 - 622