RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion

Cited by: 36
Authors
Li, Jie [1 ,2 ]
Liu, Yu [2 ]
Gong, Dong [2 ]
Shi, Qinfeng [2 ]
Yuan, Xia [1 ]
Zhao, Chunxia [1 ]
Reid, Ian [2 ]
Affiliations
[1] Nanjing Univ Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Univ Adelaide, Adelaide, SA, Australia
Funding
Australian Research Council; National Natural Science Foundation of China;
Keywords
DOI
10.1109/CVPR.2019.00788
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
RGB images differ from depth images in that they carry more detailed color and texture information, which can serve as a vital complement to depth for boosting the performance of 3D semantic scene completion (SSC). SSC is composed of 3D shape completion (SC) and semantic scene labeling, yet most existing approaches use depth as the sole input, which causes a performance bottleneck. Moreover, state-of-the-art methods employ 3D CNNs, which have cumbersome networks and a tremendous number of parameters. We introduce a light-weight Dimensional Decomposition Residual network (DDR) for 3D dense prediction tasks. The novel factorized convolution layer is effective for reducing the network parameters, and the proposed multi-scale fusion mechanism for depth and color images can improve completion and segmentation accuracy simultaneously. Our method demonstrates excellent performance on two public datasets. Compared with the latest method, SSCNet, we achieve gains of 5.9% in SC-IoU and 5.7% in SSC-IoU, while using only 21% of the network parameters and 16.6% of the FLOPs of SSCNet.
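The parameter saving that a factorized 3D convolution can give is easy to check with a little arithmetic. The sketch below is illustrative only: it assumes the factorization replaces one k×k×k kernel with three consecutive one-dimensional kernels (1×1×k, 1×k×1, k×1×1), which the abstract does not spell out, and the function names and channel counts are hypothetical. It counts weights only and ignores biases, so it does not reproduce the 21% figure reported against SSCNet, which depends on the full architecture.

```python
def conv3d_params(c_in, c_out, kernel):
    """Weight count of a 3D convolution with kernel (kd, kh, kw); biases ignored."""
    kd, kh, kw = kernel
    return c_in * c_out * kd * kh * kw


def decomposed_params(c_in, c_out, k):
    """Weight count of three stacked 1D convolutions, one per spatial axis.

    Assumption: the first layer maps c_in -> c_out and the other two keep
    c_out channels. This layout is a guess made for illustration.
    """
    return (
        conv3d_params(c_in, c_out, (1, 1, k))
        + conv3d_params(c_out, c_out, (1, k, 1))
        + conv3d_params(c_out, c_out, (k, 1, 1))
    )


# Hypothetical layer: 64 input channels, 64 output channels, kernel size 3.
full = conv3d_params(64, 64, (3, 3, 3))
factorized = decomposed_params(64, 64, 3)
print(full, factorized, factorized / full)
```

With equal input and output channels, the factorized layer needs 3k weights per channel pair instead of k³, so the ratio is 3/k² (one third for k = 3).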
Pages: 7685-7694
Number of pages: 10
Related papers
50 in total
  • [1] Semantic Point Completion Network for 3D Semantic Scene Completion
    Zhong, Min
    Zeng, Gang
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2824 - 2831
  • [2] 3D Semantic Scene Completion: A Survey
    Luis Roldão
    Raoul de Charette
    Anne Verroust-Blondet
    [J]. International Journal of Computer Vision, 2022, 130 : 1978 - 2005
  • [3] 3D Semantic Scene Completion: A Survey
    Roldao, Luis
    de Charette, Raoul
    Verroust-Blondet, Anne
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (08) : 1978 - 2005
  • [4] MonoScene: Monocular 3D Semantic Scene Completion
    Anh-Quan Cao
    de Charette, Raoul
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3981 - 3991
  • [5] Two Stream 3D Semantic Scene Completion
    Garbade, Martin
    Chen, Yueh-Tung
    Sawatzky, Johann
    Gall, Juergen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 416 - 425
  • [6] 3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis
    Qi, Xiaojuan
    Liu, Zhengzhe
    Chen, Qifeng
    Jia, Jiaya
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7665 - 7674
  • [7] Anisotropic Convolutional Networks for 3D Semantic Scene Completion
    Li, Jie
    Han, Kai
    Wang, Peng
    Liu, Yu
    Yuan, Xia
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3348 - 3356
  • [8] Resolution-switchable 3D Semantic Scene Completion
    Luo, Shoutong
    Sun, Zhengxing
    Sun, Yunhan
    Wang, Yi
    [J]. COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 121 - 130
  • [9] Instance-Aware Monocular 3D Semantic Scene Completion
    Xiao, Haihong
    Xu, Hongbin
    Kang, Wenxiong
    Li, Yuqiong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 6543 - 6554
  • [10] From Front to Rear: 3D Semantic Scene Completion Through Planar Convolution and Attention-Based Network
    Li, Jie
    Song, Qi
    Yan, Xiaohu
    Chen, Yongquan
    Huang, Rui
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8294 - 8307