RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion

被引:36
|
作者
Li, Jie [1 ,2 ]
Liu, Yu [2 ]
Gong, Dong [2 ]
Shi, Qinfeng [2 ]
Yuan, Xia [1 ]
Zhao, Chunxia [1 ]
Reid, Ian [2 ]
机构
[1] Nanjing Univ Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Univ Adelaide, Adelaide, SA, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
D O I
10.1109/CVPR.2019.00788
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB images differentiate from depth as they carry more details about the color and texture information, which can be utilized as a vital complement to depth for boosting the performance of 3D semantic scene completion (SSC). SSC is composed of 3D shape completion (SC) and semantic scene labeling while most of the existing approaches use depth as the sole input which causes the performance bottleneck. Moreover, the state-of-the-art methods employ 3D CNNs which have cumbersome networks and tremendous parameters. We introduce a light-weight Dimensional Decomposition Residual network (DDR)for 3D dense prediction tasks. The novel factorized convolution layer is effective for reducing the network parameters, and the proposed multi-scale fusion mechanism for depth and color image can improve the completion and segmentation accuracy simultaneously. Our method demonstrates excellent performance on two public datasets. Compared with the latest method SSCNet, we achieve 5.9% gains in SC-IoU and 5.7% gains in SSC-IOU, albeit with only 21% network parameters and 16.6% FLOPs employed compared with that of SSCNet.
引用
收藏
页码:7685 / 7694
页数:10
相关论文
共 50 条
  • [32] Deep 3D semantic scene extrapolation
    Abbasi, Ali
    Kalkan, Sinan
    Sahillioglu, Yusuf
    [J]. VISUAL COMPUTER, 2019, 35 (02): : 271 - 279
  • [33] Deep 3D semantic scene extrapolation
    Ali Abbasi
    Sinan Kalkan
    Yusuf Sahillioğlu
    [J]. The Visual Computer, 2019, 35 : 271 - 279
  • [34] Survey on Semantic Scene Completion Based on RGB-D Images
    Zhang, Kang
    An, Bo-Zhou
    Li, Jie
    Yuan, Xia
    Zhao, Chun-Xia
    [J]. Ruan Jian Xue Bao/Journal of Software, 2023, 34 (01): : 444 - 462
  • [35] Residual Attention Graph Convolutional Network for Geometric 3D Scene Classification
    Mosella-Montoro, Albert
    Ruiz-Hidalgo, Javier
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4123 - 4132
  • [36] Semantic Segmentation of 3D Scene based on Global Feature Fusion
    Wang, Dan
    Liu, Shuaijun
    Xu, Nansheng
    Lin, Xiaobo
    Wang, Zijiang
    [J]. 2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 286 - 290
  • [37] Semantic Tree-Based 3D Scene Model Recognition
    Yuan, Juefei
    Wang, Tianyang
    Zhe, Shandian
    Lu, Yijuan
    Li, Bo
    [J]. THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 85 - 90
  • [38] Heuristic 3D Object Shape Completion based on Symmetry and Scene Context
    Schiebener, David
    Schmidt, Andreas
    Vahrenkamp, Nikolaus
    Asfour, Tamim
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 74 - 81
  • [39] FFNet: Frequency Fusion Network for Semantic Scene Completion
    Wang, Xuzhi
    Lin, Di
    Wan, Liang
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2550 - 2557
  • [40] LMSCNet: Lightweight Multiscale 3D Semantic Completion
    Roldao, Luis
    de Charette, Raoul
    Verroust-Blondet, Anne
    [J]. 2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 111 - 119