Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

被引:0
|
作者
Tang, Jiaxiang [1 ]
Chen, Xiaokang [1 ]
Wang, Jingbo [2 ]
Zeng, Gang [1 ]
机构
[1] Peking Univ, Sch, Key Lab Machine Percept MoE, Beijing, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We revisit Semantic Scene Completion (SSC), a useful task to predict the semantic and occupancy representation of 3D scenes, in this paper. A number of methods for this task are always based on voxelized scene representations for keeping local scene structure. However, due to the existence of visible empty voxels, these methods always suffer from heavy computation redundancy when the network goes deeper, and thus limit the completion quality. To address this dilemma, we propose our novel point-voxel aggregation network for this task. Firstly, we transfer the voxelized scenes to point clouds by removing these visible empty voxels and adopt a deep point stream to capture semantic information from the scene efficiently. Meanwhile, a light-weight voxel stream containing only two 3D convolution layers preserves local structures of the voxelized scenes. Furthermore, we design an anisotropic voxel aggregation operator to fuse the structure details from the voxel stream into the point stream, and a semantic-aware propagation module to enhance the up-sampling process in the point stream by semantic labels. We demonstrate that our model surpasses state-of-the-arts on two benchmarks by a large margin, with only depth images as the input.
引用
收藏
页码:2352 / 2360
页数:9
相关论文
共 41 条
  • [1] Sparse point-voxel aggregation network for efficient point cloud semantic segmentation
    Fang, Zheng
    Xiong, Binyu
    Liu, Fei
    [J]. IET COMPUTER VISION, 2022, 16 (07) : 644 - 654
  • [2] 3D Shape Generation and Completion through Point-Voxel Diffusion
    Zhou, Linqi
    Du, Yilun
    Wu, Jiajun
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5806 - 5815
  • [3] 3D Point-Voxel Correlation Fields for Scene Flow Estimation
    Wang, Ziyi
    Wei, Yi
    Rao, Yongming
    Zhou, Jie
    Lu, Jiwen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13621 - 13635
  • [4] PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds
    Wei, Yi
    Wang, Ziyi
    Rao, Yongming
    Lu, Jiwen
    Zhou, Jie
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6950 - 6959
  • [5] Semantic Point Completion Network for 3D Semantic Scene Completion
    Zhong, Min
    Zeng, Gang
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2824 - 2831
  • [6] SCPNet: Semantic Scene Completion on Point Cloud
    Xia, Zhaoyang
    Liu, Youquan
    Li, Xin
    Zhu, Xinge
    Ma, Yuexin
    Li, Yikang
    Hou, Yuenan
    Qiao, Yu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17642 - 17651
  • [7] Voxel- and Bird's-Eye-View-Based Semantic Scene Completion for LiDAR Point Clouds
    Liang, Li
    Akhtar, Naveed
    Vice, Jordan
    Mian, Ajmal
    [J]. REMOTE SENSING, 2024, 16 (13)
  • [8] Point Cloud Semantic Scene Completion from RGB-D Images
    Zhang, Shoulong
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3385 - 3393
  • [9] Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds
    Yang, Xuemeng
    Zou, Hao
    Kong, Xin
    Huang, Tianxin
    Liu, Yong
    Li, Wanlong
    Wen, Feng
    Zhang, Hongbo
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3555 - 3562
  • [10] Semantic Scene Completion from a Single Depth Image
    Song, Shuran
    Yu, Fisher
    Zeng, Andy
    Chang, Angel X.
    Savva, Manolis
    Funkhouser, Thomas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 190 - 198