Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

Cited by: 0

Authors:

Tang, Jiaxiang [1]

Chen, Xiaokang [1]

Wang, Jingbo [2]

Zeng, Gang [1]

Affiliations:

[1] Peking Univ, Sch, Key Lab Machine Percept MoE, Beijing, Peoples R China

[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China

Source:

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we revisit Semantic Scene Completion (SSC), the task of predicting the semantic and occupancy representation of 3D scenes. Many methods for this task rely on voxelized scene representations to preserve local scene structure. However, because of visible empty voxels, these methods suffer from heavy computational redundancy as the network goes deeper, which limits completion quality. To address this dilemma, we propose a novel point-voxel aggregation network. First, we convert the voxelized scenes to point clouds by removing the visible empty voxels, and adopt a deep point stream to capture semantic information from the scene efficiently. Meanwhile, a lightweight voxel stream containing only two 3D convolution layers preserves the local structures of the voxelized scenes. Furthermore, we design an anisotropic voxel aggregation operator to fuse structural details from the voxel stream into the point stream, and a semantic-aware propagation module that uses semantic labels to enhance up-sampling in the point stream. We demonstrate that our model surpasses the state of the art on two benchmarks by a large margin, with only depth images as input.
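The voxel-to-point conversion described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `voxels_to_points`, the array layout, and the assumption that a boolean mask of visible empty voxels has already been computed (for example from the depth image) are all hypothetical choices made for illustration.

```python
import numpy as np


def voxels_to_points(features, visible_empty):
    """Drop visible empty voxels and return the rest as a point set.

    Assumed (hypothetical) layout:
      features      : (D, H, W, C) float array of per-voxel features
      visible_empty : (D, H, W) bool array, True where a voxel is
                      observed to be empty
    Returns:
      coords      : (N, 3) integer voxel indices of the kept voxels
      point_feats : (N, C) features of the kept voxels, in the same order
    """
    if features.shape[:3] != visible_empty.shape:
        raise ValueError("features and visible_empty must share a grid shape")
    keep = ~visible_empty
    # np.argwhere and boolean indexing both traverse the grid in C order,
    # so row i of coords corresponds to row i of point_feats.
    coords = np.argwhere(keep)
    point_feats = features[keep]
    return coords, point_feats
```

Only the kept voxels are passed on, so any later per-point processing scales with the number of remaining voxels rather than with the full grid size.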


Pages: 2352-2360

Page count: 9
