Towards Balanced RGB-TSDF Fusion for Consistent Semantic Scene Completion by 3D RGB Feature Completion and a Classwise Entropy Loss Function

Times cited: 0

Authors:

Ding, Laiyan [1]

Hu, Panwen [1]

Li, Jie [2]

Huang, Rui [1]

Affiliations:

[1] Chinese Univ Hong Kong Shenzhen, Sch Sci & Engn, Shenzhen, Guangdong, Peoples R China

[2] Shenzhen Polytech Univ, Sch Artificial Intelligence, Shenzhen, Guangdong, Peoples R China

Source:

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II | 2024 / Vol. 14426

Keywords:

Semantic Scene Completion; RGB-TSDF fusion; Entropy-based loss function; NETWORK;

DOI:

10.1007/978-981-99-8432-9_11

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semantic Scene Completion (SSC) aims to jointly infer the semantics and occupancy of 3D scenes. The Truncated Signed Distance Function (TSDF), a 3D encoding of depth, has been a common input for SSC. RGB-TSDF fusion seems promising, since the two modalities provide color and geometry information, respectively. Nevertheless, RGB-TSDF fusion has been considered nontrivial, and the commonly used naive addition produces inconsistent results. We argue that the inconsistency comes from the sparsity of RGB features once they are projected into 3D space, whereas TSDF features are dense, which leads to imbalanced feature maps when the two are summed. To address this RGB-TSDF distribution difference, we propose a two-stage network with a 3D RGB feature completion module that completes RGB features with meaningful values for occluded areas. Moreover, we propose an effective classwise entropy loss function to penalize inconsistency. Extensive experiments on public datasets verify that our method achieves state-of-the-art performance among methods that do not adopt extra data.
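
The abstract does not give the formula for the classwise entropy loss, so the following is only an illustrative sketch of one way an entropy term could penalize inconsistent predictions within a class. It is not the authors' definition. The function name `classwise_entropy`, its arguments, and the choice to average predictions per ground-truth class before taking the entropy are all assumptions made for this example.

```python
import math
from collections import defaultdict


def classwise_entropy(probs, labels, eps=1e-12):
    """Hypothetical consistency penalty (an assumption, not the paper's loss).

    probs:  list of per-voxel class-probability lists (each sums to 1).
    labels: list of ground-truth class ids, one per voxel.
    For every ground-truth class, the predicted distributions of its voxels
    are averaged, and the entropy of that average is computed. The result is
    the mean of these entropies over the classes that are present.
    """
    sums = {}
    counts = defaultdict(int)
    for p, y in zip(probs, labels):
        if y not in sums:
            sums[y] = [0.0] * len(p)
        for k, v in enumerate(p):
            sums[y][k] += v
        counts[y] += 1

    if not sums:
        return 0.0  # no voxels, nothing to penalize

    entropies = []
    for y, s in sums.items():
        mean = [v / counts[y] for v in s]
        entropies.append(-sum(m * math.log(m + eps) for m in mean))
    return sum(entropies) / len(entropies)


# Voxels of the same class that agree give a low value;
# voxels of the same class that disagree give a higher value.
consistent = classwise_entropy([[1.0, 0.0], [1.0, 0.0]], [0, 0])
inconsistent = classwise_entropy([[1.0, 0.0], [0.0, 1.0]], [0, 0])
```

Under this assumed formulation, the value is near zero when all voxels of a class receive the same confident prediction and grows as predictions within the class diverge, which matches the stated goal of penalizing inconsistency.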


Pages: 128-141

Page count: 14

