Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

Cited by: 49

Authors:

Liang, Zhihao [1,2]

Li, Zhihao [3]

Xu, Songcen [3]

Tan, Mingkui [1]

Jia, Kui [1,4,5]

Affiliations:

[1] South China Univ Technol, Guangzhou, Peoples R China

[2] DexForce Technol Co Ltd, Seattle, WA 98164 USA

[3] Huawei Technol, Noahs Ark Lab, Hong Kong, Peoples R China

[4] Pazhou Lab, Guangzhou, Peoples R China

[5] Peng Cheng Lab, Shenzhen, Peoples R China

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

Funding:

National Natural Science Foundation of China

DOI:

10.1109/ICCV48922.2021.00278

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Instance segmentation in 3D scenes is fundamental in many applications of scene understanding. It remains challenging because of the compounding factors of data irregularity and uncertainty in the number of instances. State-of-the-art methods largely rely on a general pipeline that first learns point-wise features discriminative at semantic and instance levels, followed by a separate step of point grouping for proposing object instances. While promising, these methods have two shortcomings: (1) the second step is not supervised by the main objective of instance segmentation, and (2) their point-wise feature learning and grouping are less effective at dealing with data irregularities, possibly resulting in fragmented segmentations. To address these issues, we propose an end-to-end solution, the Semantic Superpoint Tree Network (SSTNet), for proposing object instances from scene points. Key to SSTNet is an intermediate semantic superpoint tree (SST), which is constructed from the learned semantic features of superpoints and is traversed and split at intermediate tree nodes to produce object instance proposals. We also design in SSTNet a refinement module, termed CliqueNet, to prune superpoints that may be wrongly grouped into instance proposals. Experiments on the ScanNet and S3DIS benchmarks show the efficacy of the proposed method. At the time of submission, SSTNet ranks top on the ScanNet (V2) leaderboard, with an mAP 2% higher than that of the second-best method. The source code in PyTorch is available at https://github.com/Gorilla-Lab-SCUT/SSTNet.
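
The tree-based proposal idea in the abstract can be illustrated with a toy sketch. This is not the SSTNet implementation: the abstract does not state how the tree is built or how split decisions are made, so the bottom-up merging by Euclidean distance, the fixed `split_threshold`, and all names below (`Node`, `build_tree`, `propose`) are assumptions chosen only for illustration. In particular, the fixed threshold stands in for whatever decision the network makes at each node, and the CliqueNet refinement step is not modeled.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Vector = Tuple[float, ...]


@dataclass
class Node:
    members: List[int]            # indices of the superpoints under this node
    feature: Vector               # mean feature of those superpoints
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def _dist(a: Vector, b: Vector) -> float:
    """Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def _merge(a: Node, b: Node) -> Node:
    """Create a parent node whose feature is the size-weighted mean of its children."""
    na, nb = len(a.members), len(b.members)
    feature = tuple((x * na + y * nb) / (na + nb) for x, y in zip(a.feature, b.feature))
    return Node(a.members + b.members, feature, a, b)


def build_tree(features: Sequence[Vector]) -> Node:
    """Bottom-up (assumed rule): repeatedly merge the two closest clusters."""
    nodes = [Node([i], tuple(f)) for i, f in enumerate(features)]
    while len(nodes) > 1:
        i, j = min(
            ((i, j) for i in range(len(nodes)) for j in range(i + 1, len(nodes))),
            key=lambda p: _dist(nodes[p[0]].feature, nodes[p[1]].feature),
        )
        merged = _merge(nodes[i], nodes[j])
        nodes = [n for k, n in enumerate(nodes) if k not in (i, j)] + [merged]
    return nodes[0]


def propose(root: Node, split_threshold: float) -> List[List[int]]:
    """Top-down: split a node if its children look dissimilar, else emit a proposal.

    The threshold test is a hypothetical stand-in for a learned split decision.
    """
    proposals, stack = [], [root]
    while stack:
        node = stack.pop()
        if node.left is not None and _dist(node.left.feature, node.right.feature) > split_threshold:
            stack.extend([node.right, node.left])
        else:
            proposals.append(sorted(node.members))
    return proposals
```

With four toy superpoint features forming two well-separated pairs, `propose(build_tree(features), split_threshold=1.0)` groups each pair into one proposal, while a very small threshold splits the tree down to single superpoints.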

Pages: 2763-2772

Page count: 10

