Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

被引:49
|
作者
Liang, Zhihao [1 ,2 ]
Li, Zhihao [3 ]
Xu, Songcen [3 ]
Tan, Mingkui [1 ]
Jia, Kui [1 ,4 ,5 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] DexForce Technol Co Ltd, Seattle, WA 98164 USA
[3] Huawei Technol, Noahs Ark Lab, Hong Kong, Peoples R China
[4] Pazhou Lab, Guangzhou, Peoples R China
[5] Peng Cheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV48922.2021.00278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance segmentation in 3D scenes is fundamental in many applications of scene understanding. It is yet challenging due to the compound factors of data irregularity and uncertainty in the numbers of instances. State-of-the-art methods largely rely on a general pipeline that first learns point-wise features discriminative at semantic and instance levels, followed by a separate step of point grouping for proposing object instances. While promising, they have the shortcomings that (1) the second step is not supervised by the main objective of instance segmentation, and (2) their point-wise feature learning and grouping are less effective to deal with data irregularities, possibly resulting in fragmented segmentations. To address these issues, we propose in this work an end-to-end solution of Semantic Superpoint Tree Network (SSTNet) for proposing object instances from scene points. Key in SSTNet is an intermediate, semantic superpoint tree (SST), which is constructed based on the learned semantic features of superpoints, and which will be traversed and split at intermediate tree nodes for proposals of object instances. We also design in SSTNet a refinement module, termed CliqueNet, to prune superpoints that may be wrongly grouped into instance proposals. Experiments on the benchmarks of ScanNet and S3DIS show the efficacy of our proposed method. At the time of submission, SSTNet ranks top on the ScanNet (V2) leaderboard, with 2% higher of mAP than the second best method. The source code in PyTorch is available at https://github.com/Gorilla-Lab-SCUT/SSTNet.
引用
收藏
页码:2763 / 2772
页数:10
相关论文
共 50 条
  • [1] Superpoint Transformer for 3D Scene Instance Segmentation
    Sun, Jiahao
    Qing, Chunmei
    Tan, Junpeng
    Xu, Xiangmin
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2393 - 2401
  • [2] Learning Superpoint Graph Cut for 3D Instance Segmentation
    Hui, Le
    Tang, Linghua
    Shen, Yaqi
    Xie, Jin
    Yang, Jian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Efficient 3D Semantic Segmentation with Superpoint Transformer
    Robert, Damien
    Raguet, Hugo
    Landrieu, Loic
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17149 - 17158
  • [4] Learning Inter-superpoint Affinity for Weakly Supervised 3D Instance Segmentation
    Tang, Linghua
    Hui, Le
    Xie, Jin
    [J]. COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 176 - 192
  • [5] Nonparametric Semantic Segmentation for 3D Street Scenes
    He, Hu
    Upcroft, Ben
    [J]. 2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 3697 - 3703
  • [6] Semantic Segmentation Networks of 3D Point Clouds for RGB-D Indoor Scenes
    Wang, Ya
    Zell, Andreas
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [7] Semantic object segmentation of 3D scenes using color and shape compatibility
    Yazdi, M
    Zaccarin, A
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 268 - 272
  • [8] Joint 2D and 3D Semantic Segmentation with Consistent Instance Semantic
    Wan, Yingcai
    Fang, Lijin
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107A (08) : 1309 - 1318
  • [9] Joint Semantic-Instance Segmentation of 3D Point Clouds: Instance Separation and Semantic Fusion
    Zhong, Min
    Zeng, Gang
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6616 - 6623
  • [10] Superpoint-guided Semi-supervised Semantic Segmentation of 3D Point Clouds
    Deng, Shuang
    Dong, Qiulei
    Liu, Bo
    Hu, Zhanyi
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9214 - 9220