Transformer-based 3D Instance Segmentation With Auxiliary Denoising Learning

被引:0
|
作者
Song, Sung-Ho [1 ]
Kim, Incheol [1 ]
机构
[1] Department of Computer Science, Kyonggi University, Korea, Republic of
关键词
Learning systems;
D O I
10.5302/J.ICROS.2023.23.0150
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
3D point cloud instance segmentation, as a task in comprehending 3D scenes, involves predicting both 3D masks and class labels for individual object instances within a given point cloud. The development of an efficient transformer-based model for this task requires addressing the following key issues: refining instance masks and positions, initializing instance queries, and incorporating auxiliary task learning. To overcome the limitations of existing models, our study proposes a novel transformer-based model, T3DIS. This model refines both the mask and position, along with the query content of each instance during instance query decoding, thereby enhancing the quality of the final instance features. To expedite the instance decoding process, the model initializes the initial instance queries using a finite set of representative points selected from the point cloud. Furthermore, our approach incorporates auxiliary denoising task learning to facilitate rapid training of the transformer decoder. Through experiments conducted on the ScanNet-V2 benchmark dataset, we demonstrated the superiority of the proposed model. The evaluation involves comparing different methods of instance query initialization, position refinement, and auxiliary query denoising. © ICROS 2023.
引用
收藏
页码:954 / 965
相关论文
共 50 条
  • [1] Query Refinement Transformer for 3D Instance Segmentation
    Lu, Jiahao
    Deng, Jiacheng
    Wang, Chuxin
    He, Jianfeng
    Zhang, Tianzhu
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18470 - 18480
  • [2] Superpoint Transformer for 3D Scene Instance Segmentation
    Sun, Jiahao
    Qing, Chunmei
    Tan, Junpeng
    Xu, Xiangmin
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2393 - 2401
  • [3] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861
  • [4] A robust transformer-based pipeline of 3D cell alignment, denoise and instance segmentation on electron microscopy sequence images
    Liu, Jiazheng
    Zheng, Yafeng
    Lin, Limei
    Guo, Jingyue
    Lv, Yanan
    Yuan, Jingbin
    Zhai, Hao
    Chen, Xi
    Shen, Lijun
    Li, Linlin
    Bai, Shunong
    Han, Hua
    [J]. JOURNAL OF PLANT PHYSIOLOGY, 2024, 297
  • [5] Transformer-based deep learning denoising of single and multi-delay 3D arterial spin labeling
    Shou, Qinyang
    Zhao, Chenyang
    Shao, Xingfeng
    Jann, Kay
    Kim, Hosung
    Helmer, Karl G.
    Lu, Hanzhang
    Wang, Danny J. J.
    [J]. MAGNETIC RESONANCE IN MEDICINE, 2023, 91 (02) : 803 - 818
  • [6] Joint Semantic and Instance Segmentation in 3D Point Cloud Based on Transformer
    Liu, Suyi
    Wu, Chengdong
    Xu, Fang
    Wang, Juxiang
    Chi, Jianning
    Yu, Xiaosheng
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4074 - 4080
  • [7] Mask3D: Mask Transformer for 3D Semantic Instance Segmentation
    Schult, Jonas
    Engelmann, Francis
    Hermans, Alexander
    Litany, Or
    Tang, Siyu
    Leibe, Bastian
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8216 - 8223
  • [8] Abstract: 3D Medical Image Segmentation with Transformer-based Scaling of ConvNets MedNeXt
    Roy, Saikat
    Koehler, Gregor
    Baumgartner, Michael
    Ulrich, Constantin
    Isensee, Fabian
    Jaeger, Paul F.
    Maier-Hein, Klaus
    [J]. BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 79 - 79
  • [9] Automatic 3D horizon picking using a volumetric transformer-based segmentation network
    Liao, Xiaofang
    Cao, Junxing
    Tan, Feng
    You, Jachun
    [J]. Journal of Applied Geophysics, 2025, 236
  • [10] Mask-Attention-Free Transformer for 3D Instance Segmentation
    Lai, Xin
    Yuan, Yuhui
    Chu, Ruihang
    Chen, Yukang
    Hu, Han
    Jia, Jiaya
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3670 - 3680