Semantic Segmentation Method of Point Cloud Based on Sparse Convolution and Attention Mechanism

被引:0
|
作者
Zuo Meng [1 ,2 ,3 ,4 ]
Liu Yiyang [1 ,2 ,3 ]
Cui Hao [1 ,2 ,3 ]
Bai Hongfei [2 ]
机构
[1] Chinese Acad Sci, Key Lab Networked Control Syst, Shenyang 110016, Liaoning, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Liaoning, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
machine vision; semantic segmentation of point cloud; sparse convolution; attention mechanism; spatial; pyramid sampling;
D O I
10.3788/LOP222819
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, three- dimensional point cloud semantic segmentation techniques based on sparse convolution have made great progress. However, sparse convolution causes a loss of global context information. In this study, a point cloud semantic segmentation method based on a sparse convolution and attention mechanism is proposed. Here, the attention mechanism is introduced into a sparse convolutional network to improve the network's ability to achieve global context information. However, extensive computation of the attention mechanism limits the applicability of the proposed method. Hence, to expand its usage while decreasing the amount of computation, spatial pyramid sampling is further introduced in the attention mechanism. Experimental results demonstrate that the proposed method achieves 71. 825% of the average intersection over union (MIOU) on the Scannet V2 dataset and 70. 5% on the S3DIS dataset, suggesting the proposed method's effectiveness and its superiority to the comparison method.
引用
下载
收藏
页数:12
相关论文
共 26 条
  • [1] Armeni I., 2017, arXiv
  • [2] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
    Choy, Christopher
    Gwak, JunYoung
    Savarese, Silvio
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3070 - 3079
  • [3] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [4] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
    Dai, Angela
    Chang, Angel X.
    Savva, Manolis
    Halber, Maciej
    Funkhouser, Thomas
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2432 - 2443
  • [5] Engelcke Martin, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1355, DOI 10.1109/ICRA.2017.7989161
  • [6] Graham B, 2014, Arxiv, DOI arXiv:1409.6070
  • [7] 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks
    Graham, Benjamin
    Engelcke, Martin
    van der Maaten, Laurens
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9224 - 9232
  • [8] DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution
    He, Tong
    Shen, Chunhua
    van den Hengel, Anton
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 354 - 363
  • [9] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [10] Kumar KSC, 2021, IEEE WINT C APPL COM, P117, DOI [10.1109/WACVW52041.2021.00017, 10.1109/ICCES51350.2021.9488941]