Point-Cloud Semantic Segmentation Network Considering Normals

被引:2
|
作者
Shang Pengfei [1 ,2 ]
Chen Yi [1 ,2 ]
Lv Weijia [1 ,2 ]
Zheng Fang [1 ,2 ]
Wang Jielong [1 ,2 ]
机构
[1] Tongji Univ, Coll Surveying & Geoinformat, Shanghai 200092, Peoples R China
[2] Natl Adm Surveying Mapping & Geoinformat, Key Lab Modern Engn Surveying, Shanghai 200092, Peoples R China
关键词
image processing; point cloud; deep learning; semantic segmentation; normals; principal component analysis; SURFACE;
D O I
10.3788/LOP202259.1610011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In deep learning-based point-cloud semantic classification, PointNet considers the three-dimensional coordinates of the point cloud as a direct input, however, the classification of irregular shape objects is a challenge. In this study, we propose a semantic segmentation network considering the normals of point cloud by adding a normal estimation module on PointNet. We estimate the normals using a principal component analysis method. Compared with the original model, the overall accuracy, mean per-class accuracy, and mean per-class intersection-over-union of the improved model are improved by 2. 3 percentage points, 7. 1 percentage points, and 3. 9 percentage points respectively. Among the 13 semantic classes, the classification accuracy for 10 classes is improved, of which the classification accuracy of sofa and column is improved by 45. 6 percentage points and 42. 2 percentage points, respectively, and the mean per-class intersection-over-union is improved by 19. 8 percentage points and 25. 0 percentage points, respectively. Results show that the semantic segmentation network considering normals can improve the overall performance of the network to a certain extent and can significantly improve the classification effect of sofa and column.
引用
收藏
页数:8
相关论文
共 39 条
  • [1] 3D Semantic Parsing of Large-Scale Indoor Spaces
    Armeni, Iro
    Sener, Ozan
    Zamir, Amir R.
    Jiang, Helen
    Brilakis, Ioannis
    Fischer, Martin
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1534 - 1543
  • [2] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [3] Chen XJ., 2015, CHIN J LASERS, V42, P336
  • [4] Gross M., 2007, POINT BASED GRAPHICS
  • [5] Deep Learning for 3D Point Clouds: A Survey
    Guo, Yulan
    Wang, Hanyun
    Hu, Qingyong
    Liu, Hao
    Liu, Li
    Bennamoun, Mohammed
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) : 4338 - 4364
  • [6] HOPPE H, 1992, COMP GRAPH, V26, P71, DOI 10.1145/142920.134011
  • [7] Momenet: Flavor the Moments in Learning to Classify Shapes
    Joseph-Rivlin, Mor
    Zvirin, Alon
    Kimmel, Ron
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4085 - 4094
  • [8] PointGrid: A Deep Network for 3D Shape Understanding
    Le, Truc
    Duan, Ye
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9204 - 9214
  • [9] Lei XD, 2021, CHIN J LASERS, V48, DOI [10.3788/CJL202148.1610001, 10.3788/CJL202118.1610001]
  • [10] Li Bao, 2010, Computer Engineering and Applications, V46, P1, DOI 10.3778/j.issn.1002-8331.2010.23.001