Multiview Fusion Driven 3-D Point Cloud Semantic Segmentation Based on Hierarchical Transformer

被引:3
|
作者
Xu, Wang [1 ]
Li, Xu [1 ]
Ni, Peizhou [1 ]
Guang, Xingxing [2 ,3 ]
Luo, Hang [2 ,3 ]
Zhao, Xijun [2 ,3 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
[2] China North Artificial Intelligence & Innovat Res, Beijing 100072, Peoples R China
[3] Collective Intelligence & Collaborat Lab CIC, Beijing 100072, Peoples R China
关键词
3-D point cloud; multihead attention; multiview fusion; semantic segmentation;
D O I
10.1109/JSEN.2023.3328603
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Three-dimensional semantic segmentation is a key task of environment understanding in various outdoor scenes. Due to the sparsity and varying density of point clouds, it becomes challenging to obtain fine-gained segmentation results. Previous point-based and voxel-based methods suffer from the expensive computational cost. Recent 2-D projection-based methods, including range-view (RV), bird-eye-view (BEV), and multiview fusion methods, can run in real time, but the information loss during the projection leads to the low accuracy. Also, we find that the occlusion and interlacing problems exist in single projection-based methods and most multiview fusion networks only focus on the output-level fusion. Considering the above issues, we propose a multilevel multiview fusion network using attention modules and hierarchical transformer, which ensures the effectiveness and efficiency mainly by the following three aspects: 1) the spatial-channel attention module (SCAM) integrates contextual information between points and learn differences of each channel's features; 2) the proposed geometry-based multiprojection fusion module (GMFM) achieves the geometric feature alignment between RV and BEV and fuses the features of the two views at both feature level and output level; and 3) we introduce KPConv to replace KNN, which can reduce the information loss during the postprocessing. Experiments are conducted on both structured and unstructured datasets, including urban dataset SemanticKITTI and off-road dataset Rellis3D. Our results achieve a better performance compared to other projection-based methods and are comparable with the state-of-the-art Cylinder3D.
引用
收藏
页码:31461 / 31470
页数:10
相关论文
共 50 条
  • [31] Improved 3D Semantic Segmentation Model Based on RGB Image and LiDAR Point Cloud Fusion for Automantic Driving
    Du, Jiahao
    Huang, Xiaoci
    Xing, Mengyang
    Zhang, Tao
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2023, 24 (03) : 787 - 797
  • [32] FF-Net: Feature-Fusion-Based Network for Semantic Segmentation of 3D Plant Point Cloud
    Guo, Xindong
    Sun, Yu
    Yang, Hua
    PLANTS-BASEL, 2023, 12 (09):
  • [33] 3DGTN: 3-D Dual-Attention GLocal Transformer Network for Point Cloud Classification and Segmentation
    Lu, Dening
    Gao, Kyle
    Xie, Qian
    Xu, Linlin
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [34] Point-Cloud Transformer for 3-D Electrical Impedance Tomography
    Chen, Zhou
    Zhang, Haijing
    Hu, Delin
    Tan, Chao
    Liu, Zhe
    Yang, Yunjie
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [35] Fully Sparse Transformer 3-D Detector for LiDAR Point Cloud
    Zhang, Diankun
    Zheng, Zhijie
    Niu, Haoyu
    Wang, Xueqing
    Liu, Xiaojun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 12
  • [36] SGT-Net: A Transformer-Based Stratified Graph Convolutional Network for 3D Point Cloud Semantic Segmentation
    Liu, Suyi
    Chi, Jianning
    Wu, Chengdong
    Xu, Fang
    Yu, Xiaosheng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4471 - 4489
  • [37] Transformer based 3D tooth segmentation via point cloud region partition
    You Wu
    Hongping Yan
    Kun Ding
    Scientific Reports, 14 (1)
  • [38] SEMANTIC SCENE COMPLETION WITH POINT CLOUD REPRESENTATION AND TRANSFORMER-BASED FEATURE FUSION
    Fu, Ruochong
    Wu, Hang
    Hao, Mengxiang
    Miao, Yubin
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3369 - 3373
  • [39] Semantic Scene Completion with Point Cloud Representation and Transformer-based feature fusion
    Fu, Ruochong
    Wu, Hang
    Hao, Mengxiang
    Miao, Yubin
    Proceedings - International Conference on Image Processing, ICIP, 2023, : 3369 - 3373
  • [40] Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation
    Jiang, Li
    Zhao, Hengshuang
    Liu, Shu
    Shen, Xiaoyong
    Fu, Chi-Wing
    Jia, Jiaya
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10432 - 10440