Multiview Fusion Driven 3-D Point Cloud Semantic Segmentation Based on Hierarchical Transformer

Cited by: 3
Authors
Xu, Wang [1]
Li, Xu [1]
Ni, Peizhou [1]
Guang, Xingxing [2, 3]
Luo, Hang [2, 3]
Zhao, Xijun [2, 3]
Affiliations
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
[2] China North Artificial Intelligence & Innovat Res, Beijing 100072, Peoples R China
[3] Collective Intelligence & Collaborat Lab CIC, Beijing 100072, Peoples R China
Keywords
3-D point cloud; multihead attention; multiview fusion; semantic segmentation
DOI
10.1109/JSEN.2023.3328603
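Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809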
Abstract
Three-dimensional semantic segmentation is a key task for environment understanding in outdoor scenes. Because point clouds are sparse and vary in density, fine-grained segmentation results are hard to obtain. Previous point-based and voxel-based methods suffer from high computational cost. Recent 2-D projection-based methods, including range-view (RV), bird's-eye-view (BEV), and multiview fusion methods, can run in real time, but the information lost during projection lowers their accuracy. We also find that occlusion and interlacing problems exist in single-projection methods, and that most multiview fusion networks focus only on output-level fusion. To address these issues, we propose a multilevel multiview fusion network using attention modules and a hierarchical transformer, which ensures effectiveness and efficiency mainly through the following three aspects: 1) the spatial-channel attention module (SCAM) integrates contextual information between points and learns the differences among each channel's features; 2) the proposed geometry-based multiprojection fusion module (GMFM) achieves geometric feature alignment between RV and BEV and fuses the features of the two views at both the feature level and the output level; and 3) we introduce KPConv to replace KNN, which reduces the information loss during postprocessing. Experiments are conducted on both structured and unstructured datasets, including the urban dataset SemanticKITTI and the off-road dataset Rellis3D. Our results outperform other projection-based methods and are comparable with the state-of-the-art Cylinder3D.
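The abstract refers to range-view (RV) and bird's-eye-view (BEV) projections without giving formulas. As a rough illustration only, the sketch below shows one common way to map LiDAR points to pixel indices in each view: a spherical projection for RV and a flat grid for BEV. It is not the authors' implementation. The function names, image size, vertical field of view, grid extent, and cell size are all assumptions chosen for the example.

```python
import numpy as np


def range_view_indices(points, height=64, width=2048,
                       fov_up_deg=3.0, fov_down_deg=-25.0):
    """Map (x, y, z) points to (row, col) pixels of a range image.

    The image size and field-of-view values are assumed defaults,
    not values taken from the paper.
    """
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    depth = np.linalg.norm(points[:, :3], axis=1)
    yaw = np.arctan2(y, x)                           # horizontal angle
    pitch = np.arcsin(z / np.maximum(depth, 1e-8))   # vertical angle

    fov_up = np.radians(fov_up_deg)
    fov_down = np.radians(fov_down_deg)
    fov = fov_up - fov_down

    # Normalize the angles to [0, 1], then scale to the image size.
    u = 0.5 * (1.0 - yaw / np.pi) * width
    v = (1.0 - (pitch - fov_down) / fov) * height

    col = np.clip(np.floor(u), 0, width - 1).astype(np.int64)
    row = np.clip(np.floor(v), 0, height - 1).astype(np.int64)
    return row, col, depth


def bev_indices(points, x_range=(-50.0, 50.0), y_range=(-50.0, 50.0),
                cell=0.2):
    """Map points to cells of a top-down grid (assumed extent and cell size)."""
    nx = int(round((x_range[1] - x_range[0]) / cell))
    ny = int(round((y_range[1] - y_range[0]) / cell))
    ix = np.floor((points[:, 0] - x_range[0]) / cell).astype(np.int64)
    iy = np.floor((points[:, 1] - y_range[0]) / cell).astype(np.int64)
    # Points outside the grid are flagged rather than clipped.
    inside = (ix >= 0) & (ix < nx) & (iy >= 0) & (iy < ny)
    return ix, iy, inside
```

Several points can fall into the same pixel or cell in either view, which is one source of the projection information loss the abstract mentions. Because both index sets are computed from the same input points, they also give a per-point correspondence between the two views.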
Pages: 31461-31470
Page count: 10