Learning point cloud context information based on 3D transformer for more accurate and efficient classification

被引:2
|
作者
Chen, Yiping [1 ]
Zhang, Shuai [1 ,3 ]
Lin, Weisheng [2 ]
Zhang, Shuhang [1 ,3 ]
Zhang, Wuming [1 ]
机构
[1] Sun Yat Sen Univ, Sch Geospatial Engn & Sci, Zhuhai, Peoples R China
[2] Xiamen Univ, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China
[3] Sun Yat Sen Univ, Sch Geospatial Engn & Sci, Zhuhai 519082, Peoples R China
来源
PHOTOGRAMMETRIC RECORD | 2023年 / 38卷 / 184期
基金
中国国家自然科学基金;
关键词
classification; context information; point cloud; 3D transformer;
D O I
10.1111/phor.12469
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
The point cloud semantic understanding task has made remarkable progress along with the development of 3D deep learning. However, aggregating spatial information to improve the local feature learning capability of the network remains a major challenge. Many methods have been used for improving local information learning, such as constructing a multi-area structure for capturing different area information. However, it will lose some local information due to the independent learning point feature. To solve this problem, a new network is proposed that considers the importance of the differences between points in the neighbourhood. Capturing local feature information can be enhanced by highlighting the different feature importance of the point cloud in the neighbourhood. First, T-Net is constructed to learn the point cloud transformation matrix for point cloud disorder. Second, transformer is used to improve the problem of local information loss due to the independence of each point in the neighbourhood. The experimental results show that 92.2% accuracy overall was achieved on the ModelNet40 dataset and 93.8% accuracy overall was achieved on the ModelNet10 dataset. The figure shows the pipeline of point cloud classification which is similar to PointNet. T-Net is used to eliminate the effect of point cloud rotation and a 3D transformer module is utilised to learn the point cloud context information. Finally, the MLP is utilised to map to the category dimension. Experiments show that our method is accurate and efficient.image
引用
收藏
页码:603 / 616
页数:14
相关论文
共 50 条
  • [1] Semantic Context Encoding for Accurate 3D Point Cloud Segmentation
    Liu, Hao
    Guo, Yulan
    Ma, Yanni
    Lei, Yinjie
    Wen, Gongjian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2045 - 2055
  • [2] Time and Memory Efficient 3D Point Cloud Classification
    Ullah, Shan
    Qayyum, Usman
    Choudhry, Aadil Jaleel
    [J]. PROCEEDINGS OF 2019 16TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2019, : 521 - 525
  • [3] Deep Learning for 3D Classification Based on Point Cloud with Local Structure
    Song, Yanan
    Li, Xinyu
    Gao, Liang
    [J]. 2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 405 - 409
  • [4] Point-to-Spike Residual Learning for Energy-Efficient 3D Point Cloud Classification
    Wu, Qiaoyun
    Zhang, Quanxiao
    Tan, Chunyu
    Zhou, Yun
    Sun, Changyin
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6092 - 6099
  • [5] Context-Aware Transformer for 3D Point Cloud Automatic Annotation
    Qian, Xiaoyan
    Liu, Chang
    Qi, Xiaojuan
    Tan, Siew-Chong
    Lam, Edmund
    Wong, Ngai
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2082 - 2090
  • [6] Point-voxel dual stream transformer for 3d point cloud learning
    Zhao, Tianmeng
    Zeng, Hui
    Zhang, Baoqing
    Fan, Bin
    Li, Chen
    [J]. VISUAL COMPUTER, 2024, 40 (08): : 5323 - 5339
  • [7] An Effective Encoding Method Based on Local Information for 3D Point Cloud Classification
    Song, Yanan
    Gao, Liang
    Li, Xinyu
    Pan, Quan-Ke
    [J]. IEEE ACCESS, 2019, 7 : 39369 - 39377
  • [8] Group-in-Group Relation-Based Transformer for 3D Point Cloud Learning
    Liu, Shaolei
    Fu, Kexue
    Wang, Manning
    Song, Zhijian
    [J]. REMOTE SENSING, 2022, 14 (07)
  • [9] DEEP LEARNING ON POINT CLOUD FOR 3D CLASSIFICATION BASED ON SPIKING NEURAL NETWORK
    Zhang Silin
    [J]. 2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [10] Explore In-Context Learning for 3D Point Cloud Understanding
    Fang, Zhongbin
    Li, Xiangtai
    Li, Xia
    Buhmann, Joachim M.
    Loy, Chen Change
    Liu, Mengyuan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,