A reversible transformer for LiDAR point cloud semantic segmentation

被引:0
|
作者
Akwensi, Perpertual Hope [1 ]
Wang, Ruisheng [1 ]
机构
[1] Univ Calgary, Dept Geomat Engn, Calgary, AB, Canada
关键词
point clouds; transformers; reversible networks; semantic segmentation; DEEP LEARNING NETWORK;
D O I
10.1109/CRV60082.2023.00011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of transformer networks in the natural language processing and 2D vision domains has encouraged the adaptation of transformers for 3D computer vision tasks. However, majority of the existing approaches employ standard back-propagation (SBP) - which requires the storage of model activations on a forward pass for use during the backward pass - making their memory complexity linearly proportional to model depth, hence inefficient. Furthermore, most 3D point transformers use the classic QK(V) matrix multiplication design which comes with a memory bottleneck. To address these issues, we propose a memory-efficient point transformer that makes use of reversible functions and linearized selfattention to minimize SBP and transformer memory complexities, respectively. Experimental results on benchmark datasets (Toronto3D and CSPC) from different sensor platforms (aerial, and mobile backpack) show that our approach uses less than half the number of model parameters (compared to its SBP counterpart), take more than twice the input sequence, and use less than half the memory compared to majority of the traditional approach. Overall, the proposed RPT attained competitive performance compared to the state-of-the-art.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 50 条
  • [31] Joint Semantic and Instance Segmentation in 3D Point Cloud Based on Transformer
    Liu, Suyi
    Wu, Chengdong
    Xu, Fang
    Wang, Juxiang
    Chi, Jianning
    Yu, Xiaosheng
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4074 - 4080
  • [32] Multilevel Geometric Feature Embedding in Transformer Network for ALS Point Cloud Semantic Segmentation
    Liang, Zhuanxin
    Lai, Xudong
    REMOTE SENSING, 2024, 16 (18)
  • [33] MPT-Net: Mask Point Transformer Network for Large Scale Point Cloud Semantic Segmentation
    Tang, Zhe Jun
    Cham, Tat-Jen
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10611 - 10618
  • [34] Point attention network for point cloud semantic segmentation
    Dayong REN
    Zhengyi WU
    Jiawei LI
    Piaopiao YU
    Jie GUO
    Mingqiang WEI
    Yanwen GUO
    Science China(Information Sciences), 2022, 65 (09) : 99 - 112
  • [35] Point attention network for point cloud semantic segmentation
    Ren, Dayong
    Wu, Zhengyi
    Li, Jiawei
    Yu, Piaopiao
    Guo, Jie
    Wei, Mingqiang
    Guo, Yanwen
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (09)
  • [36] Point attention network for point cloud semantic segmentation
    Dayong Ren
    Zhengyi Wu
    Jiawei Li
    Piaopiao Yu
    Jie Guo
    Mingqiang Wei
    Yanwen Guo
    Science China Information Sciences, 2022, 65
  • [37] DGPolarNet: Dynamic Graph Convolution Network for LiDAR Point Cloud Semantic Segmentation on Polar BEV
    Song, Wei
    Liu, Zhen
    Guo, Ying
    Sun, Su
    Zu, Guidong
    Li, Maozhen
    REMOTE SENSING, 2022, 14 (15)
  • [38] CNN semantic segmentation of airborne LiDAR point cloud considering long-tailed distribution
    Chen R.
    Wu J.
    Zhao X.
    Xu G.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (07): : 282 - 295
  • [39] LiDAR Point Cloud Semantic Segmentation Method Based on Multi-scale Contextual Feature
    Liu, Fuchun
    Chen, Xujian
    Huang, Zewen
    Liu, Zeyong
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 477 - 482
  • [40] Multi-Modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception
    Zhou, Yong
    Xie, Zeming
    Zhao, Jiaqi
    Du, Wenliang
    Yao, Rui
    El Saddik, Abdulmotaleb
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)