A transformer-based real-time LiDAR semantic segmentation method for restricted mobile devices

被引:2
|
作者
Liu, Chang [1 ]
Zhao, Jin [1 ]
Sun, Nianyi [1 ]
机构
[1] Guizhou Univ, Sch Mech Engn, Guiyang 550025, Guizhou, Peoples R China
关键词
Semantic Segmentation;
D O I
10.1016/j.jfranklin.2024.01.033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In scene understanding, LiDAR-based semantic segmentation is a crucial task for describing object boundaries and sizes. Simultaneously, real-time characteristics in autonomous driving heavily rely on 3D information for navigation. With these considerations in mind, we propose a novel LiDAR real-time semantic segmentation method, which involves projecting 3D point clouds into a spherical range image and performing segmentation using 2D convolution. Building upon the success of the Transformer on 2D images, we explore its potential on 3D point clouds. To leverage the advantages of both convolution and Transformer, we introduce the Multi -Head Self -Attention (MHSA) mechanism into LiDAR semantic segmentation, as a means to enhance 2D Convolution. This results in a lightweight model with three key insights: (i) Proposing a parallel semantic segmentation architecture by combining Transformer and convolution; (ii) Innovatively splitting channels to differentiate the Transformer and convolution branches; (iii) The concept of adaptive sliding window is introduced to enhance the relationship of edge dependency when projecting 3D point clouds into range images. We evaluate our method incrementally, both qualitatively and quantitatively, on the SemanticKITTI and SemanticPOSS datasets. The experimental results demonstrate that our proposed method achieves superior performance in 3D semantic segmentation and LiDAR mapping compared to the state-of-the-art.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] EMSFomer: Efficient Multi-Scale Transformer for Real-Time Semantic Segmentation
    Xia, Zhengyu
    Kim, Joohee
    IEEE ACCESS, 2025, 13 : 18239 - 18252
  • [22] A Transformer-based Semantic Segmentation Model for Street Fashion Images
    Peng, Dingjie
    Kameyama, Wataru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [23] FlowFormers: Transformer-based Models for Real-time Network Flow Classification
    Babaria, Rushi
    Madanapalli, Sharat Chandra
    Kumar, Himal
    Sivaraman, Vijay
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 231 - 238
  • [24] A transformer-based real-time earthquake detection framework in heterogeneous environments
    Wu, Aming
    Khan, Irshad
    Kwon, Young-Woo
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [25] Real-Time Semantic Clothing Segmentation
    Cushen, George. A.
    Nixon, Mark. S.
    ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT I, 2012, 7431 : 272 - 281
  • [26] LACTNet: A Lightweight Real-Time Semantic Segmentation Network Based on an Aggregated Convolutional Neural Network and Transformer
    Zhang, Xiangyue
    Li, Hexiao
    Ru, Jingyu
    Ji, Peng
    Wu, Chengdong
    ELECTRONICS, 2024, 13 (12)
  • [27] TransUPR: A Transformer-based Plug-and-Play Uncertain Point Refiner For LiDAR Point Cloud Semantic Segmentation
    Yu, Zifan
    Chen, Meida
    Zhang, Zhikang
    You, Suya
    Rao, Raghuveer
    Agarwal, Sanjeev
    Ren, Fengbo
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5864 - 5869
  • [28] Research on real-time cloth edge extraction method based on ENet semantic segmentation
    Gao, Xiaofei
    Qi Lizhe
    Ma Chuangjia
    Sun, Yunquan
    JOURNAL OF ENGINEERED FIBERS AND FABRICS, 2022, 17
  • [29] Real-time semantic segmentation method for field grapes based on channel feature pyramid
    Sun J.
    Gong D.
    Yao K.
    Lu B.
    Dai C.
    Wu X.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (17): : 150 - 157
  • [30] Parallel segmentation network for real-time semantic segmentation
    Chen, Guanke
    Li, Haibin
    Li, Yaqian
    Zhang, Wenming
    Song, Tao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148