Spherical Transformer for LiDAR-based 3D Recognition

被引:24
|
作者
Lai, Xin [1 ]
Chen, Yukang [1 ]
Lu, Fanbin [1 ]
Liu, Jianhui [2 ]
Jia, Jiaya [1 ,3 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Univ Hong Kong, Hong Kong, Peoples R China
[3] SmartMore, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01683
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
LiDAR-based 3D point cloud recognition has benefited various applications. Without specially considering the LiDAR point distribution, most current methods suffer from information disconnection and limited receptive field, especially for the sparse distant points. In this work, we study the varying-sparsity distribution of LiDAR points and present SphereFormer to directly aggregate information from dense close points to the sparse distant ones. We design radial window self-attention that partitions the space into multiple non-overlapping narrow and long windows. It overcomes the disconnection issue and enlarges the receptive field smoothly and dramatically, which significantly boosts the performance of sparse distant points. Moreover, to fit the narrow and long windows, we propose exponential splitting to yield fine-grained position encoding and dynamic feature selection to increase model representation ability. Notably, our method ranks 1st on both nuScenes and SemanticKITTI semantic segmentation benchmarks with 81.9% and 74.8% mIoU, respectively. Also, we achieve the 3rd place on nuScenes object detection benchmark with 72.8% NDS and 68.5% mAP. Code is available at https://github.com/dvlab-research/SphereFormer.git.
引用
收藏
页码:17545 / 17555
页数:11
相关论文
共 50 条
  • [1] KPTr: Key point transformer for LiDAR-based 3D object detection
    Fan, Likang (BITfanlikang@163.com), 2025, 242
  • [2] A New 3D LIDAR-based Lane Markings Recognition Approach
    Tan Li
    Deng Zhidong
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 2197 - 2202
  • [3] 3D LIDAR-based Ground Segmentation
    Chen Tongtong
    Dai Bin
    Liu Daxue
    Zhang Bo
    Liu Qixu
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 446 - 450
  • [4] LiDAR-Based 3D SLAM for Indoor Mapping
    Teng Hooi Chan
    Hesse, Henrik
    Song Guang Ho
    2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021, : 285 - 289
  • [5] Visual and LiDAR-based for The Mobile 3D Mapping
    Wu, Qiao
    Sun, Kai
    Zhang, Wenjun
    Huang, Chaobing
    Wu, Xiaochun
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2016, : 1522 - 1527
  • [6] Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving
    Yuan, Zhenxun
    Song, Xiao
    Bai, Lei
    Wang, Zhe
    Ouyang, Wanli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2068 - 2078
  • [7] Reinforcing LiDAR-Based 3D Object Detection with RGB and 3D Information
    Liu, Wenjian
    Zhou, Yue
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 199 - 209
  • [8] Comparison of camera-based and 3D LiDAR-based place recognition across weather conditions
    Zywanowski, Kamil
    Banaszczyk, Adam
    Nowicki, Michal R.
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 886 - 891
  • [9] LiDAR-Based Symmetrical Guidance for 3D Object Detection
    Chu, Huazhen
    Ma, Huimin
    Liu, Haizhuang
    Wang, Rongquan
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 472 - 483
  • [10] Liborg: a lidar-based Robot for Efficient 3D Mapping
    Vlaminck, Michiel
    Luong, Hiep
    Philips, Wilfried
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XL, 2017, 10396