LiDAR 3D Object Detection Based on Improved PointRCNN

被引:0
|
作者
Gao, Han [1 ]
Chen, Ying [1 ]
Ni, Lizheng [1 ]
Deng, Xiuhan [1 ]
Zhong, Kai [1 ]
Yan, Chengzhi [1 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
关键词
three-dimensional object detection; point cloud; PointRCNN; detection for small object; self-attention mechanism;
D O I
10.3788/LOP232672
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To solve the problems of high misdetection rates and the low detection precision of far and small objects with current three-dimensional (3D) object detection algorithms, an improved 3D object detection algorithm based on PointRCNN is proposed. The improved algorithm adopts the spatial autocorrelation algorithm in the preprocessing stage to reduce the dimension of data, effectively removes irrelevant and noisy points, and optimizes the network's ability to extract features and identify key objects. This study also proposes a module called MGSA-PointNet to improve the point cloud encoding network of PointRCNN. The module takes advantage of the manifold self- attention mechanism to extract spatial information in the original point cloud more accurately. It incorporates the grouping self- attention mechanism to reduce the parameter counts in the self- attention weight coding layer while improving the efficiency and generalization ability of the model and enhancing the feature extraction ability of the network. Compared with PointRCNN on the KITTI dataset, the proposed algorithm enhances the accuracy of the 3D detection of cars and cyclists in complex scenes by 2. 10 percentage points and 2. 14 percentage points, respectively, and improves the average accuracy of 3D pedestrian detection by 5.21 percentage points, thus proving the effectiveness of the algorithm.
引用
收藏
页数:11
相关论文
共 20 条
  • [1] A Survey on 3D Object Detection Methods for Autonomous Driving Applications
    Arnold, Eduardo
    Al-Jarrah, Omar Y.
    Dianati, Mehrdad
    Fallah, Saber
    Oxtoby, David
    Mouzakitis, Alex
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3782 - 3795
  • [2] Chen D J, 2023, Laser & Optoelectronics Progress
  • [3] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [4] 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks
    Graham, Benjamin
    Engelcke, Martin
    van der Maaten, Laurens
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9224 - 9232
  • [5] 3D Object Detection Based on Deep Semantics and Position Information Fusion of Laser Point Cloud
    Hu Jie
    An Yongpeng
    Xu Wencai
    Xiong Zongquan
    Liu Han
    [J]. CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2023, 50 (10):
  • [6] PointPillars: Fast Encoders for Object Detection from Point Clouds
    Lang, Alex H.
    Vora, Sourabh
    Caesar, Holger
    Zhou, Lubing
    Yang, Jiong
    Beijbom, Oscar
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12689 - 12697
  • [7] Lidar for Autonomous Driving: The Principles, Challenges, and Trends for Automotive Lidar and Perception Systems
    Li, You
    Ibanez-Guzman, Javier
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (04) : 50 - 61
  • [8] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [9] Parmar N, 2018, Arxiv, DOI [arXiv:1802.05751, 10.48550/arXiv.1802.05751, DOI 10.48550/ARXIV.1802.05751]
  • [10] Frustum PointNets for 3D Object Detection from RGB-D Data
    Qi, Charles R.
    Liu, Wei
    Wu, Chenxia
    Su, Hao
    Guibas, Leonidas J.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 918 - 927