Improved 3D Object Detection Based on PointPillars

被引:0
|
作者
Kong, Weiwei [1 ,2 ,3 ]
Du, Yusheng [1 ,2 ,3 ]
He, Leilei [1 ]
Li, Zejiang [1 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Peoples R China
[2] Shaanxi Key Lab Network Data Anal & Intelligent Pr, Xian 710121, Peoples R China
[3] Xian Key Lab Big Data & Intelligent Comp, Xian 710121, Peoples R China
关键词
3D object detection; attention mechanism; transformer;
D O I
10.3390/electronics13152915
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the recent advancements in 3D object detection, the conventional 3D point cloud object detection algorithms have been found to exhibit limited accuracy for the detection of small objects. To address the challenge of poor detection of small-scale objects, this paper adopts the PointPillars algorithm as the baseline model and proposes a two-stage 3D target detection approach. As a cutting-edge solution, point cloud processing is performed using Transformer models. Additionally, a redefined attention mechanism is introduced to further enhance the detection capabilities of the algorithm. In the first stage, the algorithm uses PointPillars as the baseline model. The central concept of this algorithm is to transform the point cloud space into equal-sized columns. During the feature extraction stage, when the features from all cylinders are transformed into pseudo-images, the proposed algorithm incorporates attention mechanisms adapted from the Squeeze-and-Excitation (SE) method to emphasize and suppress feature information. Furthermore, the 2D convolution of the traditional backbone network is replaced by dynamic convolution. Concurrently, the addition of the attention mechanism further improves the feature representation ability of the network. In the second phase, the candidate frames generated in the first phase are refined using a Transformer-based approach. The proposed algorithm applies channel weighting in the decoder to enhance channel information, leading to improved detection accuracy and reduced false detections. The encoder constructs the initial point features from the candidate frames for encoding. Meanwhile, the decoder applies channel weighting to enhance the channel information, thereby improving the detection accuracy and reducing false detections. In the KITTI dataset, the experimental results verify the effectiveness of this method in small objects detection. Experimental results show that the proposed method significantly improves the detection capability of small objects compared with the baseline PointPillars. In concrete terms, in the moderate difficulty detection category, cars, pedestrians, and cyclists average precision (AP) values increased by 5.30%, 8.1%, and 10.6%, respectively. Moreover, the proposed method surpasses existing mainstream approaches in the cyclist category.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] ExistenceMap-PointPillars: A Multi-Fusion Network for Stable 3D Object Detection with Pseudo 2D Maps
    Hariya, Keigo
    Inoshita, Hiroki
    Yoneda, Keisuke
    Yanase, Ryo
    Ishii, Kota
    Suganuma, Naoki
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [22] Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDAR
    Paigwar, Anshul
    Sierra-Gonzalez, David
    Erkent, Ozgur
    Laugier, Christian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2926 - 2933
  • [23] 3D Object Detection Based on LiDAR Data
    Sahba, Ramin
    Sahba, Amin
    Jamshidi, Mo
    Rad, Paul
    2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
  • [24] 3D Object Detection based on Geometrical Segmentation
    Teng, Zhou
    Xiao, Jing
    2013 INTERNATIONAL CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2013, : 67 - 74
  • [25] Model-based 3D object detection
    Biegelbauer, Georg
    Vincze, Markus
    Wohlkinger, Walter
    MACHINE VISION AND APPLICATIONS, 2010, 21 (04) : 497 - 516
  • [26] 3D velocity filters for improved object detection in automotive applications
    Schauland, Sam
    Velten, Joerg
    Kummert, Anton
    2007 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE, VOLS 1 AND 2, 2007, : 392 - 396
  • [27] IAE-KM3D a 3D Object Detection Method Based on an Improved KM3D Network
    Sun, Yang
    Li, Song
    Wang, Haiyang
    Tian, Bin
    Li, Yi
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [28] Adaptive Scale and Correlative Attention PointPillars: An Efficient Real-Time 3D Point Cloud Object Detection Algorithm
    Zhai, Xinchao
    Gao, Yang
    Chen, Shiwei
    Yang, Jingshuai
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [29] 3D Object Detection with Pointformer
    Pan, Xuran
    Xia, Zhuofan
    Song, Shiji
    Li, Li Erran
    Huang, Gao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7459 - 7468
  • [30] A survey of 3D object detection
    Wei Liang
    Pengfei Xu
    Ling Guo
    Heng Bai
    Yang Zhou
    Feng Chen
    Multimedia Tools and Applications, 2021, 80 : 29617 - 29641