Improved 3D Object Detection Based on PointPillars

Cited by: 0

Authors:

Kong, Weiwei [1,2,3]

Du, Yusheng [1,2,3]

He, Leilei [1]

Li, Zejiang [1]

Affiliations:

[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Peoples R China

[2] Shaanxi Key Lab Network Data Anal & Intelligent Pr, Xian 710121, Peoples R China

[3] Xian Key Lab Big Data & Intelligent Comp, Xian 710121, Peoples R China

Source:

ELECTRONICS | 2024 / Vol. 13 / Issue 15

Keywords:

3D object detection; attention mechanism; transformer;

DOI:

10.3390/electronics13152915

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Despite recent advances in 3D object detection, conventional 3D point cloud detection algorithms still exhibit limited accuracy on small objects. To address this, the paper adopts PointPillars as the baseline model and proposes a two-stage 3D object detection approach that uses Transformer models for point cloud processing and introduces a redefined attention mechanism to further enhance detection. In the first stage, the algorithm uses PointPillars, whose central idea is to partition the point cloud space into equal-sized vertical columns (pillars). During feature extraction, when the features from all pillars are transformed into a pseudo-image, the proposed algorithm incorporates an attention mechanism adapted from the Squeeze-and-Excitation (SE) method to emphasize informative features and suppress less useful ones. Furthermore, the 2D convolutions of the traditional backbone network are replaced by dynamic convolutions, and the added attention mechanism further improves the feature representation ability of the network. In the second stage, the candidate boxes generated in the first stage are refined using a Transformer-based approach. The encoder constructs initial point features from the candidate boxes for encoding, while the decoder applies channel weighting to enhance channel information, thereby improving detection accuracy and reducing false detections. Experimental results on the KITTI dataset verify the effectiveness of the method for small-object detection.
Experimental results show that the proposed method significantly improves the detection of small objects compared with the baseline PointPillars. Concretely, at the moderate difficulty level, the average precision (AP) for cars, pedestrians, and cyclists increased by 5.30%, 8.1%, and 10.6%, respectively. Moreover, the proposed method surpasses existing mainstream approaches in the cyclist category.
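
The abstract names two building blocks that a small example may make more concrete: partitioning the point cloud into equal-sized pillars to form a pseudo-image, and Squeeze-and-Excitation (SE) style channel reweighting of that pseudo-image. The following is a minimal pure-Python sketch under stated assumptions, not the authors' implementation. The function names `pillarize` and `se_attention`, the two hand-picked channels (point count and mean height), the grid ranges, and the fixed weights `w1` and `w2` are all hypothetical; in PointPillars the pillar features are learned by a small network, and SE weights are trained rather than set by hand.

```python
import math


def pillarize(points, x_range, y_range, pillar_size):
    """Scatter (x, y, z) points into a bird's-eye-view grid of equal-sized pillars.

    Returns a pseudo-image as [count, mean_z], each a height x width grid.
    These two channels are illustrative stand-ins for learned pillar features.
    """
    width = int(round((x_range[1] - x_range[0]) / pillar_size))
    height = int(round((y_range[1] - y_range[0]) / pillar_size))
    count = [[0.0] * width for _ in range(height)]
    z_sum = [[0.0] * width for _ in range(height)]
    for x, y, z in points:
        col = math.floor((x - x_range[0]) / pillar_size)
        row = math.floor((y - y_range[0]) / pillar_size)
        # Points outside the grid are dropped.
        if 0 <= row < height and 0 <= col < width:
            count[row][col] += 1.0
            z_sum[row][col] += z
    mean_z = [
        [z_sum[r][c] / count[r][c] if count[r][c] else 0.0 for c in range(width)]
        for r in range(height)
    ]
    return [count, mean_z]


def se_attention(feature_map, w1, w2):
    """SE-style channel attention on a [channel][row][col] feature map.

    w1 has shape (C/r, C) and w2 has shape (C, C/r), given as nested lists.
    """
    # Squeeze: global average pooling per channel.
    squeezed = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in feature_map]
    # Excitation: linear layer + ReLU, then linear layer + sigmoid.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    scale = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Reweight: multiply every value in a channel by that channel's scale.
    reweighted = [
        [[value * s for value in line] for line in ch]
        for ch, s in zip(feature_map, scale)
    ]
    return reweighted, scale


# Toy input: three points inside a 4 x 4 grid and one point outside it.
points = [(0.5, 0.5, 1.0), (0.6, 0.4, 3.0), (3.5, 3.5, 2.0), (9.0, 9.0, 0.0)]
pseudo_image = pillarize(points, (0.0, 4.0), (0.0, 4.0), 1.0)

# Hand-set weights for illustration only (C = 2 channels, reduction to 1).
w1 = [[0.5, -0.25]]
w2 = [[1.0], [-1.0]]
reweighted, scale = se_attention(pseudo_image, w1, w2)
```

Each channel is multiplied by a single scalar between 0 and 1, which is how an SE block can emphasize some channels while suppressing others without changing the spatial layout of the pseudo-image.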


Pages: 19

50 entries in total

[21] ExistenceMap-PointPillars: A Multi-Fusion Network for Stable 3D Object Detection with Pseudo 2D Maps
Hariya, Keigo
Inoshita, Hiroki
Yoneda, Keisuke
Yanase, Ryo
Ishii, Kota
Suganuma, Naoki
2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023
[22] Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDAR
Paigwar, Anshul
Sierra-Gonzalez, David
Erkent, Ozgur
Laugier, Christian
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2926 - 2933
[23] 3D Object Detection Based on LiDAR Data
Sahba, Ramin
Sahba, Amin
Jamshidi, Mo
Rad, Paul
2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
[24] 3D Object Detection based on Geometrical Segmentation
Teng, Zhou
Xiao, Jing
2013 INTERNATIONAL CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2013, : 67 - 74
[25] Model-based 3D object detection
Biegelbauer, Georg
Vincze, Markus
Wohlkinger, Walter
MACHINE VISION AND APPLICATIONS, 2010, 21 (04) : 497 - 516
[26] 3D velocity filters for improved object detection in automotive applications
Schauland, Sam
Velten, Joerg
Kummert, Anton
2007 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE, VOLS 1 AND 2, 2007, : 392 - 396
[27] IAE-KM3D a 3D Object Detection Method Based on an Improved KM3D Network
Sun, Yang
Li, Song
Wang, Haiyang
Tian, Bin
Li, Yi
APPLIED SCIENCES-BASEL, 2024, 14 (12)
[28] Adaptive Scale and Correlative Attention PointPillars: An Efficient Real-Time 3D Point Cloud Object Detection Algorithm
Zhai, Xinchao
Gao, Yang
Chen, Shiwei
Yang, Jingshuai
APPLIED SCIENCES-BASEL, 2024, 14 (09)
[29] 3D Object Detection with Pointformer
Pan, Xuran
Xia, Zhuofan
Song, Shiji
Li, Li Erran
Huang, Gao
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7459 - 7468
[30] A survey of 3D object detection
Wei Liang
Pengfei Xu
Ling Guo
Heng Bai
Yang Zhou
Feng Chen
Multimedia Tools and Applications, 2021, 80 : 29617 - 29641
