BRTPillar: boosting real-time 3D object detection based point cloud and RGB image fusion in autonomous driving

被引：0

作者：

Zhang, Zhitian ^{[1
]}

Zhao, Hongdong ^{[1
]}

Zhao, Yazhou ^{[1
]}

Chen, Dan ^{[1
]}

Zhang, Ke ^{[1
]}

Li, Yanqi ^{[1
]}

机构：

[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS | 2025年 / 18卷 / 01期

关键词：

Autonomous driving; Multimodal; 3D object detection; Attention mechanism;

D O I：

10.1108/IJICC-07-2024-0328

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

PurposeIn autonomous driving, the inherent sparsity of point clouds often limits the performance of object detection, while existing multimodal architectures struggle to meet the real-time requirements for 3D object detection. Therefore, the main purpose of this paper is to significantly enhance the detection performance of objects, especially the recognition capability for small-sized objects and to address the issue of slow inference speed. This will improve the safety of autonomous driving systems and provide feasibility for devices with limited computing power to achieve autonomous driving.Design/methodology/approachBRTPillar first adopts an element-based method to fuse image and point cloud features. Secondly, a local-global feature interaction method based on an efficient additive attention mechanism was designed to extract multi-scale contextual information. Finally, an enhanced multi-scale feature fusion method was proposed by introducing adaptive spatial and channel interaction attention mechanisms, thereby improving the learning of fine-grained features.FindingsExtensive experiments were conducted on the KITTI dataset. The results showed that compared with the benchmark model, the accuracy of cars, pedestrians and cyclists on the 3D object box improved by 3.05, 9.01 and 22.65%, respectively; the accuracy in the bird's-eye view has increased by 2.98, 10.77 and 21.14%, respectively. Meanwhile, the running speed of BRTPillar can reach 40.27 Hz, meeting the real-time detection needs of autonomous driving.Originality/valueThis paper proposes a boosting multimodal real-time 3D object detection method called BRTPillar, which achieves accurate location in many scenarios, especially for complex scenes with many small objects, while also achieving real-time inference speed.

引用

页码：217 / 235

页数：19

共 50 条

[21] Adversarial point cloud perturbations against 3D object detection in autonomous driving systems
Wang, Xupeng
Cai, Mumuxin
Sohel, Ferdous
Sang, Nan
Chang, Zhengwei
NEUROCOMPUTING, 2021, 466 : 27 - 36
[22] 3D object detection based on point cloud in automatic driving scene
Hai-Sheng Li
Yan-Ling Lu
Multimedia Tools and Applications, 2024, 83 : 13029 - 13044
[23] 3D object detection based on point cloud in automatic driving scene
Li, Hai-Sheng
Lu, Yan-Ling
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 13029 - 13044
[24] RangeLVDet: Boosting 3D Object Detection in LIDAR With Range Image and RGB Image
Zhang, Zehan
Liang, Zhidong
Zhang, Ming
Zhao, Xian
Li, Hao
Yang, Ming
Tan, Wenming
Pu, Shiliang
IEEE SENSORS JOURNAL, 2022, 22 (02) : 1391 - 1403
[25] Real-Time 3D Object Detection and Classification in Autonomous Driving Environment Using 3D LiDAR and Camera Sensors
Arikumar, K. S.
Kumar, A. Deepak
Gadekallu, Thippa Reddy
Prathiba, Sahaya Beni
Tamilarasi, K.
ELECTRONICS, 2022, 11 (24)
[26] Real-Time LiDAR Point-Cloud Moving Object Segmentation for Autonomous Driving
Xie, Xing
Wei, Haowen
Yang, Yongjie
SENSORS, 2023, 23 (01)
[27] Point-Level Fusion and Channel Attention for 3D Object Detection in Autonomous Driving
Shen, Juntao
Fang, Zheng
Huang, Jin
SENSORS, 2025, 25 (04)
[28] Real Pseudo-Lidar Point Cloud Fusion for 3D Object Detection
Fan, Xiangsuo
Xiao, Dachuan
Cai, Dengsheng
Ding, Wentao
ELECTRONICS, 2023, 12 (18)
[29] Object defect detection based on data fusion of a 3D point cloud and 2D image
Zhang, Wanning
Zhou, Fuqiang
Liu, Yang
Sun, Pengfei
Chen, Yuanze
Wang, Lin
MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (02)
[30] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
Zhang, Xinyu
Wang, Li
Zhang, Guoxin
Lan, Tianwei
Zhang, Haoming
Zhao, Lijun
Li, Jun
Zhu, Lei
Liu, Huaping
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72

← 1 2 3 4 5 →