BRTPillar: boosting real-time 3D object detection based point cloud and RGB image fusion in autonomous driving

被引:0
|
作者
Zhang, Zhitian [1 ]
Zhao, Hongdong [1 ]
Zhao, Yazhou [1 ]
Chen, Dan [1 ]
Zhang, Ke [1 ]
Li, Yanqi [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin, Peoples R China
关键词
Autonomous driving; Multimodal; 3D object detection; Attention mechanism;
D O I
10.1108/IJICC-07-2024-0328
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
PurposeIn autonomous driving, the inherent sparsity of point clouds often limits the performance of object detection, while existing multimodal architectures struggle to meet the real-time requirements for 3D object detection. Therefore, the main purpose of this paper is to significantly enhance the detection performance of objects, especially the recognition capability for small-sized objects and to address the issue of slow inference speed. This will improve the safety of autonomous driving systems and provide feasibility for devices with limited computing power to achieve autonomous driving.Design/methodology/approachBRTPillar first adopts an element-based method to fuse image and point cloud features. Secondly, a local-global feature interaction method based on an efficient additive attention mechanism was designed to extract multi-scale contextual information. Finally, an enhanced multi-scale feature fusion method was proposed by introducing adaptive spatial and channel interaction attention mechanisms, thereby improving the learning of fine-grained features.FindingsExtensive experiments were conducted on the KITTI dataset. The results showed that compared with the benchmark model, the accuracy of cars, pedestrians and cyclists on the 3D object box improved by 3.05, 9.01 and 22.65%, respectively; the accuracy in the bird's-eye view has increased by 2.98, 10.77 and 21.14%, respectively. Meanwhile, the running speed of BRTPillar can reach 40.27 Hz, meeting the real-time detection needs of autonomous driving.Originality/valueThis paper proposes a boosting multimodal real-time 3D object detection method called BRTPillar, which achieves accurate location in many scenarios, especially for complex scenes with many small objects, while also achieving real-time inference speed.
引用
收藏
页码:217 / 235
页数:19
相关论文
共 50 条
  • [31] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
    Zhang, Xinyu
    Wang, Li
    Zhang, Guoxin
    Lan, Tianwei
    Zhang, Haoming
    Zhao, Lijun
    Li, Jun
    Zhu, Lei
    Liu, Huaping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [32] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
    Zhang, Xinyu
    Wang, Li
    Zhang, Guoxin
    Lan, Tianwei
    Zhang, Haoming
    Zhao, Lijun
    Li, Jun
    Zhu, Lei
    Liu, Huaping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [33] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245
  • [34] Real-time Object detection and Classification for Autonomous Driving
    Naghavi, Seyyed Hamed
    Pourreza, Hamidreza
    2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2018, : 274 - 279
  • [35] Adaptive Feature Fusion Based Cooperative 3D Object Detection for Autonomous Driving
    Wang, Junyong
    Zeng, Yuan
    Gong, Yi
    2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 103 - 107
  • [36] Boosting Sparse Point Cloud Object Detection via Image Fusion
    Shi, Weijing
    Rajkumar, Ragunathan
    2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 214 - 220
  • [37] Deep Learning Based, Real-Time Object Detection for Autonomous Driving
    Akyol, Gamze
    Kantarci, Alperen
    Celik, Ali Eren
    Ak, Abdullah Cihan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [38] MSL3D: 3D object detection from monocular, stereo and point cloud for autonomous driving
    Chen, Wenyu
    Li, Peixuan
    Zhao, Huaici
    NEUROCOMPUTING, 2022, 494 : 23 - 32
  • [39] 3D Object Detection Based on Extremely Sparse Laser Point Cloud and RGB Images
    Qin Chao
    Wang Yafei
    Zhang Yuchao
    Yin Chengliang
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [40] 3D object detection based on synthetic RGB image
    Xu C.
    Li Z.
    Jiang D.
    Yun J.
    Liu Y.
    Liu Y.
    Bai D.
    Ying S.
    International Journal of Wireless and Mobile Computing, 2021, 20 (01): : 70 - 76