BRTPillar: boosting real-time 3D object detection based point cloud and RGB image fusion in autonomous driving

Cited by: 0
Authors
Zhang, Zhitian [1]
Zhao, Hongdong [1]
Zhao, Yazhou [1]
Chen, Dan [1]
Zhang, Ke [1]
Li, Yanqi [1]
Affiliation
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin, Peoples R China
Keywords
Autonomous driving; Multimodal; 3D object detection; Attention mechanism
DOI
10.1108/IJICC-07-2024-0328
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Purpose: In autonomous driving, the inherent sparsity of point clouds often limits object detection performance, while existing multimodal architectures struggle to meet the real-time requirements of 3D object detection. This paper therefore aims to significantly improve detection performance, especially for small objects, and to address slow inference speed. Doing so would improve the safety of autonomous driving systems and make autonomous driving feasible on devices with limited computing power.

Design/methodology/approach: BRTPillar first adopts an element-based method to fuse image and point cloud features. Second, a local-global feature interaction method based on an efficient additive attention mechanism is designed to extract multi-scale contextual information. Finally, an enhanced multi-scale feature fusion method is proposed that introduces adaptive spatial and channel interaction attention mechanisms, improving the learning of fine-grained features.

Findings: Extensive experiments were conducted on the KITTI dataset. Compared with the benchmark model, 3D bounding box accuracy for cars, pedestrians and cyclists improved by 3.05%, 9.01% and 22.65%, respectively; bird's-eye-view accuracy improved by 2.98%, 10.77% and 21.14%, respectively. Meanwhile, BRTPillar runs at up to 40.27 Hz, meeting the real-time detection needs of autonomous driving.

Originality/value: This paper proposes BRTPillar, a boosted multimodal real-time 3D object detection method that achieves accurate localization in many scenarios, especially complex scenes with many small objects, while also achieving real-time inference speed.
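As a rough check of the real-time claim, the reported 40.27 Hz can be converted to a per-frame latency and compared against a sensor frame rate. The sketch below is an illustration only and is not part of the paper: the function names are hypothetical, and the 10 Hz LiDAR frame rate is an assumption about the sensor setup, not a figure stated in the abstract.

```python
def frame_latency_ms(rate_hz: float) -> float:
    """Convert a processing rate in Hz to per-frame latency in milliseconds."""
    if rate_hz <= 0:
        raise ValueError("rate_hz must be positive")
    return 1000.0 / rate_hz


def keeps_up(rate_hz: float, sensor_hz: float) -> bool:
    """True if the detector processes frames at least as fast as they arrive."""
    return rate_hz >= sensor_hz


REPORTED_RATE_HZ = 40.27   # running speed reported in the abstract
ASSUMED_SENSOR_HZ = 10.0   # assumption: LiDAR sweep rate, not given in the abstract

latency = frame_latency_ms(REPORTED_RATE_HZ)
budget = frame_latency_ms(ASSUMED_SENSOR_HZ)

print(f"per-frame latency: {latency:.2f} ms")
print(f"assumed frame budget: {budget:.2f} ms")
print(f"keeps up with sensor: {keeps_up(REPORTED_RATE_HZ, ASSUMED_SENSOR_HZ)}")
```

Under that assumed sensor rate, each frame would take roughly a quarter of the available time budget, leaving headroom for other stages of the driving stack.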
Pages: 217-235
Page count: 19