BRTPillar: boosting real-time 3D object detection based point cloud and RGB image fusion in autonomous driving

被引:0
|
作者
Zhang, Zhitian [1 ]
Zhao, Hongdong [1 ]
Zhao, Yazhou [1 ]
Chen, Dan [1 ]
Zhang, Ke [1 ]
Li, Yanqi [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin, Peoples R China
关键词
Autonomous driving; Multimodal; 3D object detection; Attention mechanism;
D O I
10.1108/IJICC-07-2024-0328
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
PurposeIn autonomous driving, the inherent sparsity of point clouds often limits the performance of object detection, while existing multimodal architectures struggle to meet the real-time requirements for 3D object detection. Therefore, the main purpose of this paper is to significantly enhance the detection performance of objects, especially the recognition capability for small-sized objects and to address the issue of slow inference speed. This will improve the safety of autonomous driving systems and provide feasibility for devices with limited computing power to achieve autonomous driving.Design/methodology/approachBRTPillar first adopts an element-based method to fuse image and point cloud features. Secondly, a local-global feature interaction method based on an efficient additive attention mechanism was designed to extract multi-scale contextual information. Finally, an enhanced multi-scale feature fusion method was proposed by introducing adaptive spatial and channel interaction attention mechanisms, thereby improving the learning of fine-grained features.FindingsExtensive experiments were conducted on the KITTI dataset. The results showed that compared with the benchmark model, the accuracy of cars, pedestrians and cyclists on the 3D object box improved by 3.05, 9.01 and 22.65%, respectively; the accuracy in the bird's-eye view has increased by 2.98, 10.77 and 21.14%, respectively. Meanwhile, the running speed of BRTPillar can reach 40.27 Hz, meeting the real-time detection needs of autonomous driving.Originality/valueThis paper proposes a boosting multimodal real-time 3D object detection method called BRTPillar, which achieves accurate location in many scenarios, especially for complex scenes with many small objects, while also achieving real-time inference speed.
引用
收藏
页码:217 / 235
页数:19
相关论文
共 50 条
  • [41] Stereo RGB and Deeper LIDAR-Based Network for 3D Object Detection in Autonomous Driving
    He, Qingdong
    Wang, Zhengning
    Zeng, Hao
    Zeng, Yi
    Liu, Yijun
    Liu, Shuaicheng
    Zeng, Bing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 152 - 162
  • [42] RGB and LiDAR fusion based 3D Semantic Segmentation for Autonomous Driving
    El Madawi, Khaled
    Rashed, Hazem
    El Sallab, Ahmad
    Nasr, Omar
    Kamel, Hanan
    Yogamani, Senthil
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 7 - 12
  • [43] Real-time 3D point cloud registration
    Qian, Jiaming
    Feng, Shijie
    Tao, Tianyang
    Hu, Yan
    Chen, Qian
    Zuo, Chao
    SEVENTH INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING (ICOPEN 2019), 2019, 11205
  • [44] A review of 3D object detection based on autonomous driving
    Wang, Huijuan
    Chen, Xinyue
    Yuan, Quanbo
    Liu, Peng
    VISUAL COMPUTER, 2025, 41 (03): : 1757 - 1775
  • [45] Coarse to fine-based image-point cloud fusion network for 3D object detection
    Hao, Meilan
    Zhang, Zhongkang
    Li, Lei
    Dong, Kejian
    Cheng, Long
    Tiwari, Prayag
    Ning, Xin
    INFORMATION FUSION, 2024, 112
  • [46] PIXOR: Real-time 3D Object Detection from Point Clouds
    Yang, Bin
    Luo, Wenjie
    Urtasun, Raquel
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7652 - 7660
  • [47] Boosting Lidar 3D Object Detection with Point Cloud Semantic Segmentation
    Zhang, Xuchong
    Min, Chong
    Jia, Yijie
    Chen, Liming
    Zhang, Jingmin
    Sun, Hongbin
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7614 - 7621
  • [48] Multiattention Mechanism 3D Object Detection Algorithm Based on RGB and LiDAR Fusion for Intelligent Driving
    Zhang, Xiucai
    He, Lei
    Chen, Junyi
    Wang, Baoyun
    Wang, Yuhai
    Zhou, Yuanle
    SENSORS, 2023, 23 (21)
  • [49] Point Cloud-based Real-Time 3D Object Detection for Predictive Analytics of Safety Incidents in Manufacturing Industry
    Moon, Yeeun
    Lee, Jieun
    Beak, Seunghyo
    Jeong, Jongpil
    2023 29TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE, M2VIP 2023, 2023,
  • [50] Point cloud 3D object detection algorithm based on local information fusion
    Zhang, Linjie
    Chai, Zhilei
    Wang, Ning
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (11): : 2219 - 2229