Regional feature fusion for on-road detection of objects using camera and 3D-LiDAR in high-speed autonomous vehicles

Cited by: 75
Authors
Wu, Qingyu [1]
Li, Xiaoxiao [1]
Wang, Kang [2]
Bilal, Hazrat [3]
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] China Mobile Zhejiang Innovat Res Co Ltd, Hangzhou 310030, Zhejiang, Peoples R China
[3] Univ Sci & Technol China, Dept Automat, Hefei 2300271, Peoples R China
Keywords
Autonomous vehicle; Object detection; 3D LIDAR; CNN; Feature extraction; Regional features;
DOI
10.1007/s00500-023-09278-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Autonomous vehicles require accurate and fast perception systems to understand the driving environment. 2D object detection is critical to this understanding, but it lacks depth information, which is crucial for understanding the driving environment. Therefore, 3D object detection is essential for the perception system of autonomous vehicles to predict the location of objects. 3D object detection in turn faces challenges from scale changes and occlusions. This study therefore presents a novel object detection method that fuses the complementary information of 2D and 3D object detection to accurately detect objects for autonomous vehicles. First, the 3D-LiDAR data are projected into image space. Second, a region proposal network (RPN) is used to produce regions of interest (ROIs). An ROI pooling network maps each ROI onto the ResNet50 feature extractor to obtain a fixed-size feature map. To accurately predict the dimensions of all objects, the features of the 3D-LiDAR are fused with the regional features obtained from camera images. The fused features are fed to a faster region-based convolutional neural network (Faster-RCNN) for object detection. Evaluation on the KITTI object detection dataset shows that the method detects cars, vans, trucks, pedestrians and cyclists with average precisions of 94.59%, 82.50%, 79.60%, 85.31% and 86.33%, respectively, which is better than most previous methods. Moreover, the average processing time of the proposed method is only 70 ms, which meets the real-time demand of autonomous vehicles. Additionally, the proposed model runs at 15.8 frames per second (FPS), which is faster than state-of-the-art fusion methods for 3D-LiDAR and camera.
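The first step named in the abstract, projecting 3D-LiDAR data into image space, can be illustrated with a minimal sketch. The abstract does not give the calibration procedure, so everything here is an assumption: the points are taken to be already expressed in the camera frame, the 3x4 matrix `PROJ` is a made-up pinhole projection (focal length 700 px, principal point (600, 180)), and the helper names `project_points` and `in_image` are hypothetical. On KITTI, a full projection would typically also chain the LiDAR-to-camera and rectification transforms before this step.

```python
# Minimal sketch of projecting 3D points into image space.
# Assumption: points are already in the camera frame (z points forward).
# PROJ is an illustrative pinhole matrix, not a value from the paper.

PROJ = [
    [700.0, 0.0, 600.0, 0.0],
    [0.0, 700.0, 180.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]


def project_points(points, proj):
    """Return (u, v, depth) for every point in front of the camera.

    points: iterable of (x, y, z) tuples in metres.
    proj:   3x4 projection matrix given as three rows of four floats.
    """
    pixels = []
    for x, y, z in points:
        hom = (x, y, z, 1.0)  # homogeneous coordinates
        u_h, v_h, w = (sum(r * h for r, h in zip(row, hom)) for row in proj)
        if w <= 0:  # behind the image plane, so not visible
            continue
        pixels.append((u_h / w, v_h / w, w))
    return pixels


def in_image(pixels, width, height):
    """Keep only projections that fall inside the image bounds."""
    return [(u, v, d) for u, v, d in pixels if 0 <= u < width and 0 <= v < height]


if __name__ == "__main__":
    cloud = [(1.0, 0.5, 10.0), (0.0, 0.0, -5.0), (20.0, 0.0, 10.0)]
    visible = in_image(project_points(cloud, PROJ), width=1242, height=375)
    print(visible)
```

Of the three sample points, only the first survives: the second lies behind the camera and the third projects outside the assumed 1242 x 375 image. The retained depth value is what a fusion stage could attach to image regions.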
Pages: 18195 - 18213
Number of pages: 19
Related papers
50 in total
  • [31] Real-time 3D-LiDAR object detection in autonomous vehicle systems using cluster-based candidates and deep learning
    Kim M.-G.
    Bae S.-H.
    Kim H.
    Journal of Institute of Control, Robotics and Systems, 2019, 25 (09) : 795 - 801
  • [32] Sem-Aug: Improving Camera-LiDAR Feature Fusion With Semantic Augmentation for 3D Vehicle Detection
    Zhao, Lin
    Wang, Meiling
    Yue, Yufeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 9358 - 9365
  • [33] CL3D: Camera-LiDAR 3D Object Detection With Point Feature Enhancement and Point-Guided Fusion
    Lin, Chunmian
    Tian, Daxin
    Duan, Xuting
    Zhou, Jianshan
    Zhao, Dezong
    Cao, Dongpu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18040 - 18050
  • [34] SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection
    Zhang, Hongcheng
    Liang, Liu
    Zeng, Pengxin
    Song, Xiao
    Wang, Zhe
    COMPUTER VISION-ECCV 2024, PT XXXV, 2025, 15093 : 109 - 128
  • [35] Fast and Accurate 3D Object Detection for Lidar-Camera-Based Autonomous Vehicles Using One Shared Voxel-Based Backbone
    Wen, Li-Hua
    Jo, Kang-Hyun
    IEEE ACCESS, 2021, 9 : 22080 - 22089
  • [36] High-speed structured light based 3D scanning using an event camera
    Huang, Xueyan
    Zhang, Yueyi
    Xiong, Zhiwei
    OPTICS EXPRESS, 2021, 29 (22) : 35864 - 35876
  • [37] High-speed extraction of 3D structure of selectable quality using a translating camera
    Dalmia, AK
    Trivedi, M
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1996, 64 (01) : 97 - 110
  • [38] Evaluation of 3D Vulnerable Objects' Detection Using a Multi-Sensors System for Autonomous Vehicles
    Khatab, Esraa
    Onsy, Ahmed
    Abouelfarag, Ahmed
    SENSORS, 2022, 22 (04)
  • [39] ROS2 Implementation of Object Detection and Distance Estimation using Camera and 2D LiDAR Fusion in Autonomous Vehicle
    Hwang, Gyu Hyeon
    Lee, Si Woo
    Jeon, JaeWook
    2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024
  • [40] Efficient Extrinsic Calibration of Multi-Sensor 3D LiDAR Systems for Autonomous Vehicles using Static Objects Information
    Ponton, Brahayam
    Ferri, Magda
    Konig, Lars
    Bartels, Marcus
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022 : 6285 - 6292