Regional feature fusion for on-road detection of objects using camera and 3D-LiDAR in high-speed autonomous vehicles

被引:75
|
作者
Wu, Qingyu [1 ]
Li, Xiaoxiao [1 ]
Wang, Kang [2 ]
Bilal, Hazrat [3 ]
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] China Mobile Zhejiang Innovat Res Co Ltd, Hangzhou 310030, Zhejiang, Peoples R China
[3] Univ Sci & Technol China, Dept Automat, Hefei 2300271, Peoples R China
关键词
Autonomous vehicle; Object detection; 3D LIDAR; CNN; Feature extraction; Regional features;
D O I
10.1007/s00500-023-09278-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous vehicles require accurate, and fast decision-making perception systems to know the driving environment. The 2D object detection is critical in allowing the perception system to know the environment. However, 2D object detection lacks depth information, which are crucial for understanding the driving environment. Therefore, 3D object detection is essential for the perception system of autonomous vehicles to predict the location of objects and understand the driving environment. The 3D object detection also faces challenges because of scale changes, and occlusions. Therefore in this study, a novel object detection method is presented that fuses the complementary information of 2D and 3D object detection to accurately detect objects in autonomous vehicles. Firstly, the aim is to project the 3D-LiDAR data into image space. Secondly, the regional proposal network (RPN) to produce a region of interest (ROI) is utilised. The ROI pooling network is used to map the ROI into ResNet50 feature extractor to get a feature map of fixed size. To accurately predict the dimensions of all the objects, we fuse the features of the 3D-LiDAR with the regional features obtained from camera images. The fused features from 3D-LiDAR and camera images are employed as input to the faster-region based convolution neural network (Faster-RCNN) network for the detection of objects. The assessment results on the KITTI object detection dataset reveal that the method can accurately predict car, van, truck, pedestrian and cyclist with an average precision of 94.59%, 82.50%, 79.60%, 85.31%, 86.33%, respectively, which is better than most of the previous methods. Moreover, the average processing time of the proposed method is only 70 ms which meets the real-time demand of autonomous vehicles. Additionally, the proposed model runs at 15.8 frames per second (FPS), which is faster than state-of-the-art fusion methods for 3D-LiDAR and camera.
引用
收藏
页码:18195 / 18213
页数:19
相关论文
共 50 条
  • [21] Drivable Area Segmentation in Deteriorating Road Regions for Autonomous Vehicles using 3D LiDAR Sensor
    Ali, Abdelrahman
    Gergis, Mark
    Abdennadher, Slim
    El Mougy, Amr
    2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 845 - 851
  • [22] 3-D Objects Detection and Tracking Using Solid-State LiDAR and RGB Camera
    Peng, Zheng
    Xiong, Zhi
    Zhao, Yao
    Zhang, Ling
    IEEE SENSORS JOURNAL, 2023, 23 (13) : 14795 - 14808
  • [23] 3D Detection and Tracking for On-road Vehicles with a Monovision Camera and Dual Low-cost 4D mmWave Radars
    Cui, Hang
    Wu, Junzhe
    Zhang, Jiaming
    Chowdhary, Girish
    Norris, William R.
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2931 - 2937
  • [24] Dynamic Detection Technology for Moving Objects Using 3D LiDAR Information and RGB Camera
    Wang, Shi-Chiuan
    Fan, Yu-Cheng
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2017,
  • [25] 3-D LiDAR + Monocular Camera: An Inverse-Depth-Induced Fusion Framework for Urban Road Detection
    Gu, Shuo
    Lu, Tao
    Zhang, Yigong
    Alvarez, Jose M.
    Yang, Jian
    Kong, Hui
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2018, 3 (03): : 351 - 360
  • [26] Channelwise and Spatially Guided Multimodal Feature Fusion Network for 3-D Object Detection in Autonomous Vehicles
    Uzair, Muhammad
    Dong, Jian
    Shi, Ronghua
    Mushtaq, Husnain
    Ullah, Irshad
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [27] HIGH QUALITY RECONSTRUCTION OF DYNAMIC OBJECTS USING 2D-3D CAMERA FUSION
    Jiang, Cansen
    Christie, Dennis
    Paudel, Danda Pani
    Demonceaux, Cedric
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2209 - 2213
  • [28] Nonlinear Model Predictive Planning and Control for High-Speed Autonomous Vehicles on 3D Terrains
    Yu, Siyuan
    Shen, Congkai
    Ersal, Tulga
    IFAC PAPERSONLINE, 2021, 54 (20): : 412 - 417
  • [29] Road Curb Detection using 3D Lidar and Integral Laser Points for Intelligent Vehicles
    Yao, Wentao
    Deng, Zhidong
    Zhou, Lipu
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 100 - 105
  • [30] FS-Net: LiDAR-Camera Fusion With Matched Scale for 3D Object Detection in Autonomous Driving
    Zhang, Lei
    Li, Xu
    Tang, Kaichen
    Jiang, Yunzhe
    Yang, Liu
    Zhang, Yonggang
    Chen, Xianyi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12154 - 12165