Regional feature fusion for on-road detection of objects using camera and 3D-LiDAR in high-speed autonomous vehicles

被引：75

作者：

Wu, Qingyu ^{[1
]}

Li, Xiaoxiao ^{[1
]}

Wang, Kang ^{[2
]}

Bilal, Hazrat ^{[3
]}

机构：

[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[2] China Mobile Zhejiang Innovat Res Co Ltd, Hangzhou 310030, Zhejiang, Peoples R China

[3] Univ Sci & Technol China, Dept Automat, Hefei 2300271, Peoples R China

来源：

SOFT COMPUTING | 2023年 / 27卷 / 23期

关键词：

Autonomous vehicle; Object detection; 3D LIDAR; CNN; Feature extraction; Regional features;

D O I：

10.1007/s00500-023-09278-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Autonomous vehicles require accurate, and fast decision-making perception systems to know the driving environment. The 2D object detection is critical in allowing the perception system to know the environment. However, 2D object detection lacks depth information, which are crucial for understanding the driving environment. Therefore, 3D object detection is essential for the perception system of autonomous vehicles to predict the location of objects and understand the driving environment. The 3D object detection also faces challenges because of scale changes, and occlusions. Therefore in this study, a novel object detection method is presented that fuses the complementary information of 2D and 3D object detection to accurately detect objects in autonomous vehicles. Firstly, the aim is to project the 3D-LiDAR data into image space. Secondly, the regional proposal network (RPN) to produce a region of interest (ROI) is utilised. The ROI pooling network is used to map the ROI into ResNet50 feature extractor to get a feature map of fixed size. To accurately predict the dimensions of all the objects, we fuse the features of the 3D-LiDAR with the regional features obtained from camera images. The fused features from 3D-LiDAR and camera images are employed as input to the faster-region based convolution neural network (Faster-RCNN) network for the detection of objects. The assessment results on the KITTI object detection dataset reveal that the method can accurately predict car, van, truck, pedestrian and cyclist with an average precision of 94.59%, 82.50%, 79.60%, 85.31%, 86.33%, respectively, which is better than most of the previous methods. Moreover, the average processing time of the proposed method is only 70 ms which meets the real-time demand of autonomous vehicles. Additionally, the proposed model runs at 15.8 frames per second (FPS), which is faster than state-of-the-art fusion methods for 3D-LiDAR and camera.

引用

页码：18195 / 18213

页数：19

共 50 条

[21] Drivable Area Segmentation in Deteriorating Road Regions for Autonomous Vehicles using 3D LiDAR Sensor
Ali, Abdelrahman
Gergis, Mark
Abdennadher, Slim
El Mougy, Amr
2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 845 - 851
[22] 3-D Objects Detection and Tracking Using Solid-State LiDAR and RGB Camera
Peng, Zheng
Xiong, Zhi
Zhao, Yao
Zhang, Ling
IEEE SENSORS JOURNAL, 2023, 23 (13) : 14795 - 14808
[23] 3D Detection and Tracking for On-road Vehicles with a Monovision Camera and Dual Low-cost 4D mmWave Radars
Cui, Hang
Wu, Junzhe
Zhang, Jiaming
Chowdhary, Girish
Norris, William R.
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2931 - 2937
[24] Dynamic Detection Technology for Moving Objects Using 3D LiDAR Information and RGB Camera
Wang, Shi-Chiuan
Fan, Yu-Cheng
2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2017,
[25] 3-D LiDAR + Monocular Camera: An Inverse-Depth-Induced Fusion Framework for Urban Road Detection
Gu, Shuo
Lu, Tao
Zhang, Yigong
Alvarez, Jose M.
Yang, Jian
Kong, Hui
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2018, 3 (03): : 351 - 360
[26] Channelwise and Spatially Guided Multimodal Feature Fusion Network for 3-D Object Detection in Autonomous Vehicles
Uzair, Muhammad
Dong, Jian
Shi, Ronghua
Mushtaq, Husnain
Ullah, Irshad
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[27] HIGH QUALITY RECONSTRUCTION OF DYNAMIC OBJECTS USING 2D-3D CAMERA FUSION
Jiang, Cansen
Christie, Dennis
Paudel, Danda Pani
Demonceaux, Cedric
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2209 - 2213
[28] Nonlinear Model Predictive Planning and Control for High-Speed Autonomous Vehicles on 3D Terrains
Yu, Siyuan
Shen, Congkai
Ersal, Tulga
IFAC PAPERSONLINE, 2021, 54 (20): : 412 - 417
[29] Road Curb Detection using 3D Lidar and Integral Laser Points for Intelligent Vehicles
Yao, Wentao
Deng, Zhidong
Zhou, Lipu
6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 100 - 105
[30] FS-Net: LiDAR-Camera Fusion With Matched Scale for 3D Object Detection in Autonomous Driving
Zhang, Lei
Li, Xu
Tang, Kaichen
Jiang, Yunzhe
Yang, Liu
Zhang, Yonggang
Chen, Xianyi
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12154 - 12165

← 1 2 3 4 5 →