Deep Learning-Based Pedestrian Detection Using RGB Images and Sparse LiDAR Point Clouds

被引:2
|
作者
Xu, Haoran [1 ]
Huang, Shuo [2 ]
Yang, Yixin [1 ]
Chen, Xiaodao [1 ]
Hu, Shiyan [3 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430078, Peoples R China
[2] China FAW Nanjing Technol Dev Co Ltd, Nanjing 211106, Peoples R China
[3] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, England
基金
中国国家自然科学基金;
关键词
Laser radar; Pedestrians; Feature extraction; Point cloud compression; Sensors; Cameras; Data models; Data fusion; pedestrian detection; RGB camera; sparse light detection and ranging (LiDAR); FUSION; NETWORK;
D O I
10.1109/TII.2024.3353845
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the fundamental tasks in autonomous driving is environment perception for pedestrian detection, where the fused pedestrian detection using camera and light detection and ranging (LiDAR) information imposes challenges since the data alignment, compensation, and fusion between different data modes are challenging and the simultaneous acquisition of data from two different modalities also increases the difficulty. This work addresses the above challenges from both of the hardware and software dimensions. First, a multimodal pedestrian data acquisition platform is designed and constructed using an RGB camera, sparse LiDAR, and data processing module including hardware connection and deployment, sensor distortion correction and joint calibration, and data acquisition synchronization. Pedestrian data from multiple scenes are then collected using this platform to produce and form a dedicated multimodal pedestrian detection dataset. Further, a two-branch multimodal multilevel fusion pedestrian detection network (MM-Net) is proposed, which includes a two-branch feature extraction module and a feature-level data fusion module. Extensive experiments are performed on the multimodal pedestrian detection dataset and KITTI dataset for the comparison with the existing models. The experimental results demonstrate the superior performance of MM-Net.
引用
收藏
页码:7149 / 7161
页数:13
相关论文
共 50 条
  • [31] Deep Learning-Based Pedestrian Detection in Autonomous Vehicles: Substantial Issues and Challenges
    Iftikhar, Sundas
    Zhang, Zuping
    Asim, Muhammad
    Muthanna, Ammar
    Koucheryavy, Andrey
    Abd El-Latif, Ahmed A.
    [J]. ELECTRONICS, 2022, 11 (21)
  • [32] A Deep Learning-based Automatic Method for Early Detection of the Glaucoma using Fundus Images
    Shoukat, Ayesha
    Akbar, Shahzad
    Safdar, Khadij A.
    [J]. 4TH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING (IC)2, 2021, : 391 - 396
  • [33] Deep Learning-Based Graffiti Detection: A Study Using Images from the Streets of Lisbon
    Fogaca, Joana
    Brandao, Tomas
    Ferreira, Joao C.
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [34] Deep learning-based visual crack detection using Google Street View images
    Mohsen Maniat
    Charles V. Camp
    Ali R. Kashani
    [J]. Neural Computing and Applications, 2021, 33 : 14565 - 14582
  • [35] Deep Learning-Based Automatic Detection of Ships: An Experimental Study Using Satellite Images
    Patel, Krishna
    Bhatt, Chintan
    Mazzeo, Pier Luigi
    [J]. JOURNAL OF IMAGING, 2022, 8 (07)
  • [36] Deep learning-based visual crack detection using Google Street View images
    Maniat, Mohsen
    Camp, Charles V.
    Kashani, Ali R.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (21): : 14565 - 14582
  • [37] Deep Learning-Based Leaf Disease Detection in Crops Using Images for Agricultural Applications
    Andrew, J.
    Eunice, Jennifer
    Popescu, Daniela Elena
    Chowdary, M. Kalpana
    Hemanth, Jude
    [J]. AGRONOMY-BASEL, 2022, 12 (10):
  • [38] Pedestrian Detection Based on Deep Learning
    Jeon, Hyung-Min
    Vinh Dinh Nguyen
    Jeon, Jae Wook
    [J]. 45TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2019), 2019, : 144 - 149
  • [39] Deep Learning Based Pedestrian Detection
    Sun, Weicheng
    Zhu, Songhao
    Ju, Xuewen
    Wang, Dongsheng
    [J]. PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 1007 - 1011
  • [40] SEMANTIC URBAN MESH SEGMENTATION BASED ON AERIAL OBLIQUE IMAGES AND POINT CLOUDS USING DEEP LEARNING
    Wilk, L.
    Mielczarek, D.
    Ostrowski, W.
    Dominik, W.
    Krawczyk, J.
    [J]. XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 485 - 491