Depth-Enhanced Deep Learning Approach For Monocular Camera Based 3D Object Detection

被引:0
|
作者
Wang, Chuyao [1 ]
Aouf, Nabil [1 ]
机构
[1] City Univ London, Sch Sci & Technol, Dept Engn, London, England
关键词
3D Object detection; Autonomous driving; Machine learning;
D O I
10.1007/s10846-024-02128-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic 3D object detection using monocular cameras presents significant challenges in the context of autonomous driving. Precise labeling of 3D object scales requires accurate spatial information, which is difficult to obtain from a single image due to the inherent lack of depth information in monocular images, compared to LiDAR data. In this paper, we propose a novel approach to address this issue by enhancing deep neural networks with depth information for monocular 3D object detection. The proposed method comprises three key components: 1)Feature Enhancement Pyramid Module: We extend the conventional Feature Pyramid Networks (FPN) by introducing a feature enhancement pyramid network. This module fuses feature maps from the original pyramid and captures contextual correlations across multiple scales. To increase the connectivity between low-level and high-level features, additional pathways are incorporated. 2)Auxiliary Dense Depth Estimator: We introduce an auxiliary dense depth estimator that generates dense depth maps to enhance the spatial perception capabilities of the deep network model without adding computational burden. 3)Augmented Center Depth Regression: To aid center depth estimation, we employ additional bounding box vertex depth regression based on geometry. Our experimental results demonstrate the superiority of the proposed technique over existing competitive methods reported in the literature. The approach showcases remarkable performance improvements in monocular 3D object detection, making it a promising solution for autonomous driving applications.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Deep Learning-Based Monocular 3D Object Detection with Refinement of Depth Information
    Hu, Henan
    Zhu, Ming
    Li, Muyu
    Chan, Kwok-Leung
    [J]. SENSORS, 2022, 22 (07)
  • [2] Deep Optics for Monocular Depth Estimation and 3D Object Detection
    Chang, Julie
    Wetzstein, Gordon
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10192 - 10201
  • [3] A Survey on Monocular 3D Object Detection Algorithms Based on Deep Learning
    Wu, Junhui
    Yin, Dong
    Chen, Jie
    Wu, Yusheng
    Si, Huiping
    Lin, Kaiyan
    [J]. 2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
  • [4] Deep Learning Based 3D Object Detection for Automotive Radar and Camera
    Meyer, Michael
    Kuschk, Georg
    [J]. 2019 16TH EUROPEAN RADAR CONFERENCE (EURAD), 2019, : 133 - 136
  • [5] A Survey on Deep Learning Based Methods and Datasets for Monocular 3D Object Detection
    Kim, Seong-heum
    Hwang, Youngbae
    [J]. ELECTRONICS, 2021, 10 (04) : 1 - 22
  • [6] 3D Street Object Detection from Monocular Images Using Deep Learning and Depth Information
    Liu, Wei
    Zhang, Tao
    Ma, Yun
    Wei, Longsheng
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (02) : 198 - 206
  • [7] Depth-discriminative Metric Learning for Monocular 3D Object Detection
    Choi, Wonhyeok
    Shin, Mingyu
    Im, Sunghoon
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Learning Depth-Guided Convolutions for Monocular 3D Object Detection
    Ng, Mingyu
    Huo, Yuqi
    Yi, Hongwei
    Wang, Zhe
    Shi, Jianping
    Lu, Zhiwu
    Luo, Ping
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4306 - 4315
  • [9] Competition for roadside camera monocular 3D object detection
    Jia, Jinrang
    Shi, Yifeng
    Qu, Yuli
    Wang, Rui
    Xu, Xing
    Zhang, Hai
    [J]. NATIONAL SCIENCE REVIEW, 2023, 10 (06)
  • [10] Competition for roadside camera monocular 3D object detection
    Jinrang Jia
    Yifeng Shi
    Yuli Qu
    Rui Wang
    Xing Xu
    Hai Zhang
    [J]. National Science Review, 2023, 10 (06) : 34 - 37