Towards unified on-road object detection and depth estimation from a single image

被引:0
|
作者
Guofei Lian
Yan Wang
Huabiao Qin
Guancheng Chen
机构
[1] South China University of Technology,School of Electronic and Information Engineering
关键词
On-road object detection; Depth estimation; Monocular image; Convolution neural network; YOLOv3;
D O I
暂无
中图分类号
学科分类号
摘要
On-road object detection based on convolutional neural network (CNN) is an important problem in the field of automatic driving. However, traditional 2D object detection aims to accomplish object classification and location in image space, lacking the ability to acquire the depth information. Besides, it is inefficient to cascade the object detection and monocular depth estimation network for realizing 2.5D object detection. To address this problem, we propose a unified multi-task learning mechanism of object detection and depth estimation. Firstly, we propose an innovative loss function, namely projective consistency loss, which uses the perspective projection principle to model the transformation relationship between the target size and the depth value. Therefore, the object detection task and the depth estimation task can be mutually constrained. Then, we propose a global multi-scale feature extracting scheme by combining the Global Context (GC) and Atrous Spatial Pyramid Pooling (ASPP) block in an appropriate way, which can promote effective feature learning and collaborative learning between object detection and depth estimation. Comprehensive experiments conducted on KITTI and Cityscapes dataset show that our approach achieves high mAP and low distance estimation error, outperforming other state-of-the-art methods.
引用
收藏
页码:1231 / 1241
页数:10
相关论文
共 50 条
  • [1] Towards unified on-road object detection and depth estimation from a single image
    Lian, Guofei
    Wang, Yan
    Qin, Huabiao
    Chen, Guancheng
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (05) : 1231 - 1241
  • [2] Towards Unified Depth and Semantic Prediction from a Single Image
    Wang, Peng
    Shen, Xiaohui
    Lin, Zhe
    Cohen, Scott
    Price, Brian
    Yuille, Alan
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2800 - 2809
  • [3] Towards depth estimation in a single aerial image
    Pellegrin, Luis
    Martinez-Carranza, Jose
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (05) : 1970 - 1985
  • [4] Joint Object Detection and Depth Estimation in Multiplexed Image
    Zhou, Changxin
    Liu, Yazhou
    Sun, Quansen
    Lasang, Pongsak
    [J]. IEEE ACCESS, 2019, 7 : 123107 - 123115
  • [5] Joint Object Detection and Depth Estimation in Multiplexed Image
    Zhou, Changxin
    Liu, Yazhou
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 312 - 323
  • [6] Towards detection and tracking of on-road objects
    Goecke, Roland
    Pettersson, Niklas
    Petersson, Lars
    [J]. 2007 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2007, : 1048 - 1053
  • [7] Object detection from road image sequence
    Wang, PT
    Doihara, T
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING III, 2002, : 23 - 27
  • [8] Object Depth Estimation from a Single Image using Fully Convolutional Neural Network
    Afifi, Ahmed J.
    Hellwich, Olaf
    [J]. 2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 605 - 611
  • [9] On-Road Object Detection Based on Deep Residual Networks
    Chen, Kang
    Zhao, Qi
    Lin, Yaorong
    Zhang, Jun
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 567 - 574
  • [10] SINGLE IMAGE DEPTH ESTIMATION FROM IMAGE DESCRIPTORS
    Lin, Yu-Hsun
    Cheng, Wen-Huang
    Miao, Hsin
    Ku, Tsung-Hao
    Hsieh, Yung-Huan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 809 - 812