Ground-Aware Monocular 3D Object Detection for Autonomous Driving

被引:79
|
作者
Liu, Yuxuan [1 ]
Yixuan, Yuan [2 ]
Liu, Ming [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Robot & Multipercept Lab, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
来源
关键词
Three-dimensional displays; Cameras; Object detection; Two dimensional displays; Feature extraction; Convolution; Neural networks; Automation technologies for smart cities; deep learning for visual perception; object detection; segmentation and categorization;
D O I
10.1109/LRA.2021.3052442
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Estimating the 3D position and orientation of objects in the environment with a single RGB camera is a critical and challenging task for low-cost urban autonomous driving and mobile robots. Most of the existing algorithms are based on the geometric constraints in 2D-3D correspondence, which stems from generic 6D object pose estimation. We first identify how the ground plane provides additional clues in depth reasoning in 3D detection in driving scenes. Based on this observation, we then improve the processing of 3D anchors and introduce a novel neural network module to fully utilize such application-specific priors in the framework of deep learning. Finally, we introduce an efficient neural network embedded with the proposed module for 3D object detection. We further verify the power of the proposed module with a neural network designed for monocular depth prediction. The two proposed networks achieve state-of-the-art performances on the KITTI 3D object detection and depth prediction benchmarks, respectively.
引用
收藏
页码:919 / 926
页数:8
相关论文
共 50 条
  • [1] MonoGAE: Roadside Monocular 3D Object Detection With Ground-Aware Embeddings
    Yang, Lei
    Zhang, Xinyu
    Yu, Jiaxin
    Li, Jun
    Zhao, Tong
    Wang, Li
    Huang, Yi
    Zhang, Chuang
    Wang, Hong
    Li, Yiming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 1 - 15
  • [2] Monocular 3D Object Detection for Autonomous Driving
    Chen, Xiaozhi
    Kundu, Kaustav
    Zhang, Ziyu
    Ma, Huimin
    Fidler, Sanja
    Urtasun, Raquel
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2147 - 2156
  • [3] Efficient Uncertainty Estimation for Monocular 3D Object Detection in Autonomous Driving
    Liu, Zechen
    Han, Zhihua
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2711 - 2718
  • [4] Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
    Chen, Yi-Nan
    Dai, Hang
    Ding, Yong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 877 - 887
  • [5] Monocular 3D Object Detection for Autonomous Driving Based on Contextual Transformer
    She, Xiangyang
    Yan, Weijia
    Dong, Lihong
    Computer Engineering and Applications, 2024, 60 (19) : 178 - 189
  • [6] Monocular 3D object detection using dual quadric for autonomous driving
    Li, Peixuan
    Zhao, Huaici
    NEUROCOMPUTING, 2021, 441 : 151 - 160
  • [7] Pseudo-Mono for Monocular 3D Object Detection in Autonomous Driving
    Tao, Chongben
    Cao, Jiecheng
    Wang, Chen
    Zhang, Zufeng
    Gao, Zhen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3962 - 3975
  • [8] Monocular 3D object detection via estimation of paired keypoints for autonomous driving
    Chaofeng Ji
    Guizhong Liu
    Dan Zhao
    Multimedia Tools and Applications, 2022, 81 : 5973 - 5988
  • [9] Monocular 3D object detection via estimation of paired keypoints for autonomous driving
    Ji, Chaofeng
    Liu, Guizhong
    Zhao, Dan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (04) : 5973 - 5988
  • [10] Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving
    Wu, Jian
    Jiao, Jianbo
    Yang, Qingxiong
    Zha, Zheng-Jun
    Chen, Xuejin
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 971 - 979