Is Pseudo-Lidar needed for Monocular 3D Object detection?

被引:88
|
作者
Park, Dennis [1 ]
Ambrus, Rares [1 ]
Guizilini, Vitor [1 ]
Li, Jie [1 ]
Gaidon, Adrien [1 ]
机构
[1] Toyota Res Inst, Cambridge, MA 02139 USA
关键词
D O I
10.1109/ICCV48922.2021.00313
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors. These two-stage detectors improve with the accuracy of the intermediate depth estimation network, which can itself be improved without manual labels via large-scale self-supervised learning. However, they tend to suffer from overfitting more than end-to-end methods, are more complex, and the gap with similar lidar-based detectors remains significant. In this work, we propose an end-to-end, single stage, monocular 3D object detector, DD3D, that can benefit from depth pre-training like pseudo-lidar methods, but without their limitations. Our architecture is designed for effective information transfer between depth estimation and 3D detection, allowing us to scale with the amount of unlabeled pre-training data. Our method achieves state-of-the-art results on two challenging benchmarks, with 16:34% and 9:28% AP for Cars and Pedestrians (respectively) on the KITTI-3D benchmark, and 41.5% mAP on NuScenes.
引用
收藏
页码:3122 / 3132
页数:11
相关论文
共 50 条
  • [11] Monocular 3D object detection with Pseudo-LiDAR confidence sampling and hierarchical geometric feature extraction in 6G network
    Zhang, Jianlong
    Fang, Guangzu
    Wang, Bin
    Zhou, Xiaobo
    Pei, Qingqi
    Chen, Chen
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (04) : 827 - 835
  • [12] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
    Qian, Rui
    Garg, Divyansh
    Wang, Yan
    You, Yurong
    Belongie, Serge
    Hariharan, Bharath
    Campbell, Mark
    Weinberger, Kilian Q.
    Chao, Wei-Lun
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5880 - 5889
  • [13] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
    Wang, Yan
    Chao, Wei-Lun
    Garg, Divyansh
    Hariharan, Bharath
    Campbell, Mark
    Weinberger, Kilian Q.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8437 - 8445
  • [14] Lidar Point Cloud Guided Monocular 3D Object Detection
    Peng, Liang
    Liu, Fei
    Yu, Zhengxu
    Yan, Senbo
    Deng, Dan
    Yang, Zheng
    Liu, Haifeng
    Cai, Deng
    [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 123 - 139
  • [15] Simple linear iterative clustering based low-cost pseudo-LiDAR for 3D object detection in autonomous driving
    Le, Duy
    Nguyen, Linh
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (16) : 25253 - 25269
  • [16] Simple linear iterative clustering based low-cost pseudo-LiDAR for 3D object detection in autonomous driving
    Duy Le
    Linh Nguyen
    [J]. Multimedia Tools and Applications, 2023, 82 : 25253 - 25269
  • [17] Pseudo-Mono for Monocular 3D Object Detection in Autonomous Driving
    Tao, Chongben
    Cao, Jiecheng
    Wang, Chen
    Zhang, Zufeng
    Gao, Zhen
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3962 - 3975
  • [18] Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
    Chen, Yi-Nan
    Dai, Hang
    Ding, Yong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 877 - 887
  • [19] Sparse Pseudo-LiDAR Depth Assisted Monocular Depth Estimation
    Shao, Shuwei
    Pei, Zhongcai
    Chen, Weihai
    Liu, Qiang
    Yue, Haosong
    Li, Zhengguo
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 917 - 929
  • [20] Probabilistic instance shape reconstruction with sparse LiDAR for monocular 3D object detection
    Ji, Chaofeng
    Wu, Han
    Liu, Guizhong
    [J]. NEUROCOMPUTING, 2023, 529 : 92 - 100