Shape Prior Guided Instance Disparity Estimation for 3D Object Detection

被引:9
|
作者
Chen, Linghao [1 ]
Sun, Jiaming [2 ]
Xie, Yiming [1 ]
Zhang, Siyu [2 ]
Shuai, Qing [1 ]
Jiang, Qinhong [2 ]
Zhang, Guofeng [1 ]
Bao, Hujun [1 ]
Zhou, Xiaowei [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou 310027, Zhejiang, Peoples R China
[2] SenseTime, Hangzhou 311215, Peoples R China
关键词
Three-dimensional displays; Estimation; Object detection; Shape; Solid modeling; Laser radar; Image reconstruction; Autonomous driving; 3D detection; stereo matching;
D O I
10.1109/TPAMI.2021.3076678
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images. Many recent works solve this problem by first recovering point clouds with disparity estimation and then apply a 3D detector. The disparity map is computed for the entire image, which is costly and fails to leverage category-specific prior. In contrast, we design an instance disparity estimation network (iDispNet) that predicts disparity only for pixels on objects of interest and learns a category-specific shape prior for more accurate disparity estimation. To address the challenge from scarcity of disparity annotation in training, we propose to use a statistical shape model to generate dense disparity pseudo-ground-truth without the need of LiDAR point clouds, which makes our system more widely applicable. Experiments on the KITTI dataset show that, when LiDAR ground-truth is not used at training time, Disp R-CNN outperforms previous state-of-the-art methods based on stereo input by 20 percent in terms of average precision for all categories. The code and pseudo-ground-truth data are available at the project page: https://github.com/zju3dv/disprcnn.
引用
收藏
页码:5529 / 5540
页数:12
相关论文
共 50 条
  • [1] 2D Amodal Instance Segmentation Guided by 3D Shape Prior
    Li, Zhixuan
    Ye, Weining
    Jiang, Tingting
    Huang, Tiejun
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 165 - 181
  • [2] Probabilistic instance shape reconstruction with sparse LiDAR for monocular 3D object detection
    Ji, Chaofeng
    Wu, Han
    Liu, Guizhong
    [J]. NEUROCOMPUTING, 2023, 529 : 92 - 100
  • [3] Object 3D position estimation based on instance segmentation
    Liu Chang-ji
    Hao Zhi-cheng
    Yang Jin-cheng
    Zhu Ming
    Nie Hai-tao
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (11) : 1535 - 1544
  • [4] Confidence Guided Stereo 3D Object Detection with Split Depth Estimation
    Li, Chengyao
    Ku, Jason
    Waslander, Steven L.
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5776 - 5783
  • [5] FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
    Chen, Yilun
    Yu, Zhiding
    Chen, Yukang
    Lan, Shiyi
    Anandkumar, Anima
    Jia, Jiaya
    Alvarez, Jose M.
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8360 - 8371
  • [6] Stereo 3D object detection via instance depth prior guidance and adaptive spatial feature aggregation
    Ji, Chaofeng
    Liu, Guizhong
    Zhao, Dan
    [J]. VISUAL COMPUTER, 2023, 39 (10): : 4543 - 4554
  • [7] Stereo 3D object detection via instance depth prior guidance and adaptive spatial feature aggregation
    Chaofeng Ji
    Guizhong Liu
    Dan Zhao
    [J]. The Visual Computer, 2023, 39 : 4543 - 4554
  • [8] 3D Object Detection Incorporating Instance Segmentation and Image Restoration
    HUANG Bo
    HUANG Man
    GAO Yongbin
    YU Yuxin
    JIANG Xiaoyan
    ZHANG Juan
    [J]. Wuhan University Journal of Natural Sciences, 2019, 24 (04) : 360 - 368
  • [9] Joint 3D Instance Segmentation and Object Detection for Autonomous Driving
    Zhou, Dingfu
    Fang, Jin
    Song, Xibin
    Liu, Liu
    Yin, Junbo
    Dai, Yuchao
    Li, Hongdong
    Yang, Ruigang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1836 - 1846
  • [10] Representation Disparity-aware Distillation for 3D Object Detection
    Li, Yanjing
    Xu, Sheng
    Lin, Mingbao
    Yin, Jihao
    Zhang, Baochang
    Cao, Xianbin
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6692 - 6701