Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images

被引:412
|
作者
Song, Shuran [1 ]
Xiao, Jianxiong [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2016.94
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We focus on the task of amodal 3D object detection in RGB-D images, which aims to produce a 3D bounding box of an object in metric form at its full extent. We introduce Deep Sliding Shapes, a 3D ConvNet formulation that takes a 3D volumetric scene from a RGB-D image as input and outputs 3D object bounding boxes. In our approach, we propose the first 3D Region Proposal Network (RPN) to learn objectness from geometric shapes and the first joint Object Recognition Network (ORN) to extract geometric features in 3D and color features in 2D. In particular, we handle objects of various sizes by training an amodal RPN at two different scales and an ORN to regress 3D bounding boxes. Experiments show that our algorithm outperforms the state-of-the-art by 13.8 in mAP and is 200x faster than the original Sliding Shapes.
引用
收藏
页码:808 / 816
页数:9
相关论文
共 50 条
  • [41] Rethinking feature aggregation for deep RGB-D salient object detection
    Zhang, Yuan-fang
    Zheng, Jiangbin
    Li, Long
    Liu, Nian
    Jia, Wenjing
    Fan, Xiaochen
    Xu, Chengpei
    He, Xiangjian
    NEUROCOMPUTING, 2021, 423 : 463 - 473
  • [42] Yolo+FPN: 2D and 3D Fused Object Detection With an RGB-D Camera
    Wang, Ya
    Zell, Andreas
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4657 - 4664
  • [43] RGB-D Object Tracking with Occlusion Detection
    Xie, Yujun
    Lu, Yao
    Gu, Shuang
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 11 - 15
  • [44] RGB-D salient object detection: A survey
    Tao Zhou
    Deng-Ping Fan
    Ming-Ming Cheng
    Jianbing Shen
    Ling Shao
    Computational Visual Media, 2021, 7 (01) : 37 - 69
  • [45] RGB-D salient object detection: A survey
    Zhou, Tao
    Fan, Deng-Ping
    Cheng, Ming-Ming
    Shen, Jianbing
    Shao, Ling
    COMPUTATIONAL VISUAL MEDIA, 2021, 7 (01) : 37 - 69
  • [46] RGB-D salient object detection: A survey
    Tao Zhou
    Deng-Ping Fan
    Ming-Ming Cheng
    Jianbing Shen
    Ling Shao
    Computational Visual Media, 2021, 7 : 37 - 69
  • [47] Calibrated RGB-D Salient Object Detection
    Ji, Wei
    Li, Jingjing
    Yu, Shuang
    Zhang, Miao
    Piao, Yongri
    Yao, Shunyu
    Bi, Qi
    Ma, Kai
    Zheng, Yefeng
    Lu, Huchuan
    Cheng, Li
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9466 - 9476
  • [48] Salient Object Detection in RGB-D Videos
    Mou, Ao
    Lu, Yukang
    He, Jiahao
    Min, Dingyao
    Fu, Keren
    Zhao, Qijun
    IEEE Transactions on Image Processing, 2024, 33 : 6660 - 6675
  • [49] 3D Human Pose Estimation from RGB-D Images Using Deep Learning Method
    Chun, Junchul
    Park, Seohee
    Ji, Myunggeun
    2018 INTERNATIONAL CONFERENCE ON SENSORS, SIGNAL AND IMAGE PROCESSING (SSIP 2018), 2018, : 51 - 55
  • [50] A Comparative Evaluation of 3D Keypoint Detectors in a RGB-D Object Dataset
    Filipe, Silvio
    Alexandre, Luis A.
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS (VISAPP), VOL 1, 2014, : 476 - 483