Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images

被引:412
|
作者
Song, Shuran [1 ]
Xiao, Jianxiong [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2016.94
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We focus on the task of amodal 3D object detection in RGB-D images, which aims to produce a 3D bounding box of an object in metric form at its full extent. We introduce Deep Sliding Shapes, a 3D ConvNet formulation that takes a 3D volumetric scene from a RGB-D image as input and outputs 3D object bounding boxes. In our approach, we propose the first 3D Region Proposal Network (RPN) to learn objectness from geometric shapes and the first joint Object Recognition Network (ORN) to extract geometric features in 3D and color features in 2D. In particular, we handle objects of various sizes by training an amodal RPN at two different scales and an ORN to regress 3D bounding boxes. Experiments show that our algorithm outperforms the state-of-the-art by 13.8 in mAP and is 200x faster than the original Sliding Shapes.
引用
收藏
页码:808 / 816
页数:9
相关论文
共 50 条
  • [21] 3D Camouflaging Object using RGB-D Sensors
    Siddek, Ahmed M.
    Rashwan, Mohsen A.
    Eshrah, Islam A.
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1232 - 1237
  • [22] Implementaion of 3D Collaborative Object Detection Systems using RGB-D Sensors
    Jun, Sungwoo
    Baek, Jaeuk
    Do, Seungwon
    Lee, ChangEun
    2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 366 - 368
  • [23] RGB-D Salient Object Detection via 3D Convolutional Neural Networks
    Chen, Qian
    Liu, Ze
    Zhang, Yi
    Fu, Keren
    Zhao, Qijun
    Du, Hongwei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1063 - 1071
  • [24] Basic 3D Solid Recognition in RGB-D Images
    Kornuta, Tomasz
    Stefanczyk, Maciej
    Kasprzak, Wlodzimierz
    RECENT ADVANCES IN AUTOMATION, ROBOTICS AND MEASURING TECHNIQUES, 2014, 267 : 421 - 430
  • [25] A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance
    Cheng, Shyi-Chyi
    Hsiao, Kuei-Fang
    Yang, Chen-Kuei
    Hsiao, Po-Fu
    Yu, Wan-Hsuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (23-24) : 15829 - 15857
  • [26] Voting-based 3D Object Cuboid Detection Robust to Partial Occlusion from RGB-D Images
    Yun, Sangdoo
    Jeong, Hawook
    Kim, Soo Wan
    Choi, Jin Young
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [27] 3D Object Discovery and Modeling Using Single RGB-D Images Containing Multiple Object Instances
    Abbeloos, Wim
    Ataer-Cansizoglu, Esra
    Caccamo, Sergio
    Taguchi, Yuichi
    Domae, Yukiyasu
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 431 - 439
  • [28] A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance
    Shyi-Chyi Cheng
    Kuei-Fang Hsiao
    Chen-Kuei Yang
    Po-Fu Hsiao
    Wan-Hsuan Yu
    Multimedia Tools and Applications, 2020, 79 : 15829 - 15857
  • [29] 3D object detection: Learning 3D bounding boxes from scaled down 2D bounding boxes in RGB-D images
    Rahman, Mohammad Muntasir
    Tan, Yanhao
    Xue, Jian
    Shao, Ling
    Lu, Ke
    INFORMATION SCIENCES, 2019, 476 : 147 - 158
  • [30] 3D Object Detection and 6D Pose Estimation Using RGB-D Images and Mask R-CNN
    Tran, Van Luan
    Lin, Huei-Yung
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,