Sliding Shapes for 3D Object Detection in Depth Images

Cited by: 0
Authors
Song, Shuran [1]
Xiao, Jianxiong [1]
Affiliations
[1] Princeton Univ, Princeton, NJ 08544 USA
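Source
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code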
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The depth information of RGB-D sensors has greatly simplified some common challenges in computer vision and enabled breakthroughs for several tasks. In this paper, we propose to use depth maps for object detection and design a 3D detector to overcome the major difficulties for recognition, namely the variations of texture, illumination, shape, viewpoint, clutter, occlusion, self-occlusion and sensor noise. We take a collection of 3D CAD models and render each CAD model from hundreds of viewpoints to obtain synthetic depth maps. For each depth rendering, we extract features from the 3D point cloud and train an Exemplar-SVM classifier. During testing and hard-negative mining, we slide a 3D detection window in 3D space. Experimental results show that our 3D detector significantly outperforms the state-of-the-art algorithms for both RGB and RGB-D images, and achieves about a 1.7× improvement in average precision compared to DPM and R-CNN. All source code and data are available online.
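The sliding-window step mentioned in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the binary occupancy feature, the single linear scoring function standing in for a trained Exemplar-SVM, and all names (`occupancy_feature`, `slide_window`, `cells`, `stride`) are assumptions chosen only to show the idea of scoring every 3D window position over a point cloud.

```python
import math
from itertools import product


def occupancy_feature(points, origin, window, cells):
    """Binary occupancy of a cells x cells x cells grid inside one window."""
    occupied = [0.0] * (cells ** 3)
    for p in points:
        idx = []
        for axis in range(3):
            rel = (p[axis] - origin[axis]) / window[axis]
            if not 0.0 <= rel < 1.0:  # point lies outside the window
                break
            idx.append(min(int(rel * cells), cells - 1))
        else:
            # Reached only when the point is inside the window on all axes.
            occupied[(idx[0] * cells + idx[1]) * cells + idx[2]] = 1.0
    return occupied


def slide_window(points, window, stride, weights, bias, cells=2):
    """Score every window position; return (score, origin) pairs, best first."""
    lo = [min(p[a] for p in points) for a in range(3)]
    hi = [max(p[a] for p in points) for a in range(3)]
    # Number of window positions along each axis (at least one).
    steps = [max(0, math.floor((hi[a] - lo[a] - window[a]) / stride)) + 1
             for a in range(3)]
    results = []
    for i, j, k in product(range(steps[0]), range(steps[1]), range(steps[2])):
        origin = (lo[0] + i * stride, lo[1] + j * stride, lo[2] + k * stride)
        feat = occupancy_feature(points, origin, window, cells)
        # Hypothetical linear scorer in place of a trained Exemplar-SVM.
        score = sum(w * f for w, f in zip(weights, feat)) + bias
        results.append((score, origin))
    results.sort(key=lambda r: r[0], reverse=True)
    return results
```

Calling `slide_window(points, (1.0, 1.0, 1.0), 0.5, weights, bias)` would score 1 m windows at 0.5 m steps, given `cells ** 3` weights. A full detector would also need the feature extraction, per-exemplar classifiers, and hard-negative mining that the abstract describes, none of which are modeled here.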
Pages: 634 - 651
Page count: 18
Related papers
50 in total
  • [1] Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images
    Song, Shuran
    Xiao, Jianxiong
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 808 - 816
  • [2] FrustumVoxNet for 3D object detection from RGB-D or Depth images
    Shen, Xiaoke
    Stamos, Ioannis
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1687 - 1695
  • [3] Improved Sliding Shapes for Instance Segmentation of Amodal 3D Object
    Lin, Jinhua
    Yao, Yu
    Wang, Yanjie
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (11): : 5555 - 5567
  • [4] 3D Reconstruction of Novel Object Shapes from Single Images
    Thai, Anh
    Stojanov, Stefan
    Upadhya, Vijay
    Rehg, James M.
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 85 - 95
  • [5] Fuzzy object detection in 3D medical images
    Wang, H
    Zhuang, TG
    Jiang, DZ
    Zhang, H
    Liu, WY
    Bacelar, A
    Magnin, IE
    Gimenez, G
    NONLINEAR IMAGE PROCESSING VIII, 1997, 3026 : 155 - 163
  • [6] Combining depth and gray images for fast 3D object recognition
    Pan, Wang
    Zhu, Feng
    Hao, Yingming
    OPTICAL MEASUREMENT TECHNOLOGY AND INSTRUMENTATION, 2016, 10155
  • [7] Category Level 3D Object Recognition using Depth Images
    Kayim, Guney
    Akgul, Ceyhun Burak
    Sankur, Bulent
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [8] 3D Object Finding Using Geometrical Constraints on Depth Images
    Van-Hung Le
    Hai Vu
    Thuy Thi Nguyen
    Thi-Lan Le
    Thi-Thanh-Hai Tran
    Vlaminck, Michiel
    Philips, Wilfried
    Veelaert, Peter
    2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 389 - 394
  • [9] Simultaneous object size and depth adjustment for stereoscopic 3D images
    Shao, Feng
    Fei, Yanjia
    Fu, Randi
    Jiang, Gangyi
    Ho, Yo-Sung
    INFORMATION SCIENCES, 2019, 481 : 280 - 291
  • [10] Object Detection and Depth Estimation for 3D Trajectory Extraction
    Boukhers, Zeyd
    Shirahama, Kimiaki
    Li, Frederic
    Grzegorzek, Marcin
    2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,