Deep Learning of Local RGB-D Patches for 3D Object Detection and 6D Pose Estimation

被引:158
|
作者
Kehl, Wadim [1 ]
Milletari, Fausto [1 ]
Tombari, Federico [1 ,2 ]
Ilic, Slobodan [1 ,3 ]
Navab, Nassir [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Univ Bologna, Bologna, Italy
[3] Siemens AG, Munich, Germany
来源
关键词
D O I
10.1007/978-3-319-46487-9_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a 3D object detection method that uses regressed descriptors of locally-sampled RGB-D patches for 6D vote casting. For regression, we employ a convolutional auto-encoder that has been trained on a large collection of random local patches. During testing, scene patch descriptors are matched against a database of synthetic model view patches and cast 6D object votes which are subsequently filtered to refined hypotheses. We evaluate on three datasets to show that our method generalizes well to previously unseen input data, delivers robust detection results that compete with and surpass the state-of-the-art while being scalable in the number of objects.
引用
收藏
页码:205 / 220
页数:16
相关论文
共 50 条
  • [31] SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again
    Kehl, Wadim
    Manhardt, Fabian
    Tombari, Federico
    Ilic, Slobodan
    Navab, Nassir
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1530 - 1538
  • [32] Efficient Center Voting for Object Detection and 6D Pose Estimation in 3D Point Cloud
    Guo, Jianwei
    Xing, Xuejun
    Quan, Weize
    Yan, Dong-Ming
    Gu, Qingyi
    Liu, Yang
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5072 - 5084
  • [33] 3D Hand Pose Detection in Egocentric RGB-D Images
    Rogez, Gregory
    Khademi, Maryam
    Supancic, J. S., III
    Montiel, J. M. M.
    Ramanan, Deva
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 356 - 371
  • [34] Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images
    Song, Shuran
    Xiao, Jianxiong
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 808 - 816
  • [35] RGB-D object pose estimation in unstructured environments
    Choi, Changhyun
    Christensen, Henrik I.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 595 - 613
  • [36] Online 3D Reconstruction and 6-DoF Pose Estimation for RGB-D Sensors
    Lim, Hyon
    Lim, Jongwoo
    Kim, H. Jin
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 238 - 254
  • [37] Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans
    Gajdosech, Lukas
    Kocur, Viktor
    Stuchlik, Martin
    Hudec, Lukas
    Madaras, Martin
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 545 - 552
  • [38] Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation
    Liu, Xingyu
    Ren, Pengfei
    Gao, Yuanyuan
    Wang, Jingyu
    Sun, Haifeng
    Qi, Qi
    Zhuang, Zirui
    Liao, Jianxin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3756 - 3764
  • [39] 3D Pose Estimation of Daily Objects Using an RGB-D Camera
    Choi, Changhyun
    Christensen, Henrik I.
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 3342 - 3349
  • [40] Context-aware 6D pose estimation of known objects using RGB-D data
    Ankit Kumar
    Priya Shukla
    Vandana Kushwaha
    Gora Chand Nandi
    Multimedia Tools and Applications, 2024, 83 : 52973 - 52987