Convolutional Hypercube Pyramid for Accurate RGB-D Object Category and Instance Recognition

被引:0
|
作者
Zaki, Hasan F. M. [1 ]
Shafait, Faisal [2 ]
Mian, Ajmal [1 ]
机构
[1] Univ Western Australia, Sch Comp Sci & Software Engn, Crawley, WA 6009, Australia
[2] Natl Univ Sci & Technol, Islamabad, Pakistan
关键词
EXTREME LEARNING-MACHINE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning based methods have achieved unprecedented success in solving several computer vision problems involving RGB images. However, this level of success is yet to be seen on RGB-D images owing to two major challenges in this domain: training data deficiency and multi-modality input dissimilarity. We present an RGB-D object recognition framework that addresses these two key challenges by effectively embedding depth and point cloud data into the RGB domain. We employ a convolutional neural network (CNN) pre-trained on RGB data as a feature extractor for both color and depth channels and propose a rich coarse-to-fine feature representation scheme, coined Hypercube Pyramid, that is able to capture discriminatory information at different levels of detail. Finally, we present a novel fusion scheme to combine the Hypercube Pyramid features with the activations of fully connected neurons to construct a compact representation prior to classification. By employing Extreme Learning Machines (ELM) as non-linear classifiers, we show that the proposed method outperforms ten state-of-the-art algorithms for several tasks in terms of recognition accuracy on the benchmark Washington RGB-D and 2D3D object datasets by a large margin (upto 50% reduction in error rate).
引用
收藏
页码:1685 / 1692
页数:8
相关论文
共 50 条
  • [1] ENSEMBLE CLASSIFIER FOR JOINT OBJECT INSTANCE AND CATEGORY RECOGNITION ON RGB-D DATA
    Seib, Viktor
    Memmesheimer, Raphael
    Paulus, Dietrich
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 143 - 147
  • [2] Object Class and Instance Recognition on RGB-D Data
    Seib, Viktor
    Christ-Friedmann, Susanne
    Thierfelder, Susanne
    Paulus, Dietrich
    [J]. SIXTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2013), 2013, 9067
  • [3] Recurrent Convolutional Fusion for RGB-D Object Recognition
    Loghmani, Mohammad Reza
    Planamente, Mirco
    Caputo, Barbara
    Vincze, Markus
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) : 2878 - 2885
  • [4] Convolutional Fisher Kernels for RGB-D Object Recognition
    Cheng, Yanhua
    Cai, Rui
    Zhao, Xin
    Huang, Kaiqi
    [J]. 2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 135 - 143
  • [5] Contextual object category recognition for RGB-D scene labeling
    Ali, Haider
    Shafait, Faisal
    Giannakidou, Eirini
    Vakali, Athena
    Figueroa, Nadia
    Varvadoukas, Theodoros
    Mavridis, Nikolaos
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2014, 62 (02) : 241 - 256
  • [6] RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS
    Rahman, Mohammad Muntasir
    Tan, Yanhao
    Xue, Jian
    Lu, Ke
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 991 - 996
  • [7] RGB-D Object Recognition Using Deep Convolutional Neural Networks
    Zia, Saman
    Yuksel, Buket
    Yuret, Deniz
    Yemez, Yucel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 887 - 894
  • [8] Multiview RGB-D Dataset for Object Instance Detection
    Georgakis, Georgios
    Reza, Md Alimoor
    Mousavian, Arsalan
    Le, Phi-Hung
    Kosecka, Jana
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 426 - 434
  • [9] Revisiting Deep Convolutional Neural Networks for RGB-D Based Object Recognition
    Madai-Tahy, Lorand
    Otte, Sebastian
    Hanten, Richard
    Zell, Andreas
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 29 - 37
  • [10] RGB-D Object Modelling for Object Recognition and Tracking
    Prankl, Johann
    Aldoma, Aitor
    Svejda, Alexander
    Vincze, Markus
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 96 - 103