Convolutional Fisher Kernels for RGB-D Object Recognition

被引:21
|
作者
Cheng, Yanhua [1 ,2 ,4 ]
Cai, Rui [5 ]
Zhao, Xin [1 ,2 ,4 ]
Huang, Kaiqi [1 ,2 ,3 ,4 ]
机构
[1] Center Res Intelligent Percept & Comp, Beijing, Peoples R China
[2] Natl Lab Pattern Recognit, Beijing, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100864, Peoples R China
[5] Microsoft Res, Beijing, Peoples R China
关键词
D O I
10.1109/3DV.2015.23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of improving object recognition using the novel RGB-D data. To address the problem, a new convolutional Fisher Kernels (CFK) method is proposed to represent RGB-D objects powerfully yet efficiently. The core idea of our approach is to integrate the both advantages of the convolutional neural networks (CNN) and Fisher Kernel encoding (FK): CNN model is flexible to adapt to new data sources, but requires for large amounts of training data with significant computational resources for good generalization; In comparison, FK encoding is able to represent objects powerfully and efficiently with small training data, however, its success highly depends on the well-designed SIFT features in literature, which may not be suitable for the new depth data. CFK can be interpreted as a two-layer feature learning structure to bridge the two models. The first layer employs a single-layer CNN to learn low-level translationally invariant features for both RGB and depth data efficiently. The second layer aggregates the convolutional responses by FK encoding. Here 2D and 3D spatial pyramids are applied to further improve the Fisher vector representation of each modality. Experiments on RGB-D object recognition benchmarks demonstrate that our approach can achieve the state-of-the-art results.
引用
收藏
页码:135 / 143
页数:9
相关论文
共 50 条
  • [1] Hybrid RGB-D Object Recognition using Convolutional Neural Network and Fisher Vector
    Li, Wei
    Cao, Zhiguo
    Xiao, Yang
    Fang, Zhiwen
    [J]. 2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 506 - 511
  • [2] Recurrent Convolutional Fusion for RGB-D Object Recognition
    Loghmani, Mohammad Reza
    Planamente, Mirco
    Caputo, Barbara
    Vincze, Markus
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) : 2878 - 2885
  • [3] RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS
    Rahman, Mohammad Muntasir
    Tan, Yanhao
    Xue, Jian
    Lu, Ke
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 991 - 996
  • [4] RGB-D Object Recognition Using Deep Convolutional Neural Networks
    Zia, Saman
    Yuksel, Buket
    Yuret, Deniz
    Yemez, Yucel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 887 - 894
  • [5] Convolutional Hypercube Pyramid for Accurate RGB-D Object Category and Instance Recognition
    Zaki, Hasan F. M.
    Shafait, Faisal
    Mian, Ajmal
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 1685 - 1692
  • [6] Revisiting Deep Convolutional Neural Networks for RGB-D Based Object Recognition
    Madai-Tahy, Lorand
    Otte, Sebastian
    Hanten, Richard
    Zell, Andreas
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 29 - 37
  • [7] RGB-D Object Modelling for Object Recognition and Tracking
    Prankl, Johann
    Aldoma, Aitor
    Svejda, Alexander
    Vincze, Markus
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 96 - 103
  • [8] Combining Features For RGB-D object Recognition
    Khan, Wasif
    Phaisangittisagul, Ekachai
    Ali, Luqman
    Gansawat, Duangrat
    Kumazawa, Itsuo
    [J]. 2017 INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2017,
  • [9] Object Recognition in Noisy RGB-D Data
    Carlos Rangel, Jose
    Morell, Vicente
    Cazorla, Miguel
    Orts-Escolano, Sergio
    Garcia Rodriguez, Jose
    [J]. BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 261 - 270
  • [10] Convolutional Neural Network for 3D Object Recognition Based on RGB-D Dataset
    Wang, Jianhua
    Lu, Jinjin
    Chen, Weihai
    Wu, Xingming
    [J]. PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 34 - 39