RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS

被引:0
|
作者
Rahman, Mohammad Muntasir [1 ]
Tan, Yanhao [1 ]
Xue, Jian [1 ]
Lu, Ke [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Object recognition; RGB-D data; Deep neural networks; Multi-modal feature learning;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Object recognition from RGB-D images has become a hot topic and gained a significant popularity in recent years due to its numerous applications. In this paper, we propose a novel multimodal deep convolutional neural networks architecture for RGB-D object recognition which composed of three streams with two different types of deep CNNs, where each stream can separately learn from each modality. Finally, we propose a combined architecture of joint network of these three streams to classify the objects. Compared to RGB data, RGB-D images provide additional depth information that can be represented as depth colorization methods or surface normals. Our goal is to exploit both colorization and surface normals information to encode depth images. We show that by utilizing both colorization and surface normals of depth images combined with RGB significantly can improves the classification accuracy. We evaluate our model on one of the most challenging RGB-D object dataset and achieves comparable performance to state-of-the-art methods.
引用
收藏
页码:991 / 996
页数:6
相关论文
共 50 条
  • [1] RGB-D Object Recognition Using Deep Convolutional Neural Networks
    Zia, Saman
    Yuksel, Buket
    Yuret, Deniz
    Yemez, Yucel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 887 - 894
  • [2] Revisiting Deep Convolutional Neural Networks for RGB-D Based Object Recognition
    Madai-Tahy, Lorand
    Otte, Sebastian
    Hanten, Richard
    Zell, Andreas
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 29 - 37
  • [3] RGB-D BASED MULTIMODAL CONVOLUTIONAL NEURAL NETWORKS FOR SPACECRAFT RECOGNITION
    AlDahoul, Nouar
    Karim, Hezerul Abdul
    Momo, Mhd Adel
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING CHALLENGES (ICIPC), 2021, : 1 - 5
  • [4] Multimodal Deep Learning for Robust RGB-D Object Recognition
    Eitel, Andreas
    Springenberg, Jost Tobias
    Spinello, Luciano
    Riedmiller, Martin
    Burgard, Wolfram
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 681 - 687
  • [5] Human Action Recognition Using RGB-D Sensor and Deep Convolutional Neural Networks
    Imran, Javed
    Kumar, Praveen
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 144 - 148
  • [6] Multimodal Convolutional Neural Network for Object Detection Using RGB-D Images
    Mocanu, Irina
    Clapon, Cosmin
    [J]. 2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018, : 307 - 310
  • [7] Multimodal Neural Networks: RGB-D for Semantic Segmentation and Object Detection
    Schneider, Lukas
    Jasch, Manuel
    Froehlich, Bjoern
    Weber, Thomas
    Franke, Uwe
    Pollefeys, Marc
    Raetsch, Matthias
    [J]. IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 98 - 109
  • [8] RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A Survey
    Gao, Mingliang
    Jiang, Jun
    Zou, Guofeng
    John, Vijay
    Liu, Zheng
    [J]. IEEE ACCESS, 2019, 7 : 43110 - 43136
  • [9] Gesture Recognition of RGB and RGB-D Static Images Using Convolutional Neural Networks
    Khari, Manju
    Garg, Aditya Kumar
    Gonzalez Crespo, Ruben
    Verdu, Elena
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2019, 5 (07): : 22 - 27
  • [10] Convolutional Fisher Kernels for RGB-D Object Recognition
    Cheng, Yanhua
    Cai, Rui
    Zhao, Xin
    Huang, Kaiqi
    [J]. 2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 135 - 143