Contextual object category recognition for RGB-D scene labeling

被引:19
|
作者
Ali, Haider [1 ]
Shafait, Faisal [2 ]
Giannakidou, Eirini [3 ]
Vakali, Athena [3 ]
Figueroa, Nadia [4 ]
Varvadoukas, Theodoros [4 ]
Mavridis, Nikolaos [4 ]
机构
[1] DLR, German Aerosp Ctr, Inst Robot & Mech RM, Oberpfaffenhofen, Germany
[2] Univ Western Australia, Comp Sci & Software Engn Dept, Perth, WA 6009, Australia
[3] Aristotle Univ Thessaloniki AUTH, Dept Informat, Thessaloniki, Greece
[4] New York Univ Abu Dhabi, Dept Engn, Abu Dhabi, U Arab Emirates
关键词
Object recognition; Contextual modeling; RGB-D scenes; Social media; 3D scene labeling; FEATURES;
D O I
10.1016/j.robot.2013.10.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in computer. vision on the one hand, and imaging technologies on the other hand, have opened up a number of interesting possibilities for robust 3D scene labeling. This paper presents contributions in several directions to improve the state-of-the-art in RGB-D scene labeling. First, we present a novel combination of depth and color features to recognize different object categories in isolation. Then, we use a context model that exploits detection results of other objects in the scene to jointly optimize labels of co-occurring objects in the scene. Finally, we investigate the use of social media mining to develop the context model, and provide an investigation of its convergence. We perform thorough experimentation on both the publicly available RGB-D Dataset from the University of Washington as well as on the NYU scene dataset. An analysis of the results shows interesting insights about contextual object category recognition, and its benefits. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:241 / 256
页数:16
相关论文
共 50 条
  • [1] RGB-D Scene Recognition with Object-to-Object Relation
    Song, Xinhang
    Chen, Chengpeng
    Jiang, Shuqiang
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 600 - 608
  • [2] RGB-D Scene Recognition based on Object-Scene Relation
    Guo, Yuhui
    Liang, Xun
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15787 - 15788
  • [3] Learning a deeply supervised multi-modal RGB-D embedding for semantic scene and object category recognition
    Zaki, Hasan F. M.
    Shafait, Faisal
    Mian, Ajmal
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2017, 92 : 41 - 52
  • [4] Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition
    Song, Xinhang
    Jiang, Shuqiang
    Wang, Bohan
    Chen, Chengpeng
    Chen, Gongwei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 525 - 537
  • [5] Convolutional Hypercube Pyramid for Accurate RGB-D Object Category and Instance Recognition
    Zaki, Hasan F. M.
    Shafait, Faisal
    Mian, Ajmal
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 1685 - 1692
  • [6] ENSEMBLE CLASSIFIER FOR JOINT OBJECT INSTANCE AND CATEGORY RECOGNITION ON RGB-D DATA
    Seib, Viktor
    Memmesheimer, Raphael
    Paulus, Dietrich
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 143 - 147
  • [7] RGB-D Object Modelling for Object Recognition and Tracking
    Prankl, Johann
    Aldoma, Aitor
    Svejda, Alexander
    Vincze, Markus
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 96 - 103
  • [8] RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention
    Guo, Yuhui
    Liang, Xun
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 127 - 134
  • [9] Combining Features For RGB-D object Recognition
    Khan, Wasif
    Phaisangittisagul, Ekachai
    Ali, Luqman
    Gansawat, Duangrat
    Kumazawa, Itsuo
    [J]. 2017 INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2017,
  • [10] Object Recognition in Noisy RGB-D Data
    Carlos Rangel, Jose
    Morell, Vicente
    Cazorla, Miguel
    Orts-Escolano, Sergio
    Garcia Rodriguez, Jose
    [J]. BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 261 - 270