RGB-D Scene Recognition with Object-to-Object Relation

被引:17
|
作者
Song, Xinhang [1 ,2 ]
Chen, Chengpeng [1 ,2 ]
Jiang, Shuqiang [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Tech, Key Lab Intel Inf Proc, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Intermediate representation; spatial layout; object detection; RGB-D; scene recognition;
D O I
10.1145/3123266.3123300
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A scene is usually abstract that consists of several less abstract entities such as objects or themes. It is very difficult to reason scenes from visual features due to the semantic gap between the abstract scenes and low-level visual features. Some alternative works recognize scenes with a two-step framework by representing images with intermediate representations of objects or themes. However, the object co-occurrences between scenes may lead to ambiguity for scene recognition. In this paper, we propose a framework to represent images with intermediate (object) representations with spatial layout, i.e., object-to-object relation (OOR) representation. In order to better capture the spatial information, the proposed OOR is adapted to RGB-D data. In the proposed framework, we first apply object detection technique on RGB and depth images separately. Then the detected results of both modalities are combined with a RGB-D proposal fusion process. Based on the detected results, we extract semantic feature OOR and regional convolutional neural network (CNN) features located by bounding boxes. Finally, different features are concatenated to feed to the classifier for scene recognition. The experimental results on SUN RGB-D and NYUD2 datasets illustrate the efficiency of the proposed method.
引用
收藏
页码:600 / 608
页数:9
相关论文
共 50 条
  • [1] Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition
    Song, Xinhang
    Jiang, Shuqiang
    Wang, Bohan
    Chen, Chengpeng
    Chen, Gongwei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 525 - 537
  • [2] RGB-D Scene Recognition based on Object-Scene Relation
    Guo, Yuhui
    Liang, Xun
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15787 - 15788
  • [3] Contextual object category recognition for RGB-D scene labeling
    Ali, Haider
    Shafait, Faisal
    Giannakidou, Eirini
    Vakali, Athena
    Figueroa, Nadia
    Varvadoukas, Theodoros
    Mavridis, Nikolaos
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2014, 62 (02) : 241 - 256
  • [4] RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention
    Guo, Yuhui
    Liang, Xun
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 127 - 134
  • [5] RGB-D Object Modelling for Object Recognition and Tracking
    Prankl, Johann
    Aldoma, Aitor
    Svejda, Alexander
    Vincze, Markus
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 96 - 103
  • [6] Combining Features For RGB-D object Recognition
    Khan, Wasif
    Phaisangittisagul, Ekachai
    Ali, Luqman
    Gansawat, Duangrat
    Kumazawa, Itsuo
    [J]. 2017 INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2017,
  • [7] Object Recognition in Noisy RGB-D Data
    Carlos Rangel, Jose
    Morell, Vicente
    Cazorla, Miguel
    Orts-Escolano, Sergio
    Garcia Rodriguez, Jose
    [J]. BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 261 - 270
  • [8] Application of Transfer Learning in RGB-D Object Recognition
    Kumar, Abhishek
    Shrivatsav, S. Nithin
    Subrahmanyam, G. R. K. S.
    Mishra, Deepak
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 580 - 584
  • [9] Deep sensorimotor learning for RGB-D object recognition
    Thermos, Spyridon
    Papadopoulos, Georgios Th.
    Daras, Petros
    Potamianos, Gerasimos
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 190
  • [10] Recurrent Convolutional Fusion for RGB-D Object Recognition
    Loghmani, Mohammad Reza
    Planamente, Mirco
    Caputo, Barbara
    Vincze, Markus
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) : 2878 - 2885