RGB-D Scene Recognition with Object-to-Object Relation

Cited by: 17

Authors:

Song, Xinhang ^{[1,2]}

Chen, Chengpeng ^{[1,2]}

Jiang, Shuqiang ^{[1,2]}

Affiliations:

[1] Chinese Acad Sci, Inst Comp Tech, Key Lab Intel Inf Proc, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing, Peoples R China

Source:

PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17) | 2017

Funding:

National Natural Science Foundation of China;

Keywords:

Intermediate representation; spatial layout; object detection; RGB-D; scene recognition;

DOI:

10.1145/3123266.3123300

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

A scene is usually an abstract concept composed of several less abstract entities, such as objects or themes. Reasoning about scenes directly from visual features is very difficult because of the semantic gap between abstract scenes and low-level visual features. Some alternative works recognize scenes with a two-step framework that represents images with intermediate representations of objects or themes. However, object co-occurrences across scenes may lead to ambiguity in scene recognition. In this paper, we propose a framework that represents images with intermediate (object) representations that include spatial layout, i.e., an object-to-object relation (OOR) representation. To better capture spatial information, the proposed OOR is adapted to RGB-D data. In the proposed framework, we first apply an object detection technique to the RGB and depth images separately. The detection results of both modalities are then combined by an RGB-D proposal fusion process. Based on the detection results, we extract the semantic OOR feature and regional convolutional neural network (CNN) features located by the bounding boxes. Finally, the different features are concatenated and fed to the classifier for scene recognition. Experimental results on the SUN RGB-D and NYUD2 datasets illustrate the efficiency of the proposed method.
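
The pipeline described in the abstract (per-modality detection, proposal fusion, OOR extraction, feature concatenation) can be illustrated with a minimal sketch. The abstract does not specify the fusion rule or how the OOR is encoded, so everything below is an assumption made for illustration: the detection tuple layout, the same-class IoU suppression used as "fusion", the left-of/above counts used as the relation descriptor, and the function names `fuse_detections`, `oor_feature`, and `scene_descriptor` are all hypothetical and are not the authors' implementation.

```python
from itertools import permutations

# Assumed layout of one detection: (class_index, score, (x1, y1, x2, y2)).

def iou(a, b):
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0

def fuse_detections(rgb_dets, depth_dets, iou_thresh=0.5):
    """Hypothetical fusion: pool both modalities, then greedily drop
    lower-scored boxes that overlap a kept box of the same class."""
    kept = []
    for det in sorted(rgb_dets + depth_dets, key=lambda d: d[1], reverse=True):
        if not any(k[0] == det[0] and iou(k[2], det[2]) >= iou_thresh for k in kept):
            kept.append(det)
    return kept

def center(box):
    return ((box[0] + box[2]) / 2.0, (box[1] + box[3]) / 2.0)

def oor_feature(dets, num_classes):
    """Hypothetical relation descriptor: for every ordered class pair (a, b),
    count how often an `a` box lies left of, and above, a `b` box."""
    feat = [0.0] * (num_classes * num_classes * 2)
    for (ca, _, box_a), (cb, _, box_b) in permutations(dets, 2):
        (ax, ay), (bx, by) = center(box_a), center(box_b)
        base = (ca * num_classes + cb) * 2
        if ax < bx:
            feat[base] += 1.0      # a is left of b
        if ay < by:
            feat[base + 1] += 1.0  # a is above b (image y grows downward)
    return feat

def scene_descriptor(dets, cnn_feature, num_classes):
    """Concatenate the relation descriptor with a regional CNN feature vector
    (here just a list of floats supplied by the caller)."""
    return oor_feature(dets, num_classes) + list(cnn_feature)
```

In this sketch the resulting vector would be passed to any standard classifier; a real system would take the regional CNN features from a trained network rather than from a caller-supplied list.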


Pages: 600-608

Page count: 9

