Adaptive RGB Image Recognition by Visual-Depth Embedding

被引：14

作者：

Cai, Ziyun ^{[1
]}

Long, Yang ^{[2
]}

Shao, Ling ^{[3
,4
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing, Jiangsu, Peoples R China

[2] Newcastle Univ, Sch Comp, Open Lab, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England

[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates

[4] Univ East Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2018年 / 27卷 / 05期

关键词：

RGB-D data; domain adaptation; visual categorization; NONNEGATIVE MATRIX FACTORIZATION; KERNEL;

D O I：

10.1109/TIP.2018.2806839

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recognizing RGB images from RGB-D data is a promising application, which significantly reduces the cost while can still retain high recognition rates. However, existing methods still suffer from the domain shifting problem due to conventional surveillance cameras and depth sensors are using different mechanisms. In this paper, we aim to simultaneously solve the above two challenges: 1) how to take advantage of the additional depth information in the source domain? 2) how to reduce the data distribution mismatch between the source and target domains? We propose a novel method called adaptive visual-depth embedding (aVDE), which learns the compact shared latent space between two representations of labeled RGB and depth modalities in the source domain first. Then the shared latent space can help the transfer of the depth information to the unlabeled target dataset. At last, aVDE models two separate learning strategies for domain adaptation (feature matching and instance reweighting) in a unified optimization problem, which matches features and reweights instances jointly across the shared latent space and the projected target domain for an adaptive classifier. We test our method on five pairs of data sets for object recognition and scene classification, the results of which demonstrates the effectiveness of our proposed method.

引用

页码：2471 / 2483

页数：13

共 50 条

[41] Visual Orientation And Recognition Of An Image
Harsha, T. D.
Fousiya, K. K.
PROCEEDINGS OF 2016 ONLINE INTERNATIONAL CONFERENCE ON GREEN ENGINEERING AND TECHNOLOGIES (IC-GET), 2016,
[42] Dense RGB-D visual odometry using inverse depth
Gutierrez-Gomez, Daniel
Mayol-Cuevas, Walterio
Guerrero, J. J.
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 571 - 583
[43] Learning to Weight Color and Depth for RGB-D Visual Search
Petrelli, Alioscia
Di Stefano, Luigi
IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 648 - 659
[44] Human Activities Recognition with RGB-Depth Camera using HMM
Dubois, Amandine
Charpillet, Francois
2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013, : 4666 - 4669
[45] Isolated Sign Recognition with a Siamese Neural Network of RGB and Depth Streams
Tur, Anil Osman
Keles, Hacer Yalim
PROCEEDINGS OF 18TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES (IEEE EUROCON 2019), 2019,
[46] Static Gesture Recognition Based on RGB-D Depth Information
Wang, Yi
Dong, Xiucheng
Li, Changlong
Yu, Ximu
ADVANCES IN COMPUTERS, ELECTRONICS AND MECHATRONICS, 2014, 667 : 248 - +
[47] Sparse Distance Learning for Object Recognition Combining RGB and Depth Information
Lai, Kevin
Bo, Liefeng
Ren, Xiaofeng
Fox, Dieter
2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
[48] Human Action Recognition using Meta Learning for RGB and Depth Information
Amiri, S. Mohsen
Pourazad, Mahsa T.
Nasiopoulos, Panos
Leung, Victor C. M.
2014 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2014, : 363 - 367
[49] Image recognition: Visual grouping, recognition, and learning
Buhmann, JM
Malik, J
Perona, P
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (25) : 14203 - 14204
[50] An adaptive converged depth completion network based on efficient RGB guidance
Kaixiang Liu
Qingwu Li
Yaqin Zhou
Multimedia Tools and Applications, 2022, 81 : 35915 - 35933

← 1 2 3 4 5 →