Adaptive RGB Image Recognition by Visual-Depth Embedding

被引:14
|
作者
Cai, Ziyun [1 ]
Long, Yang [2 ]
Shao, Ling [3 ,4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing, Jiangsu, Peoples R China
[2] Newcastle Univ, Sch Comp, Open Lab, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England
[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Univ East Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
关键词
RGB-D data; domain adaptation; visual categorization; NONNEGATIVE MATRIX FACTORIZATION; KERNEL;
D O I
10.1109/TIP.2018.2806839
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing RGB images from RGB-D data is a promising application, which significantly reduces the cost while can still retain high recognition rates. However, existing methods still suffer from the domain shifting problem due to conventional surveillance cameras and depth sensors are using different mechanisms. In this paper, we aim to simultaneously solve the above two challenges: 1) how to take advantage of the additional depth information in the source domain? 2) how to reduce the data distribution mismatch between the source and target domains? We propose a novel method called adaptive visual-depth embedding (aVDE), which learns the compact shared latent space between two representations of labeled RGB and depth modalities in the source domain first. Then the shared latent space can help the transfer of the depth information to the unlabeled target dataset. At last, aVDE models two separate learning strategies for domain adaptation (feature matching and instance reweighting) in a unified optimization problem, which matches features and reweights instances jointly across the shared latent space and the projected target domain for an adaptive classifier. We test our method on five pairs of data sets for object recognition and scene classification, the results of which demonstrates the effectiveness of our proposed method.
引用
收藏
页码:2471 / 2483
页数:13
相关论文
共 50 条
  • [41] Visual Orientation And Recognition Of An Image
    Harsha, T. D.
    Fousiya, K. K.
    PROCEEDINGS OF 2016 ONLINE INTERNATIONAL CONFERENCE ON GREEN ENGINEERING AND TECHNOLOGIES (IC-GET), 2016,
  • [42] Dense RGB-D visual odometry using inverse depth
    Gutierrez-Gomez, Daniel
    Mayol-Cuevas, Walterio
    Guerrero, J. J.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 571 - 583
  • [43] Learning to Weight Color and Depth for RGB-D Visual Search
    Petrelli, Alioscia
    Di Stefano, Luigi
    IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 648 - 659
  • [44] Human Activities Recognition with RGB-Depth Camera using HMM
    Dubois, Amandine
    Charpillet, Francois
    2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013, : 4666 - 4669
  • [45] Isolated Sign Recognition with a Siamese Neural Network of RGB and Depth Streams
    Tur, Anil Osman
    Keles, Hacer Yalim
    PROCEEDINGS OF 18TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES (IEEE EUROCON 2019), 2019,
  • [46] Static Gesture Recognition Based on RGB-D Depth Information
    Wang, Yi
    Dong, Xiucheng
    Li, Changlong
    Yu, Ximu
    ADVANCES IN COMPUTERS, ELECTRONICS AND MECHATRONICS, 2014, 667 : 248 - +
  • [47] Sparse Distance Learning for Object Recognition Combining RGB and Depth Information
    Lai, Kevin
    Bo, Liefeng
    Ren, Xiaofeng
    Fox, Dieter
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [48] Human Action Recognition using Meta Learning for RGB and Depth Information
    Amiri, S. Mohsen
    Pourazad, Mahsa T.
    Nasiopoulos, Panos
    Leung, Victor C. M.
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2014, : 363 - 367
  • [49] Image recognition: Visual grouping, recognition, and learning
    Buhmann, JM
    Malik, J
    Perona, P
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (25) : 14203 - 14204
  • [50] An adaptive converged depth completion network based on efficient RGB guidance
    Kaixiang Liu
    Qingwu Li
    Yaqin Zhou
    Multimedia Tools and Applications, 2022, 81 : 35915 - 35933