A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification

Cited by: 57
Authors
Hayat, Munawar [1]
Khan, Salman H. [2, 3]
Bennamoun, Mohammed [4]
An, Senjian [4]
Affiliations
[1] Univ Canberra, Bruce, ACT 2617, Australia
[2] CSIRO, Data61, Canberra, ACT 0200, Australia
[3] Australian Natl Univ, Canberra, ACT 0200, Australia
[4] Univ Western Australia, Crawley, WA 6009, Australia
Funding
Australian Research Council
Keywords
Indoor scenes classification; spatial layout variations; scale invariance;
DOI
10.1109/TIP.2016.2599292
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Unlike standard object classification, where the image to be classified contains one or more instances of the same object, indoor scene classification deals with images that consist of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges, and the design of rich feature descriptors that are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called "spatial layout and scale invariant convolutional activations" to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel "spatially unstructured" layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. To make training of the proposed network feasible for images of indoor scenes, this paper proposes a methodology that efficiently adapts a network model trained on large-scale data to our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including MIT-67, Scene-15, Sports-8, Graz-02, and NYU.
Pages: 4829-4841
Page count: 13