A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification

被引:57
|
作者
Hayat, Munawar [1 ]
Khan, Salman H. [2 ,3 ]
Bennamoun, Mohammed [4 ]
An, Senjian [4 ]
机构
[1] Univ Canberra, Bruce, ACT 2617, Australia
[2] CSIRO, Data61, Canberra, ACT 0200, Australia
[3] Australian Natl Univ, Canberra, ACT 0200, Australia
[4] Univ Western Australia, Crawley, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
Indoor scenes classification; spatial layout variations; scale invariance;
D O I
10.1109/TIP.2016.2599292
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges and the design of rich feature descriptors which are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called "spatial layout and scale invariant convolutional activations" to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel "spatially unstructured" layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. For feasible training of the proposed network for images of indoor scenes, this paper proposes a methodology, which efficiently adapts a trained network model (on a large-scale data) for our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including MIT-67, Scene-15, Sports-8, Graz-02, and NYU data sets.
引用
收藏
页码:4829 / 4841
页数:13
相关论文
共 50 条
  • [41] LARGE-SCALE AUDIO FEATURE EXTRACTION AND SVM FOR ACOUSTIC SCENE CLASSIFICATION
    Geiger, Juergen T.
    Schuller, Bjoern
    Rigoll, Gerhard
    2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [42] GLOBAL AND MULTI-SCALE FEATURE LEARNING FOR REMOTE SENSING SCENE CLASSIFICATION
    Xia, Ziying
    Gan, Guolong
    Liu, Siyu
    Cao, Wei
    Cheng, Jian
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 655 - 658
  • [43] Compressed spatial-spectral feature representation for hyperspectral ground classification
    Zhou Shichao
    Zhao Baojun
    Tang Linbo
    Wang Wenzheng
    JOURNAL OF ENGINEERING-JOE, 2019, 2019 (21): : 7928 - 7931
  • [44] Learning Informative Edge Maps for Indoor Scene Layout Prediction
    Mallya, Arun
    Lazebnik, Svetlana
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 936 - 944
  • [45] Indoor Scene Classification Based on the Bag-of-Words Model of Local Feature Information Gain
    Wang, Rong
    Wang, Zhiliang
    Ma, Xirong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 984 - 987
  • [46] An invariant feature representation for shape retrieval
    Han, SH
    Yang, SY
    PDCAT 2005: Sixth International Conference on Parallel and Distributed Computing, Applications and Technologies, Proceedings, 2005, : 1052 - 1054
  • [47] Hyperspectral Imagery Classification based on Rotation Invariant Spectral-Spatial Feature
    Tao, Chao
    Jin, Jing
    Tang, Yuqi
    Zou, ZhengRong
    2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 422 - 424
  • [48] Omnidirectional Vision for Indoor Spatial Layout Recovery
    Omedes, J.
    Lopez-Nicolas, G.
    Guerrero, J. J.
    INTELLIGENT AUTONOMOUS SYSTEMS 12, VOL 1, 2013, 193 : 355 - 364
  • [49] Riesz Feature Representation: Scale Equivariant Scattering Network for Classification Tasks
    Barisin, Tin
    Angulo, Jesus
    Schladitz, Katja
    Redenbach, Claudia
    SIAM JOURNAL ON IMAGING SCIENCES, 2024, 17 (02): : 1284 - 1313
  • [50] Object clique representation for scene classification
    Chen, Jingjing
    Cao, Xiaochun
    Zhang, Bao
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 2829 - 2832