Pyramid of Spatial Relatons for Scene-Level Land Use Classification

Cited by: 204
Authors
Chen, Shizhi [1]
Tian, YingLi [1, 2]
Affiliations
[1] CUNY City Coll, New York, NY 10031 USA
[2] CUNY, Grad Ctr, New York, NY 10031 USA
Source
Keywords
Bag of words (BOW); geographical image classification; land use classification; pyramid of spatial relatons (PSR); spatial pyramid matching (SPM); URBAN-AREA; IMAGE; FEATURES; POINTS; SIFT;
DOI
10.1109/TGRS.2014.2351395
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
Local feature with bag-of-words (BOW) representation has become one of the most popular approaches in object classification and image retrieval applications in the computer vision community. The recent efforts in the remote sensing community have demonstrated that the BOW approach can also effectively apply to geographic images for the applications of classification and retrieval. However, the BOW representation discards spatial information, which is critical for the remotely sensed land use classification. Several algorithms have incorporated spatial information into the BOW representation by hard encoding coordinates of local features. Such rigid spatial encoding is not robust to translation and rotation variations, which are common characteristics of geographic images. To effectively incorporate spatial information into the BOW model for the land use classification, we propose a pyramid-of-spatial-relatons (PSR) model to capture both absolute and relative spatial relationships of local features. Unlike the conventional cooccurrence approach to describe pairwise spatial relationships between local features, the PSR model employs a novel concept of spatial relation to describe relative spatial relationship of a group of local features. As the result, the storage cost of the PSR model only linearly increases with the visual word codebook size instead of the quadratic relationship as in the cooccurrence approach. The PSR model is robust to translation and rotation variations and demonstrates excellent performance for the application of remotely sensed land use classification. On the Land Use and Land Cover image database, the PSR achieves 8% higher in the classification accuracy than the state of the art. If using only gray images, it outperforms the state of the art by more than 11%.
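Two claims in the abstract can be illustrated with a small sketch: a plain BOW histogram ignores where local features sit in the image, and a pairwise cooccurrence table grows quadratically with the codebook size while a per-word descriptor grows linearly. The code below is not the authors' PSR implementation; the function names, the toy features, and the `bins_per_word` parameter are assumptions made for illustration only.

```python
from collections import Counter


def bow_histogram(word_ids, codebook_size):
    """Count how often each visual word occurs; feature positions are ignored."""
    counts = Counter(word_ids)
    return [counts.get(k, 0) for k in range(codebook_size)]


def cooccurrence_storage(codebook_size):
    """Bins needed for one count per unordered pair of visual words
    (same-word pairs included): K * (K + 1) / 2, i.e. quadratic in K."""
    return codebook_size * (codebook_size + 1) // 2


def linear_storage(codebook_size, bins_per_word=1):
    """Bins needed when each visual word keeps a fixed number of bins.
    bins_per_word is a hypothetical parameter, not a value from the paper."""
    return codebook_size * bins_per_word


# Toy local features as (visual word id, (x, y)) pairs. The two images hold
# the same visual words, but the words sit at different positions.
features_a = [(0, (10, 10)), (2, (50, 20)), (2, (30, 80))]
features_b = [(2, (10, 10)), (0, (50, 20)), (2, (30, 80))]

hist_a = bow_histogram([word for word, _ in features_a], codebook_size=4)
hist_b = bow_histogram([word for word, _ in features_b], codebook_size=4)

# Identical histograms: the spatial layout is lost.
print(hist_a == hist_b)

# Storage growth for a 1000-word codebook.
print(cooccurrence_storage(1000), linear_storage(1000))
```

The two toy images produce the same histogram even though their layouts differ, which is the loss of spatial information that the abstract describes.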
Pages: 1947 - 1957
Number of pages: 11
Related papers
50 in total
  • [11] Class-Wise Subspace Alignment-Based Unsupervised Adaptive Land Cover Classification in Scene-Level Using Deep Siamese Network
    Kalita, Indrajit
    Roy, Moumita
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3323 - 3334
  • [12] DeepScene: Scene classification via convolutional neural network with spatial pyramid pooling
    Yee, Pui Sin
    Lim, Kian Ming
    Lee, Chin Poo
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
  • [13] Acoustic Scene Classification Using Spatial Pyramid Pooling With Convolutional Neural Networks
    Basbug, Ahmet Melih
    Sert, Mustafa
    2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 128 - 131
  • [14] Scene classification with context pyramid features
    Jiang Y.
    Wang R.
    Wang C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2010, 22 (08) : 1366 - 1373
  • [15] Scene-Level Geographic Image Classification Based on a Covariance Descriptor Using Supervised Collaborative Kernel Coding
    Yang, Chunwei
    Liu, Huaping
    Wang, Shicheng
    Liao, Shouyi
    SENSORS, 2016, 16 (03):
  • [16] Land-use classification with biologically inspired color descriptor and sparse coding spatial pyramid matching
    Tian Tian
    Yun Zhang
    Hao Dou
    Hengjian Tong
    Multimedia Tools and Applications, 2017, 76 : 22943 - 22958
  • [17] Land-use classification with biologically inspired color descriptor and sparse coding spatial pyramid matching
    Tian, Tian
    Zhang, Yun
    Dou, Hao
    Tong, Hengjian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (21) : 22943 - 22958
  • [18] Image Parsing with a Wide Range of Classes and Scene-Level Context
    George, Marian
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3622 - 3630
  • [19] SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement
    Bauer, Dominik
    Patten, Timothy
    Vincze, Markus
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 196 - 204
  • [20] Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects
    Mitash, Chaitanya
    Wen, Bowen
    Bekris, Kostas
    Boularias, Abdeslam
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100