Pyramid of Spatial Relatons for Scene-Level Land Use Classification

被引：204

作者：

Chen, Shizhi ^{[1
]}

Tian, YingLi ^{[1
,2
]}

机构：

[1] CUNY City Coll, New York, NY 10031 USA

[2] CUNY, Grad Ctr, New York, NY 10031 USA

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2015年 / 53卷 / 04期

关键词：

Bag of words (BOW); geographical image classification; land use classification; pyramid of spatial relatons (PSR); spatial pyramid matching (SPM); URBAN-AREA; IMAGE; FEATURES; POINTS; SIFT;

D O I：

10.1109/TGRS.2014.2351395

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Local feature with bag-of-words (BOW) representation has become one of the most popular approaches in object classification and image retrieval applications in the computer vision community. The recent efforts in the remote sensing community have demonstrated that the BOW approach can also effectively apply to geographic images for the applications of classification and retrieval. However, the BOW representation discards spatial information, which is critical for the remotely sensed land use classification. Several algorithms have incorporated spatial information into the BOWrepresentation by hard encoding coordinates of local features. Such rigid spatial encoding is not robust to translation and rotation variations, which are common characteristics of geographic images. To effectively incorporate spatial information into the BOW model for the land use classification, we propose a pyramid-of-spatial-relatons (PSR) model to capture both absolute and relative spatial relationships of local features. Unlike the conventional cooccurrence approach to describe pairwise spatial relationships between local features, the PSR model employs a novel concept of spatial relation to describe relative spatial relationship of a group of local features. As the result, the storage cost of the PSR model only linearly increases with the visual word codebook size instead of the quadratic relationship as in the cooccurrence approach. The PSR model is robust to translation and rotation variations and demonstrates excellent performance for the application of remotely sensed land use classification. On the Land Use and Land Cover image database, the PSR achieves 8% higher in the classification accuracy than the state of the art. If using only gray images, it outperforms the state of the art by more than 11%.

引用

页码：1947 / 1957

页数：11

共 50 条

[11] Class-Wise Subspace Alignment-Based Unsupervised Adaptive Land Cover Classification in Scene-Level Using Deep Siamese Network
Kalita, Indrajit
Roy, Moumita
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3323 - 3334
[12] DeepScene: Scene classification via convolutional neural network with spatial pyramid pooling
Yee, Pui Sin
Lim, Kian Ming
Lee, Chin Poo
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
[13] Acoustic Scene Classification Using Spatial Pyramid Pooling With Convolutional Neural Networks
Basbug, Ahmet Melih
Sert, Mustafa
2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 128 - 131
[14] Scene classification with context pyramid features
Jiang Y.
Wang R.
Wang C.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2010, 22 (08): : 1366 - 1373
[15] Scene-Level Geographic Image Classification Based on a Covariance Descriptor Using Supervised Collaborative Kernel Coding
Yang, Chunwei
Liu, Huaping
Wang, Shicheng
Liao, Shouyi
SENSORS, 2016, 16 (03):
[16] Land-use classification with biologically inspired color descriptor and sparse coding spatial pyramid matching
Tian Tian
Yun Zhang
Hao Dou
Hengjian Tong
Multimedia Tools and Applications, 2017, 76 : 22943 - 22958
[17] Land-use classification with biologically inspired color descriptor and sparse coding spatial pyramid matching
Tian, Tian
Zhang, Yun
Dou, Hao
Tong, Hengjian
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (21) : 22943 - 22958
[18] Image Parsing with a Wide Range of Classes and Scene-Level Context
George, Marian
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3622 - 3630
[19] SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement
Bauer, Dominik
Patten, Timothy
Vincze, Markus
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 196 - 204
[20] Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects
Mitash, Chaitanya
Wen, Bowen
Bekris, Kostas
Boularias, Abdeslam
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100

← 1 2 3 4 5 →