Ensemble Learning Approaches Based on Covariance Pooling of CNN Features for High Resolution Remote Sensing Scene Classification

被引：17

作者：

Akodad, Sara ^{[1
]}

Bombrun, Lionel ^{[1
]}

Xia, Junshi ^{[2
]}

Berthoumieu, Yannick ^{[1
]}

Germain, Christian ^{[1
]}

机构：

[1] Univ Bordeaux, Grp Signal & Image, CNRS, UMR 5218,IMS, F-33405 Talence, France

[2] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Tokyo 1030027, Japan

来源：

REMOTE SENSING | 2020年 / 12卷 / 20期

关键词：

transfer learning; covariance matrices; log-euclidean metric; ensemble learning; remote sensing scene classification; fisher vector; FRAMEWORK;

D O I：

10.3390/rs12203292

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Remote sensing image scene classification, which consists of labeling remote sensing images with a set of categories based on their content, has received remarkable attention for many applications such as land use mapping. Standard approaches are based on the multi-layer representation of first-order convolutional neural network (CNN) features. However, second-order CNNs have recently been shown to outperform traditional first-order CNNs for many computer vision tasks. Hence, the aim of this paper is to show the use of second-order statistics of CNN features for remote sensing scene classification. This takes the form of covariance matrices computed locally or globally on the output of a CNN. However, these datapoints do not lie in an Euclidean space but a Riemannian manifold. To manipulate them, Euclidean tools are not adapted. Other metrics should be considered such as the log-Euclidean one. This consists of projecting the set of covariance matrices on a tangent space defined at a reference point. In this tangent plane, which is a vector space, conventional machine learning algorithms can be considered, such as the Fisher vector encoding or SVM classifier. Based on this log-Euclidean framework, we propose a novel transfer learning approach composed of two hybrid architectures based on covariance pooling of CNN features, the first is local and the second is global. They rely on the extraction of features from models pre-trained on the ImageNet dataset processed with some machine learning algorithms. The first hybrid architecture consists of an ensemble learning approach with the log-Euclidean Fisher vector encoding of region covariance matrices computed locally on the first layers of a CNN. The second one concerns an ensemble learning approach based on the covariance pooling of CNN features extracted globally from the deepest layers. These two ensemble learning approaches are then combined together based on the strategy of the most diverse ensembles. For validation and comparison purposes, the proposed approach is tested on various challenging remote sensing datasets. Experimental results exhibit a significant gain of approximately 2 % in overall accuracy for the proposed approach compared to a similar state-of-the-art method based on covariance pooling of CNN features (on the UC Merced dataset).

引用

页码：1 / 19

页数：19

共 50 条

[21] Enhanced multi-level features for very high resolution remote sensing scene classification
Sitaula, Chiranjibi
Sumesh, K. C.
Aryal, Jagannath
[J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7071 - 7083
[22] Enhanced multi-level features for very high resolution remote sensing scene classification
Chiranjibi Sitaula
Sumesh KC
Jagannath Aryal
[J]. Neural Computing and Applications, 2024, 36 : 7071 - 7083
[23] Searching for CNN Architectures for Remote Sensing Scene Classification
Broni-Bediako, Clifford
Murata, Yuki
Mormille, Luiz H. B.
Atsumi, Masayasu
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[24] Classification Method of High-Resolution Remote Sensing Scene Image Based on Dictionary Learning and Vision Transformer
He Xiaojun
Liu Xuan
Wei Xian
[J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (14)
[25] Complex Scene Classification of High Resolution Remote Sensing Images Based on DCNN Model
Chen, Dexi
Hu, Peng
Duan, Xuelin
[J]. 2019 10TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2019,
[26] GLR-CNN: CNN-Based Framework With Global Latent Relationship Embedding for High-Resolution Remote Sensing Image Scene Classification
Liu, Li
Wang, Yuebin
Peng, Junhuan
Zhang, Liqiang
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[27] Improved Metric Learning With the CNN for Very-High-Resolution Remote Sensing Image Classification
Shi, Cheng
Lv, Zhiyong
Shen, Huifang
Fang, Li
You, Zhenzhen
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 631 - 644
[28] A Combined Deep Learning Model for the Scene Classification of High-Resolution Remote Sensing Image
Dong, Yunya
Zhang, Qian
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (10) : 1540 - 1544
[29] Semantic Multigranularity Feature Learning for High-Resolution Remote Sensing Image Scene Classification
Ma, Xinyi
Xiao, Zhifeng
Yun, Hong-sik
Lee, Seung-Jun
[J]. APPLIED SCIENCES-BASEL, 2021, 11 (19):
[30] A semi-supervised generative framework with deep learning features for high-resolution remote sensing image scene classification
Han, Wei
Feng, Ruyi
Wang, Lizhe
Cheng, Yafan
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 23 - 43

← 1 2 3 4 5 →