Ensemble Learning Approaches Based on Covariance Pooling of CNN Features for High Resolution Remote Sensing Scene Classification

被引:16
|
作者
Akodad, Sara [1 ]
Bombrun, Lionel [1 ]
Xia, Junshi [2 ]
Berthoumieu, Yannick [1 ]
Germain, Christian [1 ]
机构
[1] Univ Bordeaux, Grp Signal & Image, CNRS, UMR 5218,IMS, F-33405 Talence, France
[2] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Tokyo 1030027, Japan
关键词
transfer learning; covariance matrices; log-euclidean metric; ensemble learning; remote sensing scene classification; fisher vector; FRAMEWORK;
D O I
10.3390/rs12203292
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing image scene classification, which consists of labeling remote sensing images with a set of categories based on their content, has received remarkable attention for many applications such as land use mapping. Standard approaches are based on the multi-layer representation of first-order convolutional neural network (CNN) features. However, second-order CNNs have recently been shown to outperform traditional first-order CNNs for many computer vision tasks. Hence, the aim of this paper is to show the use of second-order statistics of CNN features for remote sensing scene classification. This takes the form of covariance matrices computed locally or globally on the output of a CNN. However, these datapoints do not lie in an Euclidean space but a Riemannian manifold. To manipulate them, Euclidean tools are not adapted. Other metrics should be considered such as the log-Euclidean one. This consists of projecting the set of covariance matrices on a tangent space defined at a reference point. In this tangent plane, which is a vector space, conventional machine learning algorithms can be considered, such as the Fisher vector encoding or SVM classifier. Based on this log-Euclidean framework, we propose a novel transfer learning approach composed of two hybrid architectures based on covariance pooling of CNN features, the first is local and the second is global. They rely on the extraction of features from models pre-trained on the ImageNet dataset processed with some machine learning algorithms. The first hybrid architecture consists of an ensemble learning approach with the log-Euclidean Fisher vector encoding of region covariance matrices computed locally on the first layers of a CNN. The second one concerns an ensemble learning approach based on the covariance pooling of CNN features extracted globally from the deepest layers. These two ensemble learning approaches are then combined together based on the strategy of the most diverse ensembles. For validation and comparison purposes, the proposed approach is tested on various challenging remote sensing datasets. Experimental results exhibit a significant gain of approximately 2 % in overall accuracy for the proposed approach compared to a similar state-of-the-art method based on covariance pooling of CNN features (on the UC Merced dataset).
引用
收藏
页码:1 / 19
页数:19
相关论文
共 50 条
  • [1] An ensemble learning approach for the classification of remote sensing scenes based on covariance pooling of CNN features
    Akodad, Sara
    Vilfroy, Solene
    Bombrun, Lionel
    Cavalcante, Charles C.
    Germain, Christian
    Berthoumieu, Yannick
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [2] SEMI-SUPERVISED SCENE CLASSIFICATION FOR REMOTE SENSING IMAGES BASED ON CNN AND ENSEMBLE LEARNING
    Dai, Xueyuan
    Wu, Xiaofeng
    Wang, Bin
    Zhang, Liming
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4732 - 4735
  • [3] Remote Sensing Scene Classification Using Multilayer Stacked Covariance Pooling
    He, Nanjun
    Fang, Leyuan
    Li, Shutao
    Plaza, Antonio
    Plaza, Javier
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (12): : 6899 - 6910
  • [4] USING CNN-BASED HIGH-LEVEL FEATURES FOR REMOTE SENSING SCENE CLASSIFICATION
    Fang, Zhengzheng
    Li, Wei
    Zou, Jinyi
    Du, Qian
    [J]. 2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 2610 - 2613
  • [5] DEEP ENSEMBLE LEARNING MODEL BASED ON COVARIANCE POOLING OF MULTI-LAYER CNN FEATURES
    Akodad, Sara
    Bombrun, Lionel
    Puscasu, Maria
    Xia, Junshi
    Germain, Christian
    Berthoumieu, Yannick
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1081 - 1085
  • [6] High-Resolution Remote Sensing Scene Classification Based on Salient Features and DCNN
    Lu Huanhuan
    Liu Tao
    Zhang Hui
    Peng Guofeng
    Zhang Juntong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (20)
  • [7] Continual learning for scene classification of high resolution remote sensing images
    Xi, Jiangbo
    Yan, Ziyun
    Jiang, Wandong
    Xiang, Yaobing
    Xie, Dashuai
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON INFORMATION OPTICS AND PHOTONICS (CIOP 2021), 2021, 12057
  • [8] Remote sensing scene classification with multi-spatial scale frequency covariance pooling
    Wenjie Chen
    Yuan Gao
    Aibin Chen
    Guoxiong Zhou
    Jianwu Wang
    Xiaobo Yang
    RunDong Jiang
    [J]. Multimedia Tools and Applications, 2022, 81 : 30413 - 30435
  • [9] Transferring CNN With Adaptive Learning for Remote Sensing Scene Classification
    Wang, Weiquan
    Chen, Yushi
    Ghamisi, Pedram
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] Remote sensing scene classification with multi-spatial scale frequency covariance pooling
    Chen, Wenjie
    Gao, Yuan
    Chen, Aibin
    Zhou, Guoxiong
    Wang, Jianwu
    Yang, Xiaobo
    Jiang, RunDong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30413 - 30435