Ensemble Learning Approaches Based on Covariance Pooling of CNN Features for High Resolution Remote Sensing Scene Classification

被引:17
|
作者
Akodad, Sara [1 ]
Bombrun, Lionel [1 ]
Xia, Junshi [2 ]
Berthoumieu, Yannick [1 ]
Germain, Christian [1 ]
机构
[1] Univ Bordeaux, Grp Signal & Image, CNRS, UMR 5218,IMS, F-33405 Talence, France
[2] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Tokyo 1030027, Japan
关键词
transfer learning; covariance matrices; log-euclidean metric; ensemble learning; remote sensing scene classification; fisher vector; FRAMEWORK;
D O I
10.3390/rs12203292
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing image scene classification, which consists of labeling remote sensing images with a set of categories based on their content, has received remarkable attention for many applications such as land use mapping. Standard approaches are based on the multi-layer representation of first-order convolutional neural network (CNN) features. However, second-order CNNs have recently been shown to outperform traditional first-order CNNs for many computer vision tasks. Hence, the aim of this paper is to show the use of second-order statistics of CNN features for remote sensing scene classification. This takes the form of covariance matrices computed locally or globally on the output of a CNN. However, these datapoints do not lie in an Euclidean space but a Riemannian manifold. To manipulate them, Euclidean tools are not adapted. Other metrics should be considered such as the log-Euclidean one. This consists of projecting the set of covariance matrices on a tangent space defined at a reference point. In this tangent plane, which is a vector space, conventional machine learning algorithms can be considered, such as the Fisher vector encoding or SVM classifier. Based on this log-Euclidean framework, we propose a novel transfer learning approach composed of two hybrid architectures based on covariance pooling of CNN features, the first is local and the second is global. They rely on the extraction of features from models pre-trained on the ImageNet dataset processed with some machine learning algorithms. The first hybrid architecture consists of an ensemble learning approach with the log-Euclidean Fisher vector encoding of region covariance matrices computed locally on the first layers of a CNN. The second one concerns an ensemble learning approach based on the covariance pooling of CNN features extracted globally from the deepest layers. These two ensemble learning approaches are then combined together based on the strategy of the most diverse ensembles. For validation and comparison purposes, the proposed approach is tested on various challenging remote sensing datasets. Experimental results exhibit a significant gain of approximately 2 % in overall accuracy for the proposed approach compared to a similar state-of-the-art method based on covariance pooling of CNN features (on the UC Merced dataset).
引用
收藏
页码:1 / 19
页数:19
相关论文
共 50 条
  • [41] Improving remote sensing crop classification by argumentation-based conflict resolution in ensemble learning
    Contiu, Stefan
    Groza, Adrian
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 64 : 269 - 286
  • [42] Remote Sensing Image Scene Classification Based on Combining Multiple Features
    Jin, Xuesong
    Zhang, Qiuying
    Sun, Huadong
    Li, Jing
    Han, Xiaowei
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5551 - 5555
  • [43] LEARNING REGION RESPONSE RANKING FEATURES FOR REMOTE SENSING IMAGE SCENE CLASSIFICATION
    Yang, Junyu
    Cheng, Gong
    Yao, Xiwen
    Han, Junwei
    Guo, Lei
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 529 - 532
  • [44] SwinHCST: a deep learning network architecture for scene classification of remote sensing images based on improved CNN and Transformer
    Song, Jiayin
    Fan, Yiming
    Song, Wenlong
    Zhou, Hongwei
    Yang, Liusong
    Huang, Qiqi
    Jiang, Zhuoyuan
    Wang, Chuangqi
    Liao, Ting
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (23) : 7439 - 7463
  • [45] Urban impervious surface extraction based on the deep features of high-resolution remote sensing image and ensemble learning
    Li, Xuetao
    Wang, Pancheng
    Zeng, Yongnian
    [J]. Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (04): : 700 - 711
  • [46] Research on Scene Classification Method of High-Resolution Remote Sensing Images Based on RFPNet
    Zhang, Xin
    Wang, Yongcheng
    Zhang, Ning
    Xu, Dongdong
    Chen, Bo
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (10):
  • [47] Attention based Residual Network for High-Resolution Remote Sensing Imagery Scene Classification
    Fan, Runyu
    Wang, Lizhe
    Feng, Ruyi
    Zhu, Yingqian
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1346 - 1349
  • [48] SCENE CLASSIFICATION OF HIGH RESOLUTION REMOTE SENSING IMAGES VIA SELF-PACED DEEP LEARNING
    Yao, Xiwen
    Yang, Liuqing
    Cheng, Gong
    Han, Junwei
    Guo, Lei
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 521 - 524
  • [49] Fine crop classification in high resolution remote sensing based on deep learning
    Lu, Tingyu
    Wan, Luhe
    Wang, Lei
    [J]. FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
  • [50] High-Resolution Remote Sensing Image Scene Classification by Merging Multilevel Features of Convolutional Neural Networks
    Zhang, Xiaoxia
    Guo, Yong
    Zhang, Xia
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2021, 49 (06) : 1379 - 1391