Weakly supervised scale adaptation data augmentation for scene classification of high-resolution remote sensing images

被引:0
|
作者
Wang, Liming [1 ]
Qi, Kunlun [2 ]
Yang, Chao [1 ,2 ]
Wu, Huayi [3 ]
机构
[1] School of Geography and Information Engineering, China University of Geosciences (Wuhan), Wuhan,430074, China
[2] National Engineering Research Center of Geographic Information System, Wuhan,430074, China
[3] State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan,430079, China
关键词
Classification (of information) - Convolution - Convolutional neural networks - Deep learning - Image classification - Image enhancement - Image fusion;
D O I
10.11834/jrs.20221481
中图分类号
学科分类号
摘要
Scene classification of remote sensing images aims to assign a meaningful label to a given image. In recent years, Convolutional Neural Networks (CNNs)-based methods make a breakthrough and substantially outperform traditional methods in scene classification tasks of remote sensing images. However, obtaining features under different scales in remote sensing images is difficult due to the fixed receptive field of CNNs. This complexity seriously affects the performance of CNNs in scene classification of remote sensing images. This study proposes a method to learn the optimal scales for different scene image instances in a weakly supervised manner. A Weakly Supervised Scale Adaptive Data Augmentation Network (WSADAN) is proposed to capture feature information at different scales of remote sensing scenes, and a scale generation module and a scale fusion module are designed to improve the robustness. The scale generation module learns the optimal scale parameters based on the CNN features of the original image. The scale fusion module filters the CNN features of images with original and optimal scales to remove the noise and then deeply fuses them to exploit the correlation between features at different scales. The deeply fused multi-scale features are input into a fully connected layer to predict categories of scene images. The effectiveness of the scale generation and scale fusion modules is verified by ablation experiments. The accuracy of WSADANSGM compared with the baseline improves by 0.94% and 0.89% for the 20% and 50% training data ratios of RSSCN7 dataset, 1.27% and 0.87% for the 20% and 50% training data ratios of AID dataset, and 1.09% and 0.71% for the 10% and 20% training data ratios of NWPU dataset, respectively. Compared with WSADANSGM, WSADANSGM+SFM improves by 1.65% and 1.32% for the RSSCN7 dataset at 20% and 50% training data ratios, 1.65% and 1.26% for the AID dataset at 20% and 50% training data ratios, and 1.75% and 1.42% for the NWPU dataset at 10% and 20% training data ratios, respectively. In the experiment for scene scale change analysis, the classification accuracy of our method is higher than the baseline at any scale of image, which proves that our method can learn certain image scale information and has strong scale adaptation ability. We use three datasets for remote sensing scene classification, namely, RSSCN7, AID, and NWPU, for the experiments. On the RSSCN7 dataset, the overall accuracies are 91.65% and 94.07% with the training ratios of 20% and 50% for WSADANVGG16. For WSADAN-ResNet50, the corresponding accuracies are 92.69% and 94.82%. On the AID dataset, the overall accuracies are 92.78% and 95.18% with the training ratios of 20% and 50% for WSADAN-VGG16. For WSADAN-ResNet50, the corresponding accuracies are 93.73% and 95.88%. On the NWPU dataset, the overall accuracies are 87.01% and 90.44% with the training ratios of 10% and 20% for WSADAN-VGG16. For WSADAN-ResNet50, the corresponding accuracies are 90.71% and 92.63%. The proposed method can learn CNN features at a wider range of scales without manual multi-scale selection for different datasets. The performance of the proposed method is better than that of traditional CNNs, especially for the scene categories containing objects with large-scale variations. © 2023 Science Press. All rights reserved.
引用
收藏
页码:2815 / 2830
相关论文
共 50 条
  • [1] Scene classification of high-resolution remote sensing images based on IMFNet
    Zhang, Xin
    Wang, Yongcheng
    Zhang, Ning
    Xu, Dongdong
    Chen, Bo
    Ben, Guangli
    Wang, Xue
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2019, 13 (04)
  • [2] Evaluation of Convnets for Large-Scale Scene Classification From High-Resolution Remote Sensing Images
    Pilipovic, Ratko
    Risojevic, Vladimir
    [J]. 17TH IEEE INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES - IEEE EUROCON 2017 CONFERENCE PROCEEDINGS, 2017, : 932 - 937
  • [3] Semi-Supervised Subcategory Centroid Alignment-Based Scene Classification for High-Resolution Remote Sensing Images †
    Mo, Nan
    Zhu, Ruixi
    [J]. Remote Sensing, 2024, 16 (19)
  • [4] Weakly Supervised Road Segmentation in High-Resolution Remote Sensing Images Using Point Annotations
    Lian, Renbao
    Huang, Liqin
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] Research on Scene Classification Method of High-Resolution Remote Sensing Images Based on RFPNet
    Zhang, Xin
    Wang, Yongcheng
    Zhang, Ning
    Xu, Dongdong
    Chen, Bo
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (10):
  • [6] Semi-Supervised DEGAN for Optical High-Resolution Remote Sensing Image Scene Classification
    Li, Jia
    Liao, Yujia
    Zhang, Junjie
    Zeng, Dan
    Qian, Xiaoliang
    [J]. REMOTE SENSING, 2022, 14 (17)
  • [7] Continual learning for scene classification of high resolution remote sensing images
    Xi, Jiangbo
    Yan, Ziyun
    Jiang, Wandong
    Xiang, Yaobing
    Xie, Dashuai
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON INFORMATION OPTICS AND PHOTONICS (CIOP 2021), 2021, 12057
  • [8] Classification of High-Resolution Remote Sensing Images based on Multi-Scale Superposition
    Wang, Jinliang
    Gao, Wenjie
    Liu, Guangjie
    [J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [9] Deep Differential Coding for High-Resolution Remote Sensing Scene Classification
    Shi, Qiuping
    Li, Jie
    Jiao, Zhicheng
    Wang, Ying
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS PROCESSING (ICIGP 2018), 2018, : 71 - 77
  • [10] Deep feature representations for high-resolution remote sensing scene classification
    Zhou, Weixun
    Shao, Zhenfeng
    Cheng, Qimin
    [J]. 2016 4RTH INTERNATIONAL WORKSHOP ON EARTH OBSERVATION AND REMOTE SENSING APPLICATIONS (EORSA), 2016,