Self-supervised learning for remote sensing scene classification under the few shot scenario

被引:3
|
作者
Alosaimi, Najd [1 ]
Alhichri, Haikel [1 ]
Bazi, Yakoub [1 ]
Ben Youssef, Belgacem [1 ]
Alajlan, Naif [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11451, Saudi Arabia
来源
SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期
关键词
D O I
10.1038/s41598-022-27313-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Scene classification is a crucial research problem in remote sensing (RS) that has attracted many researchers recently. It has many challenges due to multiple issues, such as: the complexity of remote sensing scenes, the classes overlapping (as a scene may contain objects that belong to foreign classes), and the difficulty of gaining sufficient labeled scenes. Deep learning (DL) solutions and in particular convolutional neural networks (CNN) are now state-of-the-art solution in RS scene classification; however, CNN models need huge amounts of annotated data, which can be costly and time-consuming. On the other hand, it is relatively easy to acquire large amounts of unlabeled images. Recently, Self-Supervised Learning (SSL) is proposed as a method that can learn from unlabeled images, potentially reducing the need for labeling. In this work, we propose a deep SSL method, called RS-FewShotSSL, for RS scene classification under the few shot scenario when we only have a few (less than 20) labeled scenes per class. Under this scenario, typical DL solutions that fine-tune CNN models, pre-trained on the ImageNet dataset, fail dramatically. In the SSL paradigm, a DL model is pre-trained from scratch during the pretext task using the large amounts of unlabeled scenes. Then, during the main or the so-called downstream task, the model is fine-tuned on the labeled scenes. Our proposed RS-FewShotSSL solution is composed of an online network and a target network both using the EfficientNet-B3 CNN model as a feature encoder backbone. During the pretext task, RS-FewShotSSL learns discriminative features from the unlabeled images using cross-view contrastive learning. Different views are generated from each image using geometric transformations and passed to the online and target networks. Then, the whole model is optimized by minimizing the cross-view distance between the online and target networks. To address the problem of limited computation resources available to us, our proposed method uses a novel DL architecture that can be trained using both high-resolution and low-resolution images. During the pretext task, RS-FewShotSSL is trained using low-resolution images, thereby, allowing for larger batch sizes which significantly boosts the performance of the proposed pipeline on the task of RS classification. In the downstream task, the target network is discarded, and the online network is fine-tuned using the few labeled shots or scenes. Here, we use smaller batches of both high-resolution and low-resolution images. This architecture allows RS-FewshotSSL to benefit from both large batch sizes and full image sizes, thereby learning from the large amounts of unlabeled data in an effective way. We tested RS-FewShotSSL on three RS public datasets, and it demonstrated a significant improvement compared to other state-of-the-art methods such as: SimCLR, MoCo, BYOL and IDSSL.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Subspace prototype learning for few-Shot remote sensing scene classification
    Wang, Wuli
    Xing, Lei
    Ren, Peng
    Jiang, Yumeng
    Wang, Ge
    Liu, Baodi
    SIGNAL PROCESSING, 2023, 208
  • [32] Self-Supervised Learning in Remote Sensing
    Wang, Yi
    Albrecht, Conrad M.
    Ait Ali Braham, Nassim
    Mou, Lichao
    Zhu, Xiao Xiang
    IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (04) : 213 - 247
  • [33] DEEP SELF-SUPERVISED LEARNING FOR FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION
    Li, Yu
    Zhang, Lei
    Wei, Wei
    Zhang, Yanning
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 501 - 504
  • [34] Self-Supervised GANs With Similarity Loss for Remote Sensing Image Scene Classification
    Guo, Dongen
    Xia, Ying
    Luo, Xiaobo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 2508 - 2521
  • [35] SCL: Self-supervised contrastive learning for few-shot image classification
    Lim, Jit Yan
    Lim, Kian Ming
    Lee, Chin Poo
    Tan, Yong Xuan
    NEURAL NETWORKS, 2023, 165 : 19 - 30
  • [36] Self-supervised Network Evolution for Few-shot Classification
    Tang, Xuwen
    Teng, Zhu
    Zhang, Baopeng
    Fan, Jianping
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3045 - 3051
  • [37] Class Centralized Dictionary Learning for Few-Shot Remote Sensing Scene Classification
    Wei, Lei
    Xing, Lei
    Zhao, Lifei
    Liu, Baodi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] Unsupervised Few-Shot Continual Learning for Remote Sensing Image Scene Classification
    Anwar Ma'Sum, Muhammad
    Pratama, Mahardhika
    Savitha, Ramasamy
    Liu, Lin
    Habibullah, Ryszard
    Kowalczyk, Ryszard
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [39] A META-LEARNING FRAMEWORK FOR FEW-SHOT CLASSIFICATION OF REMOTE SENSING SCENE
    Zhang, Pei
    Bai, Yunpeng
    Wang, Dong
    Bai, Bendu
    Li, Ying
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4590 - 4594
  • [40] Class Centralized Dictionary Learning for Few-Shot Remote Sensing Scene Classification
    Wei, Lei
    Xing, Lei
    Zhao, Lifei
    Liu, Baodi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20