TRS: Transformers for Remote Sensing Scene Classification

被引:79
|
作者
Zhang, Jianrong [1 ,2 ]
Zhao, Hongwei [1 ,2 ]
Li, Jiao [3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[3] Jilin Univ, Dept Jilin Univ Lib, Changchun 130012, Peoples R China
关键词
transformers; deep convolutional neural networks; multi-head self-attention; remote sensing scene classification; CONVOLUTIONAL NEURAL-NETWORKS; FEATURES; ATTENTION; MODEL; SCALE;
D O I
10.3390/rs13204143
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing scene classification remains challenging due to the complexity and variety of scenes. With the development of attention-based methods, Convolutional Neural Networks (CNNs) have achieved competitive performance in remote sensing scene classification tasks. As an important method of the attention-based model, the Transformer has achieved great success in the field of natural language processing. Recently, the Transformer has been used for computer vision tasks. However, most existing methods divide the original image into multiple patches and encode the patches as the input of the Transformer, which limits the model's ability to learn the overall features of the image. In this paper, we propose a new remote sensing scene classification method, Remote Sensing Transformer (TRS), a powerful "pure CNNs -> Convolution + Transformer -> pure Transformers " structure. First, we integrate self-attention into ResNet in a novel way, using our proposed Multi-Head Self-Attention layer instead of 3 x 3 spatial revolutions in the bottleneck. Then we connect multiple pure Transformer encoders to further improve the representation learning performance completely depending on attention. Finally, we use a linear classifier for classification. We train our model on four public remote sensing scene datasets: UC-Merced, AID, NWPU-RESISC45, and OPTIMAL-31. The experimental results show that TRS exceeds the state-of-the-art methods and achieves higher accuracy.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Recent advances in the application of vision transformers to remote sensing image scene classification
    Kumari, Monika
    Kaul, Ajay
    [J]. REMOTE SENSING LETTERS, 2023, 14 (07) : 722 - 732
  • [2] Vision Transformers for Remote Sensing Image Classification
    Bazi, Yakoub
    Bashmal, Laila
    Rahhal, Mohamad M. Al
    Dayil, Reham Al
    Ajlan, Naif Al
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 20
  • [3] A Hierarchical Approach to Remote Sensing Scene Classification
    Sen, Ozlem
    Keles, Hacer Yalim
    [J]. PFG-JOURNAL OF PHOTOGRAMMETRY REMOTE SENSING AND GEOINFORMATION SCIENCE, 2022, 90 (02): : 161 - 175
  • [4] A Hierarchical Approach to Remote Sensing Scene Classification
    Ozlem Sen
    Hacer Yalim Keles
    [J]. PFG – Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 2022, 90 : 161 - 175
  • [5] Comparison of CNNs for Remote Sensing Scene Classification
    Shafaey, Mayar A.
    Salem, Mohammed A. -M.
    Ebeid, H. M.
    Al-Berry, M. N.
    Tolba, Mohamed F.
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 27 - 32
  • [6] Learning scene-vectors for remote sensing image scene classification
    Datla, Rajeshreddy
    Perveen, Nazil
    Mohan, C. Krishna
    [J]. NEUROCOMPUTING, 2024, 587
  • [7] Remote Sensing Scene Classification by Gated Bidirectional Network
    Sun, Hao
    Li, Siyuan
    Zheng, Xiangtao
    Lu, Xiaoqiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (01): : 82 - 96
  • [8] GLFFNet model for remote sensing image scene classification
    Wang W.
    Deng J.
    Wang X.
    Li Z.
    Yuan P.
    [J]. Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2023, 52 (10): : 1693 - 1702
  • [9] Federated Learning Approach for Remote Sensing Scene Classification
    Ben Youssef, Belgacem
    Alhmidi, Lamyaa
    Bazi, Yakoub
    Zuair, Mansour
    [J]. REMOTE SENSING, 2024, 16 (12)
  • [10] ATTENTION BASED NETWORK FOR REMOTE SENSING SCENE CLASSIFICATION
    Liu, Shaoteng
    Wang, Qi
    Li, Xuelong
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4740 - 4743