TRS: Transformers for Remote Sensing Scene Classification

被引:79
|
作者
Zhang, Jianrong [1 ,2 ]
Zhao, Hongwei [1 ,2 ]
Li, Jiao [3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[3] Jilin Univ, Dept Jilin Univ Lib, Changchun 130012, Peoples R China
关键词
transformers; deep convolutional neural networks; multi-head self-attention; remote sensing scene classification; CONVOLUTIONAL NEURAL-NETWORKS; FEATURES; ATTENTION; MODEL; SCALE;
D O I
10.3390/rs13204143
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing scene classification remains challenging due to the complexity and variety of scenes. With the development of attention-based methods, Convolutional Neural Networks (CNNs) have achieved competitive performance in remote sensing scene classification tasks. As an important method of the attention-based model, the Transformer has achieved great success in the field of natural language processing. Recently, the Transformer has been used for computer vision tasks. However, most existing methods divide the original image into multiple patches and encode the patches as the input of the Transformer, which limits the model's ability to learn the overall features of the image. In this paper, we propose a new remote sensing scene classification method, Remote Sensing Transformer (TRS), a powerful "pure CNNs -> Convolution + Transformer -> pure Transformers " structure. First, we integrate self-attention into ResNet in a novel way, using our proposed Multi-Head Self-Attention layer instead of 3 x 3 spatial revolutions in the bottleneck. Then we connect multiple pure Transformer encoders to further improve the representation learning performance completely depending on attention. Finally, we use a linear classifier for classification. We train our model on four public remote sensing scene datasets: UC-Merced, AID, NWPU-RESISC45, and OPTIMAL-31. The experimental results show that TRS exceeds the state-of-the-art methods and achieves higher accuracy.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Efficient recurrent attention network for remote sensing scene classification
    Liang, Le
    Wang, Guoli
    [J]. IET IMAGE PROCESSING, 2021, 15 (08) : 1712 - 1721
  • [22] Remote Sensing Image Scene Classification Based on Fusion Method
    Yin, Liancheng
    Yang, Peiyi
    Mao, Keming
    Liu, Qian
    [J]. JOURNAL OF SENSORS, 2021, 2021
  • [23] Semisupervised Center Loss for Remote Sensing Image Scene Classification
    Zhang, Jun
    Zhang, Min
    Pan, Bin
    Shi, Zhenwei
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 1362 - 1373
  • [24] Pairwise Comparison Network for Remote-Sensing Scene Classification
    Zhang, Yue
    Zheng, Xiangtao
    Lu, Xiaoqiang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [25] REMOTE SENSING SCENE CLASSIFICATION BASED ON RES-CAPSNET
    Tian, Tian
    Liu, Xiaoyan
    Wang, Lizhe
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 525 - 528
  • [26] Scene Classification in Remote Sensing Images using Dynamic Kernels
    Datla, Rajeshreddy
    Chalavadi, Vishnu
    Mohan, Krishna C.
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [27] Knowledge Guided Evolutionary Transformer for Remote Sensing Scene Classification
    Zhao, Jiaxuan
    Jiao, Licheng
    Wang, Chao
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Ma, Mengru
    Yang, Shuyuan
    [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (10) : 10368 - 10384
  • [28] Scene Classification With Recurrent Attention of VHR Remote Sensing Images
    Wang, Qi
    Liu, Shaoteng
    Chanussot, Jocelyn
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (02): : 1155 - 1167
  • [29] A Fast Deep Perception Network for Remote Sensing Scene Classification
    Dong, Ruchan
    Xu, Dazhuan
    Jiao, Lichen
    Zhao, Jin
    An, Jungang
    [J]. REMOTE SENSING, 2020, 12 (04)
  • [30] Relation-Attention Networks for Remote Sensing Scene Classification
    Wang, Xin
    Duan, Lin
    Ning, Chen
    Zhou, Huiyu
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 422 - 439