TRS: Transformers for Remote Sensing Scene Classification

被引:79
|
作者
Zhang, Jianrong [1 ,2 ]
Zhao, Hongwei [1 ,2 ]
Li, Jiao [3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[3] Jilin Univ, Dept Jilin Univ Lib, Changchun 130012, Peoples R China
关键词
transformers; deep convolutional neural networks; multi-head self-attention; remote sensing scene classification; CONVOLUTIONAL NEURAL-NETWORKS; FEATURES; ATTENTION; MODEL; SCALE;
D O I
10.3390/rs13204143
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing scene classification remains challenging due to the complexity and variety of scenes. With the development of attention-based methods, Convolutional Neural Networks (CNNs) have achieved competitive performance in remote sensing scene classification tasks. As an important method of the attention-based model, the Transformer has achieved great success in the field of natural language processing. Recently, the Transformer has been used for computer vision tasks. However, most existing methods divide the original image into multiple patches and encode the patches as the input of the Transformer, which limits the model's ability to learn the overall features of the image. In this paper, we propose a new remote sensing scene classification method, Remote Sensing Transformer (TRS), a powerful "pure CNNs -> Convolution + Transformer -> pure Transformers " structure. First, we integrate self-attention into ResNet in a novel way, using our proposed Multi-Head Self-Attention layer instead of 3 x 3 spatial revolutions in the bottleneck. Then we connect multiple pure Transformer encoders to further improve the representation learning performance completely depending on attention. Finally, we use a linear classifier for classification. We train our model on four public remote sensing scene datasets: UC-Merced, AID, NWPU-RESISC45, and OPTIMAL-31. The experimental results show that TRS exceeds the state-of-the-art methods and achieves higher accuracy.
引用
收藏
页数:24
相关论文
共 50 条
  • [11] Remote Sensing Scene Classification with Masked Image Modeling
    Wang, Liya
    Tien, Alex
    [J]. MICROWAVE REMOTE SENSING: DATA PROCESSING AND APPLICATIONS II, 2023, 12732
  • [12] Explaining the Effects of Clouds on Remote Sensing Scene Classification
    Gawlikowski, Jakob
    Ebel, Patrick
    Schmitt, Michael
    Zhu, Xiao Xiang
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9976 - 9986
  • [13] AN INTROSPECTIVE LEARNING STRATEGY FOR REMOTE SENSING SCENE CLASSIFICATION
    Su, Jingran
    Wang, Qi
    Chen, Shangdong
    Li, Xuelong
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 533 - 536
  • [14] Continual Learning Approach for Remote Sensing Scene Classification
    Ammour, Nassim
    Bazi, Yakoub
    Alhichri, Haikel
    Alajlan, Naif
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [15] Remote Sensing Scene Classification by Unsupervised Representation Learning
    Lu, Xiaoqiang
    Zheng, Xiangtao
    Yuan, Yuan
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (09): : 5148 - 5157
  • [16] Searching for CNN Architectures for Remote Sensing Scene Classification
    Broni-Bediako, Clifford
    Murata, Yuki
    Mormille, Luiz H. B.
    Atsumi, Masayasu
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [17] Attention Consistent Network for Remote Sensing Scene Classification
    Tang, Xu
    Ma, Qiushuo
    Zhang, Xiangrong
    Liu, Fang
    Ma, Jingjing
    Jiao, Licheng
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 2030 - 2045
  • [18] Better Visual Interpretation for Remote Sensing Scene Classification
    Huang, Xu
    Sun, Yuxi
    Feng, Shanshan
    Ye, Yunming
    Li, Xutao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [19] Integration of heterogeneous features for remote sensing scene classification
    Wang, Xin
    Xiong, Xingnan
    Ning, Chen
    Shi, Aiye
    Lv, Guofang
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (01):
  • [20] A Multiscale Attention Network for Remote Sensing Scene Images Classification
    Zhang, Guokai
    Xu, Weizhe
    Zhao, Wei
    Huang, Chenxi
    Ng, Eddie Yk
    Chen, Yongyong
    Su, Jian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 9530 - 9545