Faster and Better: A Lightweight Transformer Network for Remote Sensing Scene Classification

被引:9
|
作者
Huang, Xinyan [1 ,2 ,3 ,4 ]
Liu, Fang [1 ,2 ,3 ,4 ]
Cui, Yuanhao [1 ,2 ,3 ,4 ]
Chen, Puhua [1 ,2 ,3 ,4 ]
Li, Lingling [1 ,2 ,3 ,4 ]
Li, Pengfang [1 ,2 ,3 ,4 ]
机构
[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ, PL, Xian 710071, Peoples R China
[2] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Xian 710071, Peoples R China
[3] Xidian Univ, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Peoples R China
[4] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
lightweight transformer; convolutional neural network; scene classification; remote sensing; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.3390/rs15143645
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing (RS) scene classification has received considerable attention due to its wide applications in the RS community. Many methods based on convolutional neural networks (CNNs) have been proposed to classify complex RS scenes, but they cannot fully capture the context in RS images because of the lack of long-range dependencies (the dependency relationship between two distant elements). Recently, some researchers fine-tuned the large pretrained vision transformer (ViT) on small RS datasets to extract long-range dependencies effectively in RS scenes. However, it usually takes more time to fine-tune the ViT on account of high computational complexity. The lack of good local feature representation in the ViT limits classification performance improvement. To this end, we propose a lightweight transformer network (LTNet) for RS scene classification. First, a multi-level group convolution (MLGC) module is presented. It enriches the diversity of local features and requires a lower computational cost by co-representing multi-level and multi-group features in a single module. Then, based on the MLGC module, a lightweight transformer block, LightFormer, was designed to capture global dependencies with fewer computing resources. Finally, the LTNet was built using the MLGC and LightFormer. The experiments of fine-tuning the LTNet on four RS scene classification datasets demonstrate that the proposed network achieves a competitive classification performance under less training time.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] A Multiscale Attention Network for Remote Sensing Scene Images Classification
    Zhang, Guokai
    Xu, Weizhe
    Zhao, Wei
    Huang, Chenxi
    Ng, Eddie Yk
    Chen, Yongyong
    Su, Jian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 9530 - 9545
  • [32] Pairwise Comparison Network for Remote-Sensing Scene Classification
    Zhang, Yue
    Zheng, Xiangtao
    Lu, Xiaoqiang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] Efficient recurrent attention network for remote sensing scene classification
    Liang, Le
    Wang, Guoli
    [J]. IET IMAGE PROCESSING, 2021, 15 (08) : 1712 - 1721
  • [34] A Fast Deep Perception Network for Remote Sensing Scene Classification
    Dong, Ruchan
    Xu, Dazhuan
    Jiao, Lichen
    Zhao, Jin
    An, Jungang
    [J]. REMOTE SENSING, 2020, 12 (04)
  • [35] Scene Classification of Remote Sensing Images Based on RCF Network
    Zhu Shuxin
    Zhou Zijun
    Gu Xingjian
    Ren Shougang
    Xu Huanliang
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (14)
  • [36] A Multiscale Incremental Learning Network for Remote Sensing Scene Classification
    Ye, Zhen
    Zhang, Yu
    Zhang, Jinxin
    Li, Wei
    Bai, Lin
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [37] Multilayer Feature Fusion Network for Scene Classification in Remote Sensing
    Xu, Kejie
    Huang, Hong
    Li, Yuan
    Shi, Guangyao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (11) : 1894 - 1898
  • [38] Context Residual Attention Network for Remote Sensing Scene Classification
    Wang, Yuhua
    Hu, Yaxin
    Xu, Yuezhu
    Jiao, Peiyuan
    Zhang, Xiangrong
    Cui, Huanyu
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [39] MSNet: A Multiple Supervision Network for Remote Sensing Scene Classification
    Liu, Nanqing
    Celik, Turgay
    Li, Heng-Chao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [40] Positional Context Aggregation Network for Remote Sensing Scene Classification
    Zhang, Dong
    Li, Nan
    Ye, Qiaolin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (06) : 943 - 947