Faster and Better: A Lightweight Transformer Network for Remote Sensing Scene Classification

被引:9
|
作者
Huang, Xinyan [1 ,2 ,3 ,4 ]
Liu, Fang [1 ,2 ,3 ,4 ]
Cui, Yuanhao [1 ,2 ,3 ,4 ]
Chen, Puhua [1 ,2 ,3 ,4 ]
Li, Lingling [1 ,2 ,3 ,4 ]
Li, Pengfang [1 ,2 ,3 ,4 ]
机构
[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ, PL, Xian 710071, Peoples R China
[2] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Xian 710071, Peoples R China
[3] Xidian Univ, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Peoples R China
[4] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
lightweight transformer; convolutional neural network; scene classification; remote sensing; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.3390/rs15143645
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing (RS) scene classification has received considerable attention due to its wide applications in the RS community. Many methods based on convolutional neural networks (CNNs) have been proposed to classify complex RS scenes, but they cannot fully capture the context in RS images because of the lack of long-range dependencies (the dependency relationship between two distant elements). Recently, some researchers fine-tuned the large pretrained vision transformer (ViT) on small RS datasets to extract long-range dependencies effectively in RS scenes. However, it usually takes more time to fine-tune the ViT on account of high computational complexity. The lack of good local feature representation in the ViT limits classification performance improvement. To this end, we propose a lightweight transformer network (LTNet) for RS scene classification. First, a multi-level group convolution (MLGC) module is presented. It enriches the diversity of local features and requires a lower computational cost by co-representing multi-level and multi-group features in a single module. Then, based on the MLGC module, a lightweight transformer block, LightFormer, was designed to capture global dependencies with fewer computing resources. Finally, the LTNet was built using the MLGC and LightFormer. The experiments of fine-tuning the LTNet on four RS scene classification datasets demonstrate that the proposed network achieves a competitive classification performance under less training time.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] A Lightweight and Multiscale Network for Remote Sensing Image Scene Classification
    Bai, Lin
    Liu, Qingxin
    Li, Cuiling
    Zhu, Chunlin
    Ye, Zhen
    Xi, Meng
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [2] A Lightweight Dual-Branch Swin Transformer for Remote Sensing Scene Classification
    Zheng, Fujian
    Lin, Shuai
    Zhou, Wei
    Huang, Hong
    [J]. REMOTE SENSING, 2023, 15 (11)
  • [3] Remote Sensing Scene Classification Using Spatial Transformer Fusion Network
    Tong, Shun
    Qi, Kunlun
    Guan, Qingfeng
    Zhu, Qiqi
    Yang, Chao
    Zheng, Jie
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 549 - 552
  • [4] RESFORMER: BRIDGING RESIDUAL NETWORK AND TRANSFORMER FOR REMOTE SENSING SCENE CLASSIFICATION
    Li, Mingteng
    Ma, Jingjing
    Tang, Xu
    Han, Xiao
    Zhu, Cheng
    Jiao, Licheng
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3147 - 3150
  • [5] HELViT: highly efficient lightweight vision transformer for remote sensing image scene classification
    Guo, Dongen
    Wu, Zechen
    Feng, Jiangfan
    Zhou, Zhuoke
    Shen, Zhen
    [J]. APPLIED INTELLIGENCE, 2023, 53 (21) : 24947 - 24962
  • [6] HELViT: highly efficient lightweight vision transformer for remote sensing image scene classification
    Dongen Guo
    Zechen Wu
    Jiangfan Feng
    Zhuoke Zhou
    Zhen Shen
    [J]. Applied Intelligence, 2023, 53 : 24947 - 24962
  • [7] An Efficient and Lightweight Convolutional Neural Network for Remote Sensing Image Scene Classification
    Yu, Donghang
    Xu, Qing
    Guo, Haitao
    Zhao, Chuan
    Lin, Yuzhun
    Li, Daoji
    [J]. SENSORS, 2020, 20 (07)
  • [8] BiShuffleNeXt: A lightweight bi-path network for remote sensing scene classification
    Chen, Zhichao
    Yang, Jie
    Feng, Zhicheng
    Chen, Lifang
    Li, Linhong
    [J]. MEASUREMENT, 2023, 209
  • [9] A lightweight and stochastic depth residual attention network for remote sensing scene classification
    Wang, Xinyu
    Xu, Haixia
    Yuan, Liming
    Wen, Xianbin
    [J]. IET IMAGE PROCESSING, 2023, 17 (11) : 3106 - 3126
  • [10] A Hierarchical Graph-Enhanced Transformer Network for Remote Sensing Scene Classification
    Li, Ziwei
    Xu, Weiming
    Yang, Shiyu
    Wang, Juan
    Su, Hua
    Huang, Zhanchao
    Wu, Sheng
    [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 17 : 20315 - 20330