Hyperspectral Image Classification Using Multi-Scale Lightweight Transformer

被引:1
|
作者
Gu, Quan [1 ]
Luan, Hongkang [1 ]
Huang, Kaixuan [1 ]
Sun, Yubao [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Minist Educ, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
hyperspectral image classification; multi-scale spectral attention; Transformer; long-range spectral dependence; SPARSE REPRESENTATION;
D O I
10.3390/electronics13050949
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The distinctive feature of hyperspectral images (HSIs) is their large number of spectral bands, which allows us to identify categories of ground objects by capturing discrepancies in spectral information. Convolutional neural networks (CNN) with attention modules effectively improve the classification accuracy of HSI. However, CNNs are not successful in capturing long-range spectral-spatial dependence. In recent years, Vision Transformer (VIT) has received widespread attention due to its excellent performance in acquiring long-range features. However, it requires calculating the pairwise correlation between token embeddings and has the complexity of the square of the number of tokens, which leads to an increase in the computational complexity of the network. In order to cope with this issue, this paper proposes a multi-scale spectral-spatial attention network with frequency-domain lightweight Transformer (MSA-LWFormer) for HSI classification. This method synergistically integrates CNN, attention mechanisms, and Transformer into the spectral-spatial feature extraction module and frequency-domain fused classification module. Specifically, the spectral-spatial feature extraction module employs a multi-scale 2D-CNN with multi-scale spectral attention (MS-SA) to extract the shallow spectral-spatial features and capture the long-range spectral dependence. In addition, The frequency-domain fused classification module designs a frequency-domain lightweight Transformer that employs the Fast Fourier Transform (FFT) to convert features from the spatial domain to the frequency domain, effectively extracting global information and significantly reducing the time complexity of the network. Experiments on three classic hyperspectral datasets show that MSA-LWFormer has excellent performance.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Spatial Feature Extraction for Hyperspectral Image Classification Based on Multi-scale CNN
    Song, Haifeng
    Yang, Weiwei
    [J]. Journal of Computers (Taiwan), 2020, 31 (04) : 174 - 186
  • [32] Multi-Scale Superpixel-Guided Structural Profiles for Hyperspectral Image Classification
    Wang, Nanlan
    Zeng, Xiaoyong
    Duan, Yanjun
    Deng, Bin
    Mo, Yan
    Xie, Zhuojun
    Duan, Puhong
    [J]. SENSORS, 2022, 22 (21)
  • [33] Hyperspectral Image Classification Based on Multi-scale Superpixel Texture Preservation and Fusion
    Tu Bing
    Zhu Yu
    Zhou Chengle
    Chen Siyuan
    He Wei
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (06) : 2207 - 2215
  • [34] MULTI-SCALE DILATED RESIDUAL CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Pooja, Kumari
    Nidamanuri, Rama Rao
    Mishra, Deepak
    [J]. 2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,
  • [35] Hyperspectral Image Classification Based on Multi-Scale Feature Fusion Residual Network
    Deng Ziqing
    Wang Yang
    Zhang Bing
    Ding Zhao
    Bian Lifeng
    Yang Chen
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [36] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE ENCODER-DECODER NETWORK
    Ma, Jingjing
    Wu, Linlin
    Tang, Xu
    Zhang, Xiangrong
    Zhu, Cheng
    Ma, Junyong
    Jiao, Licheng
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1283 - 1286
  • [37] Multi-Scale Efficient Graph-Transformer for Whole Slide Image Classification
    Ding, Saisai
    Li, Juncheng
    Wang, Jun
    Ying, Shihui
    Shi, Jun
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (12) : 5926 - 5936
  • [38] CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
    Chen, Chun-Fu
    Fan, Quanfu
    Panda, Rameswar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 347 - 356
  • [39] MUSIQ: Multi-scale Image Quality Transformer
    Ke, Junjie
    Wang, Qifei
    Wang, Yilin
    Milanfar, Peyman
    Yang, Feng
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5128 - 5137
  • [40] Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms
    Sun, Qian
    Zhao, Guangrui
    Xia, Xinyuan
    Xie, Yu
    Fang, Chenrong
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    [J]. REMOTE SENSING, 2024, 16 (12)