Adaptive Learnable Spectral-Spatial Fusion Transformer for Hyperspectral Image Classification

被引:1
|
作者
Wang, Minhui [1 ,2 ]
Sun, Yaxiu [1 ,2 ]
Xiang, Jianhong [1 ,2 ]
Sun, Rui [1 ,2 ]
Zhong, Yu [3 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150001, Peoples R China
[2] Harbin Engn Univ, Key Lab Adv Ship Commun & Informat Technol, Harbin 150001, Peoples R China
[3] Agile & Intelligent Comp Key Lab Sichuan Prov, Chengdu 610000, Peoples R China
关键词
hyperspectral image (HSI); convolutional neural network (CNN); vision transformer; spectral-spatial features fusion; REMOTE-SENSING IMAGES; DISTANCE;
D O I
10.3390/rs16111912
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In hyperspectral image classification (HSIC), every pixel of the HSI is assigned to a land cover category. While convolutional neural network (CNN)-based methods for HSIC have significantly enhanced performance, they encounter challenges in learning the relevance of deep semantic features and grappling with escalating computational costs as network depth increases. In contrast, the transformer framework is adept at capturing the relevance of high-level semantic features, presenting an effective solution to address the limitations encountered by CNN-based approaches. This article introduces a novel adaptive learnable spectral-spatial fusion transformer (ALSST) to enhance HSI classification. The model incorporates a dual-branch adaptive spectral-spatial fusion gating mechanism (ASSF), which captures spectral-spatial fusion features effectively from images. The ASSF comprises two key components: the point depthwise attention module (PDWA) for spectral feature extraction and the asymmetric depthwise attention module (ADWA) for spatial feature extraction. The model efficiently obtains spectral-spatial fusion features by multiplying the outputs of these two branches. Furthermore, we integrate the layer scale and DropKey into the traditional transformer encoder and multi-head self-attention (MHSA) to form a new transformer with a layer scale and DropKey (LD-Former). This innovation enhances data dynamics and mitigates performance degradation in deeper encoder layers. The experiments detailed in this article are executed on four renowned datasets: Trento (TR), MUUFL (MU), Augsburg (AU), and the University of Pavia (UP). The findings demonstrate that the ALSST model secures optimal performance, surpassing some existing models. The overall accuracy (OA) is 99.70%, 89.72%, 97.84%, and 99.78% on four famous datasets: Trento (TR), MUUFL (MU), Augsburg (AU), and University of Pavia (UP), respectively.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] A Spectral-Spatial Fusion Transformer Network for Hyperspectral Image Classification
    Liao, Diling
    Shi, Cuiping
    Wang, Liguo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] Interactive Spectral-Spatial Transformer for Hyperspectral Image Classification
    Song, Liangliang
    Feng, Zhixi
    Yang, Shuyuan
    Zhang, Xinyu
    Jiao, Licheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8589 - 8601
  • [3] Fusion of Spectral-Spatial Classifiers for Hyperspectral Image Classification
    Zhong, Shengwei
    Chen, Shuhan
    Chang, Chein-, I
    Zhang, Ye
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 5008 - 5027
  • [4] Efficient Spectral-Spatial Fusion With Multiscale and Adaptive Attention for Hyperspectral Image Classification
    Wan, Xiaoqing
    Chen, Feng
    Gao, Weizhe
    He, Yupeng
    Liu, Hui
    Li, Zhize
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 1196 - 1211
  • [5] Hierarchical Spectral-Spatial Transformer for Hyperspectral and Multispectral Image Fusion
    Zhu, Tianxing
    Liu, Qin
    Zhang, Lixiang
    REMOTE SENSING, 2024, 16 (22)
  • [6] Spectral-Spatial Morphological Attention Transformer for Hyperspectral Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Shah, Chiranjibi
    Haut, Juan M.
    Du, Qian
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [7] Spectral-Spatial Morphological Attention Transformer for Hyperspectral Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Shah, Chiranjibi
    Haut, Juan M.
    Du, Qian
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [8] MultiScale spectral-spatial convolutional transformer for hyperspectral image classification
    Gong, Zhiqiang
    Zhou, Xian
    Yao, Wen
    IET IMAGE PROCESSING, 2024, 18 (13) : 4328 - 4340
  • [9] WaveFormer: Spectral-Spatial Wavelet Transformer for Hyperspectral Image Classification
    Ahmad, Muhammad
    Ghous, Usman
    Usama, Muhammad
    Mazzara, Manuel
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [10] Spectral-Spatial Adaptive Weighted Fusion and Residual Dense Network for hyperspectral image classification
    Sun, Junding
    Zhang, Hongyuan
    Ma, Xiaoxiao
    Wang, Ruinan
    Sima, Haifeng
    Wang, Jianlong
    EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2025, 28 (01): : 21 - 33