Foundation Model-Based Spectral-Spatial Transformer for Hyperspectral Image Classification

被引:0
|
作者
Huang, Lingbo [1 ]
Chen, Yushi [1 ]
He, Xin [1 ]
机构
[1] Harbin Inst Technol, Sch Elect & Informat Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Deep learning; foundation model; hyperspectral image (HSI) classification; transformer; zero-shot learning (ZSL); NETWORK;
D O I
10.1109/TGRS.2024.3456129
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, deep learning models have dominated hyperspectral image (HSI) classification. Nowadays, deep learning is undergoing a paradigm shift with the rise of transformer-based foundation models. In this study, the potential of transformer-based foundation models, including the vision foundation model (VFM) and language foundation model (LFM), for HSI classification are investigated. First, to improve the performance of traditional HSI classification tasks, a spectral-spatial VFM-based transformer (SS-VFMT) is proposed, which inserts spectral-spatial information into the pretrained foundation transformer. Specifically, a given pretrained transformer receives HSI patch tokens for long-range feature extraction benefiting from the prelearned weights. Meanwhile, two enhancement modules, i.e., spatial and spectral enhancement modules (SpaEMs/ SpeEMs), utilize spectral and spatial information for steering the behavior of the transformer. Besides, an additional patch relationship distillation strategy is designed for SS-VFMT to exploit the pretrained knowledge better, leading to the proposed SS-VFMT-D. Second, based on SS-VFMT, to address a new HSI classification task, i.e., generalized zero-shot classification, a spectral-spatial vision-language-based transformer (SS-VLFMT) is proposed. This task is to recognize novel classes not seen during training, which is more meaningful as the real world is usually open. The SS-VLFMT leverages SS-VFMT to extract spectral-spatial features and corresponding hash codes while integrating a pretrained language model to extract text features from class names. Experimental results on HSI datasets reveal that the proposed methods are competitive compared to the state-of-the-art methods. Moreover, the foundation model-based methods open a new window for HSI classification tasks, especially for HSI zero-shot classification.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Interactive Spectral-Spatial Transformer for Hyperspectral Image Classification
    Song L.
    Feng Z.
    Yang S.
    Zhang X.
    Jiao L.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (09) : 1 - 1
  • [2] A Light-Weighted Spectral-Spatial Transformer Model for Hyperspectral Image Classification
    Arshad, Tahir
    Zhang, Junping
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 12008 - 12019
  • [3] Spectral-Spatial Morphological Attention Transformer for Hyperspectral Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Shah, Chiranjibi
    Haut, Juan M.
    Du, Qian
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [4] A Spectral-Spatial Fusion Transformer Network for Hyperspectral Image Classification
    Liao, Diling
    Shi, Cuiping
    Wang, Liguo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Spectral-Spatial Morphological Attention Transformer for Hyperspectral Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Shah, Chiranjibi
    Haut, Juan M.
    Du, Qian
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] WaveFormer: Spectral-Spatial Wavelet Transformer for Hyperspectral Image Classification
    Ahmad, Muhammad
    Ghous, Usman
    Usama, Muhammad
    Mazzara, Manuel
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [7] A multi-range spectral-spatial transformer for hyperspectral image classification
    Zhang, Lan
    Wang, Yang
    Yang, Linzi
    Chen, Jianfeng
    Liu, Zijie
    Wang, Jihong
    Bian, Lifeng
    Yang, Chen
    INFRARED PHYSICS & TECHNOLOGY, 2023, 135
  • [8] Adaptive Learnable Spectral-Spatial Fusion Transformer for Hyperspectral Image Classification
    Wang, Minhui
    Sun, Yaxiu
    Xiang, Jianhong
    Sun, Rui
    Zhong, Yu
    REMOTE SENSING, 2024, 16 (11)
  • [9] Spectral-Spatial Attention Transformer with Dense Connection for Hyperspectral Image Classification
    Dang, Lanxue
    Weng, Libo
    Dong, Weichuan
    Li, Shenshen
    Hou, Yane
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] SPECTRAL-SPATIAL TRANSFORMER FOR HYPERSPECTRAL IMAGE SHARPENING
    Chen, Lihui
    Vivone, Gemine
    Qin, Jiayi
    Chanussot, Jocelyn
    Yang, Xiaomin
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1452 - 1455